NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F084560


Overview

Basic Information
Family ID F084560
Family Type Metagenome
Number of Sequences 112
Average Sequence Length 274 residues
Representative Sequence MARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Number of Associated Samples 96
Number of Associated Scaffolds 112

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 52.68 %
% of genes near scaffold ends (potentially truncated) 50.00 %
% of genes from short scaffolds (< 2000 bps) 60.71 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction No
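
The percentages above are consistent with simple counts over the 112 family members: 59/112 = 52.68 %, 56/112 = 50.00 %, and 68/112 = 60.71 %. As a minimal sketch, assuming the short-scaffold figure is the share of scaffolds whose length is strictly below 2000 bp (the database's exact procedure is not stated here), the calculation could look like this. The function name `percent_below` is hypothetical, and the input is only the first five lengths from the Associated Scaffolds table, not the full family.

```python
def percent_below(lengths, threshold=2000):
    """Return the percentage of scaffold lengths strictly below `threshold` (bp)."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    short = sum(1 for n in lengths if n < threshold)
    return 100.0 * short / len(lengths)


# First five scaffold lengths (bp) from the Associated Scaffolds table.
# This is an illustrative subset, not the whole family.
sample_lengths = [799, 2112, 1569, 796, 998]

# 4 of the 5 lengths are below 2000 bp.
print(f"{percent_below(sample_lengths):.2f} %")  # prints "80.00 %"
```

Running the same function over all 112 scaffold lengths would give a family-wide figure to compare against the 60.71 % reported above.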


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (33.929 % of family members)
Environment Ontology (ENVO) Unclassified (48.214 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (46.429 % of family members)





Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 33.69%    β-sheet: 13.12%    Coil/Unstructured: 53.19%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 112 Family Scaffolds
PF05258    DciA    11.61
PF13640    2OG-FeII_Oxy_3    9.82
PF00069    Pkinase    3.57
PF13649    Methyltransf_25    3.57
PF02585    PIG-L    2.68
PF00856    SET    2.68
PF01590    GAF    0.89
PF04963    Sigma54_CBD    0.89
PF13578    Methyltransf_24    0.89
PF13661    2OG-FeII_Oxy_4    0.89
PF16400    DUF5008    0.89
PF01979    Amidohydro_1    0.89
PF13432    TPR_16    0.89
PF04552    Sigma54_DBD    0.89
PF13620    CarboxypepD_reg    0.89
PF00343    Phosphorylase    0.89
PF00027    cNMP_binding    0.89
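
Because the frequencies are given as a percentage of the 112 family scaffolds, each value can be turned back into an approximate scaffold count. The sketch below assumes the percentage is simply count / 112 rounded to two decimals, which is an assumption rather than a documented rule. The function name `scaffold_count` is hypothetical.

```python
def scaffold_count(percent, total=112):
    """Convert a '% frequency' value back to the nearest whole scaffold count."""
    return round(percent / 100 * total)


# A few Pfam rows from the table above: name -> % frequency.
pfam_frequency = {
    "DciA": 11.61,
    "2OG-FeII_Oxy_3": 9.82,
    "Pkinase": 3.57,
    "PIG-L": 2.68,
    "GAF": 0.89,
}

for name, percent in pfam_frequency.items():
    print(name, scaffold_count(percent))
# DciA 13
# 2OG-FeII_Oxy_3 11
# Pkinase 4
# PIG-L 3
# GAF 1
```

Under that assumption, every 0.89 entry corresponds to a domain seen next to the family on a single scaffold.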

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 112 Family Scaffolds
COG0515    Serine/threonine protein kinase    Signal transduction mechanisms [T]    14.29
COG5512    Predicted nucleic acid-binding protein, contains Zn-ribbon domain (includes truncated derivatives)    General function prediction only [R]    11.61
COG2120    N-acetylglucosaminyl deacetylase, LmbE family    Carbohydrate transport and metabolism [G]    2.68
COG1508    DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog    Transcription [K]    1.79
COG0058    Glucan phosphorylase    Carbohydrate transport and metabolism [G]    0.89



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
All Organisms    root    All Organisms    100.00 %
Unclassified    root    N/A    0.00 %


Associated Scaffolds


Scaffold    Taxonomy    Length (bp)
3300000787|JGI11643J11755_11649117    All Organisms → cellular organisms → Bacteria    799
3300001867|JGI12627J18819_10034121    All Organisms → cellular organisms → Bacteria    2112
3300003990|Ga0055455_10022171    All Organisms → cellular organisms → Bacteria    1569
3300004479|Ga0062595_100745303    All Organisms → cellular organisms → Bacteria    796
3300005166|Ga0066674_10182167    All Organisms → cellular organisms → Bacteria    998
3300005171|Ga0066677_10148634    All Organisms → cellular organisms → Bacteria    1282
3300005172|Ga0066683_10081212    All Organisms → cellular organisms → Bacteria    1953
3300005180|Ga0066685_10043059    All Organisms → cellular organisms → Bacteria    2871
3300005180|Ga0066685_10217158    All Organisms → cellular organisms → Bacteria    1314
3300005181|Ga0066678_10128820    All Organisms → cellular organisms → Bacteria    1563
3300005187|Ga0066675_10090000    All Organisms → cellular organisms → Bacteria    2001
3300005332|Ga0066388_100098716    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    3376
3300005332|Ga0066388_101378955    All Organisms → cellular organisms → Bacteria    1223
3300005332|Ga0066388_103191505    All Organisms → cellular organisms → Bacteria    838
3300005353|Ga0070669_100049012    All Organisms → cellular organisms → Bacteria    3084
3300005354|Ga0070675_100206818    All Organisms → cellular organisms → Bacteria    1705
3300005445|Ga0070708_100131782    All Organisms → cellular organisms → Bacteria    2314
3300005536|Ga0070697_100072331    All Organisms → cellular organisms → Bacteria    2829
3300005553|Ga0066695_10574850    All Organisms → cellular organisms → Bacteria    680
3300005558|Ga0066698_10253032    All Organisms → cellular organisms → Bacteria    1216
3300005598|Ga0066706_10158488    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1709
3300005764|Ga0066903_100413622    All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis    2228
3300005937|Ga0081455_10007759    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    11253
3300006046|Ga0066652_100196858    All Organisms → cellular organisms → Bacteria    1731
3300006046|Ga0066652_101480577    All Organisms → cellular organisms → Bacteria    631
3300006175|Ga0070712_100030516    All Organisms → cellular organisms → Bacteria    3623
3300006791|Ga0066653_10167271    All Organisms → cellular organisms → Bacteria    1089
3300006796|Ga0066665_10108468    All Organisms → cellular organisms → Bacteria    2048
3300006797|Ga0066659_10113431    All Organisms → cellular organisms → Bacteria    1867
3300006797|Ga0066659_10278272    All Organisms → cellular organisms → Bacteria    1266
3300007255|Ga0099791_10000283    All Organisms → cellular organisms → Bacteria    18062
3300009012|Ga0066710_101424205    All Organisms → cellular organisms → Bacteria    1073
3300009137|Ga0066709_100894988    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1293
3300010361|Ga0126378_11600437    All Organisms → cellular organisms → Bacteria    740
3300012198|Ga0137364_10216586    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1407
3300012198|Ga0137364_10527810    All Organisms → cellular organisms → Bacteria    888
3300012200|Ga0137382_10019474    All Organisms → cellular organisms → Bacteria    3859
3300012201|Ga0137365_10066358    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    2723
3300012201|Ga0137365_10122365    All Organisms → cellular organisms → Bacteria    1957
3300012202|Ga0137363_10015937    All Organisms → cellular organisms → Bacteria    4931
3300012204|Ga0137374_10465347    All Organisms → cellular organisms → Bacteria    990
3300012205|Ga0137362_10035773    All Organisms → cellular organisms → Bacteria    3966
3300012205|Ga0137362_10075062    All Organisms → cellular organisms → Bacteria    2807
3300012206|Ga0137380_10324253    All Organisms → cellular organisms → Bacteria    1377
3300012209|Ga0137379_10117504    All Organisms → cellular organisms → Bacteria    2558
3300012210|Ga0137378_10055285    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    3578
3300012285|Ga0137370_10002313    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    8189
3300012285|Ga0137370_10182177    All Organisms → cellular organisms → Bacteria    1226
3300012349|Ga0137387_10162608    All Organisms → cellular organisms → Bacteria    1594
3300012350|Ga0137372_10018407    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    6586
3300012351|Ga0137386_10181250    All Organisms → cellular organisms → Bacteria    1511
3300012353|Ga0137367_10025160    All Organisms → cellular organisms → Bacteria    4623
3300012353|Ga0137367_10288462    All Organisms → cellular organisms → Bacteria    1176
3300012354|Ga0137366_10001522    All Organisms → cellular organisms → Bacteria    17218
3300012354|Ga0137366_10516910    All Organisms → cellular organisms → Bacteria    861
3300012356|Ga0137371_10164418    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1741
3300012357|Ga0137384_10377073    All Organisms → cellular organisms → Bacteria    1173
3300012359|Ga0137385_10315230    All Organisms → cellular organisms → Bacteria    1343
3300012361|Ga0137360_10001183    All Organisms → cellular organisms → Bacteria    15281
3300012362|Ga0137361_10029066    All Organisms → cellular organisms → Bacteria    4353
3300012582|Ga0137358_10009592    All Organisms → cellular organisms → Bacteria    5987
3300012582|Ga0137358_10041536    All Organisms → cellular organisms → Bacteria    3047
3300012923|Ga0137359_10031140    All Organisms → cellular organisms → Bacteria    4581
3300012929|Ga0137404_10047038    All Organisms → cellular organisms → Bacteria    3287
3300012929|Ga0137404_11212591    All Organisms → cellular organisms → Bacteria    694
3300012930|Ga0137407_10045592    All Organisms → cellular organisms → Bacteria    3530
3300012930|Ga0137407_10069566    All Organisms → cellular organisms → Bacteria    2927
3300012958|Ga0164299_10207508    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1138
3300012986|Ga0164304_10500944    All Organisms → cellular organisms → Bacteria    887
3300015053|Ga0137405_1005827    All Organisms → cellular organisms → Bacteria    3665
3300015053|Ga0137405_1271353    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1172
3300015264|Ga0137403_10243683    All Organisms → cellular organisms → Bacteria    1711
3300015371|Ga0132258_13501504    All Organisms → cellular organisms → Bacteria    1075
3300015374|Ga0132255_101713030    All Organisms → cellular organisms → Bacteria    954
3300017654|Ga0134069_1085654    All Organisms → cellular organisms → Bacteria    1018
3300018071|Ga0184618_10043510    All Organisms → cellular organisms → Bacteria    1604
3300018433|Ga0066667_10009889    All Organisms → cellular organisms → Bacteria    4524
3300018482|Ga0066669_10491992    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1060
3300018482|Ga0066669_10657275    All Organisms → cellular organisms → Bacteria    922
3300019879|Ga0193723_1010222    All Organisms → cellular organisms → Bacteria    3027
3300019885|Ga0193747_1006650    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    2799
3300019886|Ga0193727_1044425    All Organisms → cellular organisms → Bacteria    1459
3300019887|Ga0193729_1004709    All Organisms → cellular organisms → Bacteria    6595
3300019888|Ga0193751_1048722    All Organisms → cellular organisms → Bacteria    1838
3300020001|Ga0193731_1105058    All Organisms → cellular organisms → Bacteria    725
3300020002|Ga0193730_1094923    All Organisms → cellular organisms → Bacteria    834
3300020004|Ga0193755_1124917    All Organisms → cellular organisms → Bacteria    799
3300020170|Ga0179594_10007147    All Organisms → cellular organisms → Bacteria    2948
3300021080|Ga0210382_10056697    All Organisms → cellular organisms → Bacteria    1554
3300021344|Ga0193719_10031351    All Organisms → cellular organisms → Bacteria    2291
3300021418|Ga0193695_1000859    All Organisms → cellular organisms → Bacteria    4948
3300022694|Ga0222623_10006297    All Organisms → cellular organisms → Bacteria    4206
3300025315|Ga0207697_10294320    All Organisms → cellular organisms → Bacteria    720
3300025922|Ga0207646_10007754    All Organisms → cellular organisms → Bacteria    10863
3300025923|Ga0207681_10007635    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    6629
3300025926|Ga0207659_10386588    All Organisms → cellular organisms → Bacteria    1168
3300025972|Ga0207668_10624771    All Organisms → cellular organisms → Bacteria    940
3300026322|Ga0209687_1108628    All Organisms → cellular organisms → Bacteria    897
3300026323|Ga0209472_1079380    All Organisms → cellular organisms → Bacteria    1338
3300026326|Ga0209801_1064155    All Organisms → cellular organisms → Bacteria    1641
3300026327|Ga0209266_1136634    All Organisms → cellular organisms → Bacteria    1017
3300026540|Ga0209376_1047363    All Organisms → cellular organisms → Bacteria    2507
3300027502|Ga0209622_1010131    All Organisms → cellular organisms → Bacteria    1574
3300027548|Ga0209523_1024651    All Organisms → cellular organisms → Bacteria    1189
3300028381|Ga0268264_10700510    All Organisms → cellular organisms → Bacteria    1006
3300028768|Ga0307280_10072546    All Organisms → cellular organisms → Bacteria    1110
3300031446|Ga0170820_17309100    All Organisms → cellular organisms → Bacteria    844
3300031720|Ga0307469_10337740    All Organisms → cellular organisms → Bacteria    1259
3300031945|Ga0310913_10447030    All Organisms → cellular organisms → Bacteria    918
3300032001|Ga0306922_10953954    All Organisms → cellular organisms → Bacteria    888
3300032180|Ga0307471_100001231    All Organisms → cellular organisms → Bacteria    13929
3300032205|Ga0307472_100825097    All Organisms → cellular organisms → Bacteria    850
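
Each scaffold entry above appears to pair a numeric identifier with a scaffold name, joined by `|`. The numeric part matches the Taxon OID values listed under Associated Samples. The following is a minimal sketch of splitting such an identifier and breaking a taxonomy lineage into its ranks. The names `ScaffoldRef`, `parse_scaffold`, and `lineage_ranks` are hypothetical helpers introduced for illustration, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import List, NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric sample identifier before the "|"
    scaffold_id: str  # scaffold name after the "|"


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold ID>' into its two parts."""
    taxon_oid, sep, scaffold_id = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


def lineage_ranks(taxonomy: str) -> List[str]:
    """Split an arrow-separated lineage into a list of rank names."""
    return [part.strip() for part in taxonomy.split("→")]


ref = parse_scaffold("3300005332|Ga0066388_100098716")
print(ref.taxon_oid)    # 3300005332
print(ref.scaffold_id)  # Ga0066388_100098716

ranks = lineage_ranks(
    "All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia"
)
print(ranks[-1])  # Verrucomicrobia
```

Keeping the two identifier parts separate makes it straightforward to look up a scaffold's sample by its Taxon OID.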




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    33.93%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    18.75%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    11.61%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    4.46%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil    3.57%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere    3.57%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    2.68%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    2.68%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere    2.68%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    1.79%
Miscanthus Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere    1.79%
Groundwater Sediment    Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment    0.89%
Natural And Restored Wetlands    Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands    0.89%
Groundwater Sediment    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment    0.89%
Groundwater Sediment    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment    0.89%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil    0.89%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil    0.89%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil    0.89%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    0.89%
Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil    0.89%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    0.89%
Soil    Environmental → Terrestrial → Soil → Loam → Grasslands → Soil    0.89%
Tabebuia Heterophylla Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere    0.89%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere    0.89%
Corn, Switchgrass And Miscanthus Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere    0.89%




Associated Samples

Taxon OID    Sample Name    Habitat Type
3300000787    Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil    Environmental
3300001867    Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)    Environmental
3300003990    Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Goodyear_PhragC_D2    Environmental
3300004479    Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs    Environmental
3300005166    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123    Environmental
3300005171    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126    Environmental
3300005172    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132    Environmental
3300005180    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134    Environmental
3300005181    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127    Environmental
3300005187    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124    Environmental
3300005332    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)    Environmental
3300005353    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG    Host-Associated
3300005354    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG    Host-Associated
3300005445    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG    Environmental
3300005536    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG    Environmental
3300005553    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144    Environmental
3300005558    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147    Environmental
3300005598    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155    Environmental
3300005764    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)    Environmental
3300005937    Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1    Host-Associated
3300006046    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101    Environmental
3300006175    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG    Environmental
3300006791    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102    Environmental
3300006796    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114    Environmental
3300006797    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108    Environmental
3300007255    Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1    Environmental
3300009012    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159    Environmental
3300009137    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158    Environmental
3300010361    Tropical forest soil microbial communities from Panama - MetaG Plot_23    Environmental
3300012198    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG    Environmental
3300012200    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG    Environmental
3300012201    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012204    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG    Environmental
3300012205    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG    Environmental
3300012206    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG    Environmental
3300012209    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG    Environmental
3300012210    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG    Environmental
3300012285    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG    Environmental
3300012349    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG    Environmental
3300012350    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG    Environmental
3300012351    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG    Environmental
3300012353    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG    Environmental
3300012354    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG    Environmental
3300012356    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG    Environmental
3300012357    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG    Environmental
3300012359    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG    Environmental
3300012361    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG    Environmental
3300012362    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG    Environmental
3300012582    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG    Environmental
3300012923    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG    Environmental
3300012929    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)    Environmental
3300012930    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)    Environmental
3300012958    Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG    Environmental
3300012986    Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG    Environmental
3300015053    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)    Environmental
3300015264    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)    Environmental
3300015371    Combined assembly of cpr5 and col0 rhizosphere and soil    Host-Associated
3300015374    Col-0 rhizosphere combined assembly    Host-Associated
3300017654    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG    Environmental
3300018071    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1    Environmental
3300018433    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116    Environmental
3300018482    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118    Environmental
3300019879    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2    Environmental
3300019885    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2    Environmental
3300019886    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2    Environmental
3300019887    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2    Environmental
3300019888    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1c2    Environmental
3300020001    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2    Environmental
3300020002    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1    Environmental
3300020004    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2    Environmental
3300020170    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)    Environmental
3300021080    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo    Environmental
3300021344    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2    Environmental
3300021418    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2    Environmental
3300022694    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex    Environmental
3300025315    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)    Host-Associated
3300025922    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)    Environmental
3300025923    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)    Host-Associated
3300025926    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG (SPAdes)    Host-Associated
3300025972    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)    Host-Associated
3300026322    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)    Environmental
3300026323    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)    Environmental
3300026326    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)    Environmental
3300026327    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)    Environmental
3300026540    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)    Environmental
3300027502    Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M2 (SPAdes)    Environmental
3300027548    Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M1 (SPAdes)    Environmental
3300028381    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)    Host-Associated
3300028768    Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119    Environmental
3300031446    Fir Summer Coassembly Site 11 - Champenoux / Amance forest    Environmental
3300031720    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515    Environmental
3300031945    Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082    Environmental
3300032001    Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)    Environmental
3300032180    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515    Environmental
3300032205    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05    Environmental

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI11643J11755_1164911713300000787SoilLKSVPKAKFLRCAAHHQADPDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDCGSKWRRDFQEIAHAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHMMNHNYWPDAMVDR
JGI12627J18819_1003412133300001867Forest SoilMIHVPSSLLARHSSVEALAERRDRCKALRADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKRELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWSPXAVAQQGTILAELLSQIREARDFYSKFH*
Ga0055455_1002217133300003990Natural And Restored WetlandsMPGLAPHYRIVVCAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPTYRITSNGDDYPPRNTAGTLLHAAELCSAKNEFIVLCDPDMIFVRQPDFSRSLSGEYYSYVSYDRKPVRRAAKKIGIRLEMLDRQEEELCCGVPYVIPVAAAKQLAEAWLQAIDEVSPREWVDQMRAFGLAVVKLGLRIKLTHMMNHNYWPNAMVNRDVIH
Ga0062595_10074530313300004479SoilVSAENSPYLAWQAKLFHFSCVNRLKRSPIVIVHDCGSKLRRDFQEIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHMMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTSRQARKVWSPAAVAQQGTILAELLSQIREARD
Ga0066674_1018216713300005166SoilNPCRRRSSSDARCTTRQTLDSYTVARSAVSIFDRDGIQLMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLKITLTHIMNHNYWPDAMVDRDVIHYSYGDKTWSKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREASDFYSKFH*
Ga0066677_1014863423300005171SoilMCQRYRIVVSAENSPYLAWQAKLFYFSCVSRLNHSPLIIVHDCGSRWRRDFQELADAGAMVSRVPSYRITSTGDDYLPRNTPGTLLHAAELCSAKDEYIVLCDPDMIFVRRPDFSSSLSGEYYGYVNYERKPVRRAAKQIGIRLEMLDRQKEELCCGVPYVIPVTAAKPLAEAWLQAIDAISRRQWEDQMRAFGLAVVKLGLRLTLTHMMNHNYWPNAVVDRDVIHYCYGDKMWNKRSFFTTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSTFHS*
Ga0066683_1008121213300005172SoilMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQA
Ga0066685_1004305933300005180SoilMCQRYRIVVSAENSPYLAWQAKLFYFSCVSRLNHSPLIIVHDCGSRWRRDFQELADAGAMVSRVPSYRITSTGDDYLPRNTPGTLLHAAELCSAKDEYIVLCDPDMIFVRRPDFSSSLSGEYYGYVNYERKPVRRAAKQIGIRLEMLDRQKEELCCGVPYVIPVTAAKPLAEAWLQAIDAISRRQWEDQMRAFGLAVVKLGLRLTLTHMMNHNYWPNAVVDRDVIHYCYGDKMWNKRSFFTTRQARKVWSPAAPAEEGTILAELLSQIREARDFYSTFHS*
Ga0066685_1021715813300005180SoilVPQAKYLRCAAHHQADPDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTT
Ga0066678_1012882043300005181SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSTWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIH
Ga0066675_1009000023300005187SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDCGSRWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFVDQPDFSGNLSGEYYGYMKYDRNPVRRAAKKIGIRLEILARQKEELCCGVPYVIPAAAAKRLAEAWLQAIDAFSPRRWEDQMQAFGLAVVKLGLRVTLTKIMNHNYWPDATVDRDVIHYCYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTVLAELLSQIREARDFYSKFH*
Ga0066388_10009871613300005332Tropical Forest SoilIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLTRNTAGTLLHAAELCSAQDEFIVFCDPDMIFVRQPDFSRSLSGEYYDYVNYERKPVRRAAKRIGIRLEMLDRQKQELRCGVPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRVTLTHMMNHNYWPDTMVDRDVIHYCYGDKTWNKRNYFTARQAQKVWSSPAVAQQGTIVAELLSQIREARDFYSKFH*
Ga0066388_10137895513300005332Tropical Forest SoilMAGQTVPFSCVSRLNHSPIVVVHDCGSKWRPDFQELAGAGAIVSRAPSYRITANGDDYPPRNTAGTLLRAAQLCSAKDEFIVLCDPDMIFVRQPDFSGNLSGEYYGYLNYDRKPVRRAAKKIGIRLEMLDRQKEELCCGVPYVIPVAAAKQLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTYIMNHNHWPDAMLDRDIIHYCYGDKTWNKRSYFTTR
Ga0066388_10319150513300005332Tropical Forest SoilFRAIDIMIHVPSSFLARHSPVEALAERRDRCKALMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVTRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRSLSGEYYGYVNYDGKPVRRAAKKIGIGLEMLDRQKEELCCGVPYVIPVAAAKQLAEAWLQAIDEVSPREWQDVMGAFGLAVVKLGLRITLTHMMNHNYWPNALVNRDVIHYCYGDKTWNKRSYFTT
Ga0070669_10004901243300005353Switchgrass RhizosphereMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQIREARDFYSKFH*
Ga0070675_10020681813300005354Miscanthus RhizosphereTTRQTLDSYTVARSAVSIFDRDGIQLMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQIREARDFYSKFH*
Ga0070708_10013178223300005445Corn, Switchgrass And Miscanthus RhizosphereMCQRYRIVVSAENSPYLAWQAKLFYFSCVSRLNHSPLIIVHDCGSRWRRDFQELADAGAMVSRVPSYRITSTGDDYLPRNTPGTLLHAAELCSAKDEYIVLCDPDMIFVRRPDFSSSLSGEYYGYVNYERKPVRRAAKQIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLAEAWLQAIDAISRRQWEDQMRAFGLAVVKLGLRLTLTHMMNHNYWPNAAVDRDVIHYCYGDKMWNKRSFFTTRQARKVWSPAAPAEEGTILAELLSQIREARDFYSSFHS*
Ga0070697_10007233133300005536Corn, Switchgrass And Miscanthus RhizosphereMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVSRAPSYRITSNGNDYPPRNTAGTLLHAAELCSAKDEFLVLCDPDMIFVRQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAARRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTSQARKVWSSAAAAQQGTILAELLSQIREARDFYSRFR*
Ga0066695_1057485013300005553SoilARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVLKLGLRVKLTHMMNHNYWPDAMVDRDVIHY
Ga0066698_1025303213300005558SoilKYLRCAAHHQADPDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0066706_1015848833300005598SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVMLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSASQFFYKPEAT*
Ga0066903_10041362243300005764Tropical Forest SoilMLSQYFDGDEIQLMAELAPRYRIVVSAENSPYLAWQAKLFHFSCLSRLNRSPTVIVHACGSKWHRDFQELAGAGAVVRRAPSYRMTSNGDYYLCRNHAGTLLHAAELALAKDDFIVLCDPDMIFLRQPDFSTDLSGEYYGYVNYDRKPIRRAAKKIGIRIEMLDRQKEELCCGVPYVIPVAAAKPLAKAWLRAIDEVSPREWADDMGAFGLAVVKLGLSIRTTHMTNHNFWPNAIVDREVIHYCYGDKTWSKRDYFTTRQARKVWSPPAVAQQGTILAELLSQIREARDFYSKFH*
Ga0081455_1000775973300005937Tabebuia Heterophylla RhizosphereMARLARRYRIVVSAENNPYLAWQAKLFHFSCVSRLNHSPTVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLRAAQLCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYYGYLNYDREPVQRAAKKIGIRLEMLDRQKEELRCGVPYVIPLGIAKPLAEAWLQAIDAFSPRHWEDQMQAFGLAVVKLGLRITLSHIMNHNHWPDAMVDRDVIHYCYGDKTWNKRRYFTTAQARRIWSPAASVQKGTIRAELFAQIKEAREFYSRFH*
Ga0066652_10019685823300006046SoilVPQAKYLRCAAHHQADPDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSRSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTVLAELLSQIREARDFYSKFH*
Ga0066652_10148057713300006046SoilIVVHDCGSKWRRDFQELANAGAIVSRAPGYRITSNGDDYSPRNTAGTLLHAAELCSAQDEFIVLCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQAR
Ga0070712_10003051683300006175Corn, Switchgrass And Miscanthus RhizosphereMAELAPRYRIVVSAENSPYLAWQAKLFHFSCLSRLSRSPIVIVHACGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYLCRNIAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIGLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRIRLTHMTNFNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQA
Ga0066653_1016727113300006791SoilMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSKFH*
Ga0066665_1010846843300006796SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0066659_1011343123300006797SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSTWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSGFH*
Ga0066659_1027827213300006797SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAGVKLGLRVTLTHIVNQNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSAAVAAQQGTIF
Ga0099791_10000283103300007255Vadose Zone SoilMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0066710_10142420513300009012Grasslands SoilQAKLFHFSCVSRLNHSPIVIVHDCGSKWRRDFQELADAGATVSRAPSYRLTSNGDDYPPRNTAGTLLSAAQLCSAKDEFIVLCDPDMIFVGQPDFSGNLSGEYYGYVKYDRKPVRGAAKEIGIRLKMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLAQIREARDFYSRFH
Ga0066709_10089498833300009137Grasslands SoilSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHDCGSKWRRDFQELADAGATVSRAPSYRITSNGDDYPPRNTAGTLLSAAQLCSAKDEFIVLCDPDMIFVGQPDFSGNLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWQDQMPAFGLAVVKLGLRVTLTHMMNHNYWPDTMVDRDVIHYCYGDKTWNKRSYLTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0126378_1160043713300010361Tropical Forest SoilLSRLNRSPTVIVHACGSKWHRDFQELAGAGAVVRRAPSYRMTSNGDYYLCRNHAGTLLHAAELALAKDDFIVLCDPDMIFLRQPDFSTDLSGEYYGYVNYDRKPIRRAAKKIGIRIEMLDRQKEELCCGVPYVIPVAAAKPLAKAWLRAIDEVSPREWADDMGAFGLAVVKLGLSIRTTHMTNHNFWPNAIVDREVIHYCYGDKTWSKRDYFTTRQARKVWSPPAVAQQGTILAELLSQIREARDF
Ga0137364_1021658613300012198Vadose Zone SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCISRLNHSPIVIVHDRGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGNKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137364_1052781013300012198Vadose Zone SoilMARLARGYRIVVSAENSPYLAWQAKLVHFSCVSRLNHSPIVIVHECGSRWLRDFQELADAGAIVSRVPSYCITPHGEDYPPRNTAGTLLYAAQLCSAKDEFIVLCDPDMIFMRRPSFSRNLSGEYYGCLNYDRKPVRRAAKKIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRRWEDQMHAFGLAVVKVGLRVTLTHMMNLNYWPDAMVDRDVIHYGYGDKTWNKRSYFTTRQARKVWSPAAVAQQGTIR
Ga0137382_1001947443300012200Vadose Zone SoilLADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137365_1006635823300012201Vadose Zone SoilMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLQMLDRPKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0137365_1012236513300012201Vadose Zone SoilIFVGDGIQLMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137363_1001593713300012202Vadose Zone SoilLPCGVRAQLKSVPQAKFLRCAAHHQADPDSYTVARNDVSIFAGDGTQLMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137374_1046534713300012204Vadose Zone SoilYLAWQAKLFHFSCVSRLNLSPIVIVHHCGSKWRRDFQEIADAGAIVSRAPTYRITSNGDDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFVDQPDFSGNLSGEYYGYMKYDRKPVRGAAKKIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLAEAWLQAIDAFSPRRWEDQMHAFGLAVVKLGLRVTLTNIMNHNYWPDAMADRDVIHYCYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTIRAEVLAQIREARDFYSGFH*
Ga0137362_1003577313300012205Vadose Zone SoilLPCGVRAQLKSVPQAKFLRCAAHHQADPDSYTVARNDVSIFAGDGTQLMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELL
Ga0137362_1007506223300012205Vadose Zone SoilMAWQAKLFHFSCVSRLNCSPIVIVHDCGSKWRRDFQELVEAGAIVSRAPSYRRTSNGEDYPPRNTAGTLLCAAQLCSAKDEFIVLCDPDMIFLGQPEFSRSLSGEYYGYVKYDRKPVRGAAKEIGIRLDTLDRQRDELCCGVPYVIPVAAAKCLAEVWLQAIDALSPRRWEDMMHAFGLAVVKLGLSVSLTHMMNHNYWPDAIVDRDVIHYCYGDKTWSKRSYVTTRQAQKVWSSAAAAQQAPILAELLSQIKEARDFYSRFH*
Ga0137380_1032425313300012206Vadose Zone SoilVARNAVSIFDGDGIQLMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSKFH*
Ga0137379_1011750423300012209Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSTWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0137378_1005528543300012210Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHACGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0137370_1000231373300012285Vadose Zone SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRIADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGNKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137370_1018217713300012285Vadose Zone SoilVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQEIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLKMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAE
Ga0137387_1016260823300012349Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPPAVAQQGTILAELLSQIREARDFYSKFH*
Ga0137372_1001840753300012350Vadose Zone SoilVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0137386_1018125023300012351Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPPAVAQQGTILAELLSQIREARDFFSRSH*
Ga0137367_1002516033300012353Vadose Zone SoilVSIFAGDGIQVMARRYRIVVCAENSPYLAWQAKLFHFSCVSRLNLSPIVIVHHCGSKWRRDFQEIADAGAIVSRAPTYRITSNGDDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFVDQPDFSGNLSGEYYGYMKYDRKPVRGAAKKIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLAEAWLQAIDAFSPRRWEDQMHAFGLAVVKLGLRVTLTNIMNHNYWPDAMADRDVIHYCYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTIRAEVLAQIREARDFYSGFH*
Ga0137367_1028846213300012353Vadose Zone SoilMAELAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHACGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLA
Ga0137366_10001522163300012354Vadose Zone SoilMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKIH*
Ga0137366_1051691013300012354Vadose Zone SoilFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137371_1016441823300012356Vadose Zone SoilMSIFAGDGIQLMERLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRHVIHYA*
Ga0137384_1037707313300012357Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSKFH*
Ga0137385_1031523013300012359Vadose Zone SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVA
Ga0137360_1000118343300012361Vadose Zone SoilMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADVGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137361_1002906623300012362Vadose Zone SoilMAWQAKLFHFSCVSRLNCSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRRARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137358_1000959243300012582Vadose Zone SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHDRGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137358_1004153633300012582Vadose Zone SoilVPQAKFLRCAAHHQADLDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137359_1003114023300012923Vadose Zone SoilLPCGVRAQLKSVPQAKFLRCAAHHQADPDSYTVARNDVSIFAGDGTQLMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIGLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRRARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0137404_1004703823300012929Vadose Zone SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQDLADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT*
Ga0137404_1121259113300012929Vadose Zone SoilRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDWGSKLCRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNK
Ga0137407_1004559213300012930Vadose Zone SoilLPCGVRAQLKSVPQAKFLRCAAHHQADPDSYTVARNDVSIFAGDGTQLMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLHAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDV
Ga0137407_1006956633300012930Vadose Zone SoilMAWQAKLFHFSCVSRLNHSPIFIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYFARQFFYKPEAT*
Ga0164299_1020750813300012958SoilELADAGAIVSRAPSYRITSNGDYYPCRNHAGTLLQAAELFSAKDEFIVLCDPDMIFVRQPDLSRRLSGEYYGYVNYDRKPIRRAAKKIGIRLEMLDRQKKELCCGVPYVIPVAAAKQLAEAWLRAIDEVSPREWADDMSAFGLAVVKLGLRIRLTHMTNHNYWPNAMVDRDVIHYCYGDKTWSKRDYFTTRQARKVWSPPAVAQHGTILAELLSQIREARDFYSKFH*
Ga0164304_1050094423300012986SoilMMCQRYRIVVSAENSPYLAWQAKLFHFSCLSRLSRSPTVIVHACGSKWHRDFQELADAGAIVSRAPSYRITSNGDYYPCRNHAGTLLQAAELFSAKDEFIVLCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPIRRAAKKIGIRLEMLDRQKKELCCGVPYVIPVAAAKQLAEGWLRAIDEVSPREWADDMGAFGLAVVKLGLRIRLTHMTNHNYWPNAMVDRDVIHYCYGDKTWSKLNYFT
Ga0137405_100582743300015053Vadose Zone SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCISRLNHSPIVIVHDRGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYFARQFFYKPEAT*
Ga0137405_127135313300015053Vadose Zone SoilRHTSDSYCVAPNDVSIFVGDEIQLMARPARRYRIVVSAENSPYLAWQAKLFHFSCISRLNHSPIVIVHDRGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGVSLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFLPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYFARQFFYKPEAT*
Ga0137403_1024368323300015264Vadose Zone SoilCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH*
Ga0132258_1350150413300015371Arabidopsis RhizosphereMARLARGYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHEYGSKWRRDFQELADAGAIVSRVPSYCITPNGEDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFMRRPSFSRDLSGEYYDCLNYDRKPVRRAAKRIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLAEAWLQAIDAFSPRRWEDQMHAFGLAVVKVGLTLTLTHMMNLNYWPDAMVDRDVIHYGYGDKTWNK
Ga0132255_10171303013300015374Arabidopsis RhizosphereVARNDVSISGGDEVQVMARPARRYRIVVSAENTPYMAWQAKLFHFSCVSRLNHSPIFIVHEFGSKWRRDFQELADAGAIVSRAPSYRITSKGDDYAPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYYGQLTYDQKHVRRTARKIGISLERLDRRKEELCCGVPYVIPVTATKRIAKAWLQAIDAFSPQEWEDQMYAFGLAVVKVGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYVTTGQAQKVWLPAAAQQGTIRAELFSQIREAKDFFSPRQFFYKPEAT*
Ga0134069_108565423300017654Grasslands SoilMARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPCYRITSNGDDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYYGQMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTVAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYCYGDKTWNKRSYFTTRRARKIWSPTAAAQQGTILAELLSQIREARDFYSRFH
Ga0184618_1004351023300018071Groundwater SedimentMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRMTSNGDDYLCRNIAGTLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHIMNHNYWPDAMVDRDVIHYSYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKFH
Ga0066667_1000988953300018433Grasslands SoilMARPARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMIFVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVMLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSASQFFYKPEAT
Ga0066669_1049199223300018482Grasslands SoilRGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTVGTLLHAAELCSAKDEFIVLCDPDMICVRQPDFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYCARQFFYKPEAT
Ga0066669_1065727523300018482Grasslands SoilISRLNHSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSKFH
Ga0193723_101022223300019879SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRVTLTHMMNHNYWPNAPVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREARDFYSKFH
Ga0193747_100665023300019885SoilMIHVPSSFLARHSSVEALAERRDRCKALRADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVVVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLTRNTAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYNRKPVRRAAKKIGISLEMLDRQKEELCCGVPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAVVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWSPPAVAQQGTILAELLSQIREARDFYSRFD
Ga0193727_104442523300019886SoilMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHHCGSKWRREFQEIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLRAAQLCSAKDEFIVLCDPDMIFVGQPDFSGNLSGEYYGYMEYERKPVRGAAKEIGIGLEMLDRQKEELCCGVPYVIPVTAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPAAMVDRDIIHYCYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Ga0193729_100470973300019887SoilMARLAHRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAQSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Ga0193751_104872233300019888SoilVPQAKFPRCAAHHQADLDSYTVARNDVSIFAGDGIQLMARLAHRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLTVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQVRKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Ga0193731_110505813300020001SoilQEIADAGAIVRRAQSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Ga0193730_109492313300020002SoilDGIQLMARLAHRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAQSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSESLSGEYYGYMEYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSPPAVAQQGSILAELLSQIR
Ga0193755_112491713300020004SoilDVSIFAGDGIQLMARLAHRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAQSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQARKVWSP
Ga0179594_1000714723300020170Vadose Zone SoilMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQLDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTHQARKVWSPAAAAQQGTILAELLSQIREARDFYSRFH
Ga0210382_1005669723300021080Groundwater SedimentMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLQAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHIMNHNYWPDAMVDRDVIHYSYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKFH
Ga0193719_1003135133300021344SoilMARPARRYRIVVSAENSPYMAWQAKLFHFSCVSRLNHSPIVIVHERGSKWRRDFQEVADAGAIVSRAPSYSITSNGDDYAPRNSAGTLLHAAELCSAKDEFIVLCDPDMIFVRQPNFSRNLSGEYWGNMKYDQKQVRRTARKIGISLEMLDRQKEELCCGVPYVIPVTAAKRFADAWLQAIDAFSPRDWEDQMYAFGLAVVKLGLRVTLTHIVNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYSARQFFYKPEAT
Ga0193695_100085953300021418SoilMARLAHRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVRRAQSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSESLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQAQKVWSPPAVAQQGTILAELLSQIREARDFYSRFH
Ga0222623_1000629723300022694Groundwater SedimentMARRYRIVVCAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHHCGSKWRRDFQEIADAGAIVSRAPTYRITSNGDDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFVDQPDFSGNLSGEYYGYMKYDRKPVRRAAKKIGIRLEMLDRQKEELCCGVPYVIPVAAAKRLAEAWLQAIDAFSPRRWEDVMHAFGLAVVKLGLRVTLTNIMNHNYWPDATVDRDVIHYCYGDKTWNKRSYFTTRQARKVWSPAAAAQQGTVLAELLSQIREARDFYSKFH
Ga0207697_1029432013300025315Corn, Switchgrass And Miscanthus RhizosphereSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQI
Ga0207646_1000775433300025922Corn, Switchgrass And Miscanthus RhizosphereMCQRYRIVVSAENSPYLAWQAKLFYFSCVSRLNHSPLIIVHDCGSRWRRDFQELADAGAMVSRVPSYRITSTGDDYLPRNTPGTLLHAAELCSAKDEYIVLCDPDMIFVRRPDFSSSLSGEYYGYVNYERKPVRRAAKQIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLAEAWLQAIDAISRRQWEDQMRAFGLAVVKLGLRLTLTHMMNHNYWPNAAVDRDVIHYCYGDKMWNKRSFFTTRQARKVWSPAAPAEEGTILAELLSQIREARDFYSSFHS
Ga0207681_1000763563300025923Switchgrass RhizosphereMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQIREARDFYSKFH
Ga0207659_1038658813300025926Miscanthus RhizosphereARCTTRQTLDSYTVARSAVSIFDRDGIQLMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQIREARDFYSKFH
Ga0207668_1062477113300025972Switchgrass RhizosphereSAVSIFDRDGIQLMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYFTTRQAKKVWSPPAVAQQGTILAELLSQIREARDFYSKFH
Ga0209687_110862823300026322SoilMCQRYRIVVSAENSPYLAWQAKLFYFSCVSRLNHSPLIIVHDCGSRWRRDFQELADAGAMVSRVPSYRITSTGDDYLPRNTPGTLLHAAELCSAKDEYIVLCDPDMIFVRRPDFSSSLSGEYYGYVNYERKPVRRAAKQIGIRLEMLDRQKEELCCGVPYVIPVTAAKPLAEAWLQAIDAISRRQWEDQMRAFGLAVVKLGLRLTLTHMMNHNYWPNAVVDRDVIHYCYGDKMWNKRSFF
Ga0209472_107938023300026323SoilMIPLARRYQIVVSAENSPYMAWQAKLFHFSCVSRLNCSPIVIVHDCGSKWRRDFQELVEAGAIVSRAPSYRRTSNGEDYPPRNTAGTLLCAAQLCSAKDEFIVLCDPDMIFLGQPEFSRSLSGEYYGYVKYDRKPVRGAAKEIGIRLDTLDRQKDELCCGVPYVIPVAAAKCLAEVWLQAIDALSPRRWEDVMHAFGLAVVKLGLSVSLTHMLNHNYWPDAMVDRDVIHYAYGDKRWDKRNYETTRQARKVWSPAVAAQQGTIFAELLSQIREARDFYCARQFFYKPEAT
Ga0209801_106415533300026326SoilMADMAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSTWCRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIPGSLLHAAKLCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAA
Ga0209266_113663413300026327SoilVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLNVTLTHIMNHNYWPNAMVNRDVIHYCYGDKTWSKRSYFTTRQARKVWSPAAVAQQGTILAELLSQIREASDFYSKFH
Ga0209376_104736333300026540SoilVPQAKYLRCAAHHQADPDSYTVARNDVSIFAGDGIQLMARLARRYRIVVSAENSPYLAWQAKLFHFSCVSRLNRSPIVIVHDWGSKLRRDFQKIADAGAIVSRAPSYRITSNGDDYPPRNTAGTLLHAAELCSAKDEFIVLCDPDMIFVGQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTSQARKVWSPAAAAQQGTILAELLAQIREARDFYSRFH
Ga0209622_101013123300027502Forest SoilMIHVPSSLLARHSSVEALAERRDRCKALRADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKRELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWSPSAVAQQGTILAELLSQIREA
Ga0209523_102465123300027548Forest SoilMIHVPSSLLARHSSVEALAERRDRCKALRADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLCRNIAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKRELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQK
Ga0268264_1070051013300028381Switchgrass RhizosphereMADLAPRYRIVVSAENSPYLAWQAKLFHFSCVSRLSRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRMTSNGDDYVCRNIPGSLLHAAELCSARDEFIVFCDPDMIFVRQPDFSRNLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLDLNVTLTHIMNHNYWPNAMVDRDVIHYCYGDKTWSKRSYF
Ga0307280_1007254623300028768SoilMTRLARGYRIVVSAENSPYLAWQAKLFHFSCVSRLNHSPIVVVHEYGSKWRRDFQELADAGAIVSRVPSYCITPNGEDYPPRNTAGTLLHAAQLCSAKDEFIVLCDPDMIFMRRPSFSRDLSGEYYDCLNYDRKPVRRAAKRIGIRLEMLDRQKEELCCGVPYVIPVAAAKPLGEAWLQAIDAFSPRRWEDQMHAFGLAVVKVGLTLTLTHMMNLNYWPDAMVDRDVIHYGYGDKTWNKRSYFTTRQARKVWSPAAVAQQGTILAELLAQIREARNFYSRFH
Ga0170820_1730910013300031446Forest SoilMAELAPRYRIVVSAENSPYLAWQAKLFHFSCLTRLSRSPIVIVHACGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLPRNTAGTLLHAAELCSAQDEFIVLCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEASPREWDDVMGAFGLAVVKLGLRIRLTHMTNFNYWPNAMVNRDVI
Ga0307469_1033774023300031720Hardwood Forest SoilMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVSRAPSYRITSNGNDYPPRNTAGTLLHAAELCSAKDEFLVLCDPDMIFVRQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAARRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYFTTRQAQKVWSPPAVAQQGTILAELLAQIREARNFYSRFH
Ga0310913_1044703013300031945SoilMAELAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADTGAIVSRAPSYRITSNGDDYLCRNIAGTLLHAAELCSAKDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWS
Ga0306922_1095395413300032001SoilMAELAPRYRIVVSAENSPYLAWQAKLFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADTGAIVSRAPSYRITSNGDEFIVFCDPDMIFVRQPDFSRSLSGEYYGYVNYDRKPVRRAAKKIGIRLEMLDRQKKELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMMNHNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWS
Ga0307471_10000123163300032180Hardwood Forest SoilMARLARRYRIVVSAENSPYLAWQAKLFYFSCVSRLNRSPIVIVHDCGSKWRRDFQEIADAGAIVSRAPSYRITSNGNDYPPRNTAGTLLHAAELCSAKDEFLVLCDPDMIFVRQPDFSGSLSGEYYGYVKYDRKPVRGAAKEIGIRLEMLDRQKEELCCGVPYVIPVAAARRLADAWLQAIDAFSPRHWEDQMHAFGLAVVKLGLRVTLTHIMNHNYWPDAMVDRDVIHYAYGDKTWNKRSYLTTRQARKVWSSAAAAQQGTILAELLSQIREARDFYSRFR
Ga0307472_10082509723300032205Hardwood Forest SoilFHFSCVSRVGRSPIVIVHDCGSKWRRDFQELADAGAIVSRAPSYRITSNGDDYLTRNTAGTLLHAAELCSAQDEFIVLCDPDMIFVRQPDFSRSLSGDYYSYVNYDRKPVRRAAKRIGIRLEMLDRQKEELCCGPPYVIPVAAAKQLAEAWLRAIDEVSPREWDDVMGAFGLAVVKLGLRITLTHMTNFNYWPNAMVNRDVIHYCYGDKTWNKRSYFTTRQAQKVWSPPAVAQQGTILAELLSQIREARDFYSRFH




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"