NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F020670

Metagenome Family F020670

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F020670
Family Type Metagenome
Number of Sequences 222
Average Sequence Length 231 residues
Representative Sequence MKPRTALALFIVLVSTLAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNAVLISYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR
Number of Associated Samples 164
Number of Associated Scaffolds 222

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 31.98 %
% of genes near scaffold ends (potentially truncated) 45.95 %
% of genes from short scaffolds (< 2000 bps) 66.67 %
Associated GOLD sequencing projects 152
AlphaFold2 3D model prediction Yes
3D model pTM-score0.68

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (57.658 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(30.180 % of family members)
Environment Ontology (ENVO) Unclassified
(47.297 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(40.541 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 5.66%    β-sheet: 40.00%    Coil/Unstructured: 54.34%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.68
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 222 Family Scaffolds
PF00596Aldolase_II 1.80
PF02518HATPase_c 1.80
PF01207Dus 1.80
PF00583Acetyltransf_1 1.80
PF03781FGE-sulfatase 1.35
PF00689Cation_ATPase_C 0.90
PF00561Abhydrolase_1 0.90
PF00326Peptidase_S9 0.90
PF13304AAA_21 0.90
PF12804NTP_transf_3 0.90
PF00903Glyoxalase 0.90
PF14508GH97_N 0.90
PF10543ORF6N 0.90
PF13927Ig_3 0.45
PF13204DUF4038 0.45
PF12704MacB_PCD 0.45
PF10017Methyltransf_33 0.45
PF02543Carbam_trans_N 0.45
PF00295Glyco_hydro_28 0.45
PF07638Sigma70_ECF 0.45
PF07676PD40 0.45
PF00079Serpin 0.45
PF13533Biotin_lipoyl_2 0.45
PF01965DJ-1_PfpI 0.45
PF06964Alpha-L-AF_C 0.45
PF02622DUF179 0.45
PF01258zf-dskA_traR 0.45
PF07586HXXSHH 0.45
PF02082Rrf2 0.45
PF12974Phosphonate-bd 0.45
PF02643DUF192 0.45
PF00724Oxidored_FMN 0.45
PF01663Phosphodiest 0.45
PF03683UPF0175 0.45
PF00733Asn_synthase 0.45
PF13202EF-hand_5 0.45
PF14236DUF4338 0.45
PF13365Trypsin_2 0.45
PF01906YbjQ_1 0.45
PF11999Ice_binding 0.45
PF13495Phage_int_SAM_4 0.45
PF00702Hydrolase 0.45
PF00156Pribosyltran 0.45
PF14332DUF4388 0.45
PF00072Response_reg 0.45
PF09957VapB_antitoxin 0.45
PF12831FAD_oxidored 0.45
PF13847Methyltransf_31 0.45
PF05239PRC 0.45
PF12344UvrB 0.45
PF01734Patatin 0.45
PF08445FR47 0.45
PF08281Sigma70_r4_2 0.45
PF01050MannoseP_isomer 0.45
PF00400WD40 0.45
PF05840Phage_GPA 0.45
PF13200DUF4015 0.45
PF01594AI-2E_transport 0.45
PF01223Endonuclease_NS 0.45
PF11175DUF2961 0.45
PF03544TonB_C 0.45
PF13360PQQ_2 0.45
PF08388GIIM 0.45
PF01909NTP_transf_2 0.45
PF01477PLAT 0.45
PF00890FAD_binding_2 0.45
PF00828Ribosomal_L27A 0.45
PF00005ABC_tran 0.45
PF07592DDE_Tnp_ISAZ013 0.45
PF13591MerR_2 0.45
PF13361UvrD_C 0.45

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 222 Family Scaffolds
COG0042tRNA-dihydrouridine synthaseTranslation, ribosomal structure and biogenesis [J] 2.25
COG1262Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domainPosttranslational modification, protein turnover, chaperones [O] 1.35
COG0474Magnesium-transporting ATPase (P-type)Inorganic ion transport and metabolism [P] 0.90
COG1752Predicted acylesterase/phospholipase RssA, containd patatin domainGeneral function prediction only [R] 0.45
COG5434PolygalacturonaseCarbohydrate transport and metabolism [G] 0.45
COG4826Serine protease inhibitorPosttranslational modification, protein turnover, chaperones [O] 0.45
COG4667Predicted phospholipase, patatin/cPLA2 familyLipid transport and metabolism [I] 0.45
COG3621Patatin-like phospholipase/acyl hydrolase, includes sporulation protein CotRGeneral function prediction only [R] 0.45
COG3534Alpha-L-arabinofuranosidaseCarbohydrate transport and metabolism [G] 0.45
COG2886Predicted antitoxin, contains HTH domainGeneral function prediction only [R] 0.45
COG2524Predicted transcriptional regulator, contains C-terminal CBS domainsTranscription [K] 0.45
COG2378Predicted DNA-binding transcriptional regulator YobV, contains HTH and WYL domainsTranscription [K] 0.45
COG2192Predicted carbamoyl transferase, NodU familyGeneral function prediction only [R] 0.45
COG2188DNA-binding transcriptional regulator, GntR familyTranscription [K] 0.45
COG2186DNA-binding transcriptional regulator, FadR familyTranscription [K] 0.45
COG1959DNA-binding transcriptional regulator, IscR familyTranscription [K] 0.45
COG19022,4-dienoyl-CoA reductase or related NADH-dependent reductase, Old Yellow Enzyme (OYE) familyEnergy production and conversion [C] 0.45
COG1864DNA/RNA endonuclease G, NUC1Nucleotide transport and metabolism [F] 0.45
COG1734RNA polymerase-binding transcription factor DksATranscription [K] 0.45
COG1727Ribosomal protein L18ETranslation, ribosomal structure and biogenesis [J] 0.45
COG1725DNA-binding transcriptional regulator YhcF, GntR familyTranscription [K] 0.45
COG1678Putative transcriptional regulator, AlgH/UPF0301 familyTranscription [K] 0.45
COG1595DNA-directed RNA polymerase specialized sigma subunit, sigma24 familyTranscription [K] 0.45
COG1430Uncharacterized conserved membrane protein, UPF0127 familyFunction unknown [S] 0.45
COG1414DNA-binding transcriptional regulator, IclR familyTranscription [K] 0.45
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 0.45
COG0640DNA-binding transcriptional regulator, ArsR familyTranscription [K] 0.45
COG0628Predicted PurR-regulated permease PerMGeneral function prediction only [R] 0.45
COG0393Uncharacterized pentameric protein YbjQ, UPF0145 familyFunction unknown [S] 0.45


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A57.66 %
All OrganismsrootAll Organisms42.34 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2228664022|INPgaii200_c1005981All Organisms → cellular organisms → Bacteria5949Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101419764Not Available1002Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101691869All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium3783Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_105340537Not Available872Open in IMG/M
3300000789|JGI1027J11758_12435654Not Available818Open in IMG/M
3300000789|JGI1027J11758_12814979All Organisms → cellular organisms → Bacteria1545Open in IMG/M
3300001213|JGIcombinedJ13530_100537965Not Available852Open in IMG/M
3300004080|Ga0062385_10500452Not Available749Open in IMG/M
3300004092|Ga0062389_103266044Not Available607Open in IMG/M
3300004479|Ga0062595_100455369Not Available942Open in IMG/M
3300005180|Ga0066685_10389397Not Available968Open in IMG/M
3300005181|Ga0066678_10292680Not Available1064Open in IMG/M
3300005294|Ga0065705_10036371Not Available1144Open in IMG/M
3300005295|Ga0065707_10040433Not Available788Open in IMG/M
3300005333|Ga0070677_10346037Not Available769Open in IMG/M
3300005456|Ga0070678_100133854Not Available1974Open in IMG/M
3300005466|Ga0070685_10789558Not Available699Open in IMG/M
3300005526|Ga0073909_10093832Not Available1175Open in IMG/M
3300005534|Ga0070735_10023672All Organisms → cellular organisms → Bacteria4378Open in IMG/M
3300005537|Ga0070730_10465809Not Available813Open in IMG/M
3300005540|Ga0066697_10487762Not Available702Open in IMG/M
3300005553|Ga0066695_10150673All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Pycnogonida → Pantopoda → Nymphonidae → Nymphon → Nymphon striatum1448Open in IMG/M
3300005558|Ga0066698_10573756Not Available764Open in IMG/M
3300005569|Ga0066705_10137190Not Available1491Open in IMG/M
3300005577|Ga0068857_100000452All Organisms → cellular organisms → Bacteria29274Open in IMG/M
3300005598|Ga0066706_10578778Not Available892Open in IMG/M
3300005618|Ga0068864_100257739All Organisms → cellular organisms → Bacteria1621Open in IMG/M
3300005618|Ga0068864_101307560Not Available725Open in IMG/M
3300005764|Ga0066903_100046224All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium5227Open in IMG/M
3300005764|Ga0066903_100263796Not Available2677Open in IMG/M
3300005764|Ga0066903_101017207All Organisms → cellular organisms → Bacteria1517Open in IMG/M
3300005836|Ga0074470_10258989All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula → Pedosphaera parvula Ellin5142466Open in IMG/M
3300005994|Ga0066789_10366731Not Available602Open in IMG/M
3300006796|Ga0066665_10509279Not Available985Open in IMG/M
3300006904|Ga0075424_102068774Not Available600Open in IMG/M
3300007255|Ga0099791_10368268Not Available690Open in IMG/M
3300009012|Ga0066710_100759414Not Available1483Open in IMG/M
3300009012|Ga0066710_101023326All Organisms → cellular organisms → Bacteria1275Open in IMG/M
3300009012|Ga0066710_101347852Not Available1107Open in IMG/M
3300009012|Ga0066710_101429106Not Available1071Open in IMG/M
3300009012|Ga0066710_102350779Not Available774Open in IMG/M
3300009029|Ga0066793_10599323Not Available628Open in IMG/M
3300009038|Ga0099829_10658841Not Available870Open in IMG/M
3300009088|Ga0099830_10600764Not Available902Open in IMG/M
3300009137|Ga0066709_100485727All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1736Open in IMG/M
3300009137|Ga0066709_100567282All Organisms → cellular organisms → Bacteria1611Open in IMG/M
3300009177|Ga0105248_11110619Not Available894Open in IMG/M
3300009519|Ga0116108_1189453Not Available605Open in IMG/M
3300009615|Ga0116103_1039382All Organisms → cellular organisms → Bacteria1388Open in IMG/M
3300009700|Ga0116217_10202927Not Available1302Open in IMG/M
3300009760|Ga0116131_1116332Not Available790Open in IMG/M
3300009760|Ga0116131_1153205Not Available662Open in IMG/M
3300010341|Ga0074045_10259241All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1148Open in IMG/M
3300010343|Ga0074044_10074481All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2296Open in IMG/M
3300010343|Ga0074044_10537529Not Available763Open in IMG/M
3300010362|Ga0126377_11247455Not Available814Open in IMG/M
3300010397|Ga0134124_10009044All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia8032Open in IMG/M
3300010397|Ga0134124_10722098Not Available988Open in IMG/M
3300010398|Ga0126383_10685046Not Available1102Open in IMG/M
3300010401|Ga0134121_10286040All Organisms → cellular organisms → Bacteria1454Open in IMG/M
3300011269|Ga0137392_10764566Not Available799Open in IMG/M
3300011269|Ga0137392_10944267Not Available709Open in IMG/M
3300011271|Ga0137393_10352677Not Available1257Open in IMG/M
3300011423|Ga0137436_1075148Not Available879Open in IMG/M
3300011444|Ga0137463_1089272All Organisms → cellular organisms → Bacteria1158Open in IMG/M
3300012096|Ga0137389_10700877Not Available870Open in IMG/M
3300012189|Ga0137388_10037199All Organisms → cellular organisms → Bacteria3847Open in IMG/M
3300012198|Ga0137364_10339295All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1120Open in IMG/M
3300012199|Ga0137383_10688513Not Available747Open in IMG/M
3300012199|Ga0137383_10773755Not Available701Open in IMG/M
3300012201|Ga0137365_10459466All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes937Open in IMG/M
3300012203|Ga0137399_10811444Not Available788Open in IMG/M
3300012204|Ga0137374_10216346Not Available1635Open in IMG/M
3300012205|Ga0137362_10308058Not Available1371Open in IMG/M
3300012205|Ga0137362_10388100Not Available1209Open in IMG/M
3300012205|Ga0137362_10807134Not Available804Open in IMG/M
3300012205|Ga0137362_10955564Not Available731Open in IMG/M
3300012206|Ga0137380_10092596All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Caballeronia → Caballeronia terrestris2762Open in IMG/M
3300012206|Ga0137380_10650284Not Available918Open in IMG/M
3300012206|Ga0137380_10914258Not Available753Open in IMG/M
3300012209|Ga0137379_10037754All Organisms → cellular organisms → Bacteria4660Open in IMG/M
3300012209|Ga0137379_10416467All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1253Open in IMG/M
3300012209|Ga0137379_10919097Not Available780Open in IMG/M
3300012209|Ga0137379_11306371Not Available631Open in IMG/M
3300012210|Ga0137378_10032545All Organisms → cellular organisms → Bacteria4627Open in IMG/M
3300012210|Ga0137378_10231524All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium 13_2_20CM_55_101720Open in IMG/M
3300012211|Ga0137377_10033587All Organisms → cellular organisms → Bacteria4669Open in IMG/M
3300012349|Ga0137387_10100450All Organisms → cellular organisms → Bacteria2016Open in IMG/M
3300012350|Ga0137372_10384198Not Available1067Open in IMG/M
3300012351|Ga0137386_10350125All Organisms → cellular organisms → Bacteria1063Open in IMG/M
3300012351|Ga0137386_10809197Not Available673Open in IMG/M
3300012354|Ga0137366_10095745All Organisms → cellular organisms → Bacteria2251Open in IMG/M
3300012354|Ga0137366_10108256Not Available2104Open in IMG/M
3300012360|Ga0137375_10713304Not Available817Open in IMG/M
3300012361|Ga0137360_10013063All Organisms → cellular organisms → Bacteria5346Open in IMG/M
3300012361|Ga0137360_10107637All Organisms → cellular organisms → Bacteria2146Open in IMG/M
3300012361|Ga0137360_10402292All Organisms → cellular organisms → Bacteria1154Open in IMG/M
3300012361|Ga0137360_10845992Not Available788Open in IMG/M
3300012362|Ga0137361_10111257All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2400Open in IMG/M
3300012363|Ga0137390_11227992Not Available696Open in IMG/M
3300012582|Ga0137358_10186821Not Available1414Open in IMG/M
3300012683|Ga0137398_10299954All Organisms → cellular organisms → Bacteria1080Open in IMG/M
3300012685|Ga0137397_10109651Not Available2028Open in IMG/M
3300012685|Ga0137397_10125501Not Available1892Open in IMG/M
3300012922|Ga0137394_10008096All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula8156Open in IMG/M
3300012922|Ga0137394_10049090All Organisms → cellular organisms → Bacteria3481Open in IMG/M
3300012922|Ga0137394_10126275All Organisms → cellular organisms → Bacteria2167Open in IMG/M
3300012922|Ga0137394_10592400Not Available938Open in IMG/M
3300012923|Ga0137359_10007128All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia9267Open in IMG/M
3300012923|Ga0137359_10051950All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium3566Open in IMG/M
3300012923|Ga0137359_10381142Not Available1252Open in IMG/M
3300012929|Ga0137404_10015985All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis5314Open in IMG/M
3300012929|Ga0137404_10050890All Organisms → cellular organisms → Bacteria3173Open in IMG/M
3300012929|Ga0137404_10167433All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1841Open in IMG/M
3300012929|Ga0137404_11039262Not Available750Open in IMG/M
3300012930|Ga0137407_10855069Not Available860Open in IMG/M
3300012930|Ga0137407_10930161Not Available823Open in IMG/M
3300012944|Ga0137410_10112464All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2033Open in IMG/M
3300012944|Ga0137410_10136245All Organisms → cellular organisms → Bacteria1857Open in IMG/M
3300012944|Ga0137410_10266199All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1348Open in IMG/M
3300012972|Ga0134077_10427327Not Available576Open in IMG/M
(restricted) 3300013138|Ga0172371_10558396Not Available811Open in IMG/M
3300013296|Ga0157374_10367470Not Available1431Open in IMG/M
3300014151|Ga0181539_1127747Not Available1036Open in IMG/M
3300014156|Ga0181518_10037216All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3063Open in IMG/M
3300014158|Ga0181521_10009509All Organisms → cellular organisms → Bacteria9724Open in IMG/M
3300014159|Ga0181530_10011212All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula7758Open in IMG/M
3300014159|Ga0181530_10341331Not Available775Open in IMG/M
3300014162|Ga0181538_10004078All Organisms → cellular organisms → Bacteria13909Open in IMG/M
3300014200|Ga0181526_10040414All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2990Open in IMG/M
3300014325|Ga0163163_10041547All Organisms → cellular organisms → Bacteria4497Open in IMG/M
3300014490|Ga0182010_10353272Not Available796Open in IMG/M
3300014493|Ga0182016_10026020All Organisms → cellular organisms → Bacteria5053Open in IMG/M
3300014494|Ga0182017_10374923Not Available883Open in IMG/M
3300014495|Ga0182015_10066878All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetes bacterium RBG_13_50_242563Open in IMG/M
3300014496|Ga0182011_10019188All Organisms → cellular organisms → Bacteria4978Open in IMG/M
3300014498|Ga0182019_10542971All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → Fimbriimonadia → Fimbriimonadales → Fimbriimonadaceae → Fimbriimonas → Fimbriimonas ginsengisoli → Fimbriimonas ginsengisoli Gsoil 348810Open in IMG/M
3300014501|Ga0182024_11463136Not Available782Open in IMG/M
3300014502|Ga0182021_10180842All Organisms → cellular organisms → Bacteria2461Open in IMG/M
3300014838|Ga0182030_10011390All Organisms → cellular organisms → Bacteria17961Open in IMG/M
3300014839|Ga0182027_10668492Not Available1105Open in IMG/M
3300014968|Ga0157379_10601492Not Available1026Open in IMG/M
3300015241|Ga0137418_10368529Not Available1180Open in IMG/M
3300015245|Ga0137409_10684185Not Available859Open in IMG/M
3300015264|Ga0137403_10011725All Organisms → cellular organisms → Bacteria9643Open in IMG/M
3300015264|Ga0137403_10029617All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis5802Open in IMG/M
3300015264|Ga0137403_10128505All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2519Open in IMG/M
3300015371|Ga0132258_11871874Not Available1511Open in IMG/M
3300015371|Ga0132258_12722479Not Available1233Open in IMG/M
3300015374|Ga0132255_100092778All Organisms → cellular organisms → Bacteria4077Open in IMG/M
3300015374|Ga0132255_103159801Not Available702Open in IMG/M
3300017929|Ga0187849_1015919All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia4511Open in IMG/M
3300017931|Ga0187877_1007648All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia7322Open in IMG/M
3300017931|Ga0187877_1009219Not Available6389Open in IMG/M
3300017935|Ga0187848_10179427Not Available919Open in IMG/M
3300018005|Ga0187878_1103770Not Available1158Open in IMG/M
3300018009|Ga0187884_10271167Not Available689Open in IMG/M
3300018013|Ga0187873_1058751Not Available1629Open in IMG/M
3300018020|Ga0187861_10305006Not Available682Open in IMG/M
3300018026|Ga0187857_10002027All Organisms → cellular organisms → Bacteria18971Open in IMG/M
3300018026|Ga0187857_10032843Not Available2773Open in IMG/M
3300018033|Ga0187867_10015286All Organisms → cellular organisms → Bacteria5109Open in IMG/M
3300018040|Ga0187862_10571287Not Available672Open in IMG/M
3300018047|Ga0187859_10199180All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1068Open in IMG/M
3300018057|Ga0187858_10340037Not Available943Open in IMG/M
3300018071|Ga0184618_10266562Not Available726Open in IMG/M
3300018083|Ga0184628_10005174All Organisms → cellular organisms → Bacteria6231Open in IMG/M
3300018083|Ga0184628_10034045All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia2546Open in IMG/M
3300018084|Ga0184629_10079204Not Available1569Open in IMG/M
3300019788|Ga0182028_1427291All Organisms → cellular organisms → Bacteria1854Open in IMG/M
3300019880|Ga0193712_1071548Not Available757Open in IMG/M
3300019887|Ga0193729_1007224All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula5171Open in IMG/M
3300019890|Ga0193728_1019855All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3372Open in IMG/M
3300020002|Ga0193730_1061018All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium1081Open in IMG/M
3300020004|Ga0193755_1174073Not Available636Open in IMG/M
3300021082|Ga0210380_10096780Not Available1303Open in IMG/M
3300021344|Ga0193719_10155796Not Available985Open in IMG/M
3300021411|Ga0193709_1000086All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales41433Open in IMG/M
3300021413|Ga0193750_1018059All Organisms → cellular organisms → Bacteria1713Open in IMG/M
3300021420|Ga0210394_10020661All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium6100Open in IMG/M
3300022526|Ga0224533_1010160Not Available1680Open in IMG/M
3300023088|Ga0224555_1006327All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia9293Open in IMG/M
3300024179|Ga0247695_1029565Not Available778Open in IMG/M
3300024179|Ga0247695_1038992Not Available683Open in IMG/M
3300025878|Ga0209584_10094755All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae1097Open in IMG/M
3300025941|Ga0207711_11261271Not Available681Open in IMG/M
3300026078|Ga0207702_11236353Not Available741Open in IMG/M
3300026095|Ga0207676_10785015Not Available928Open in IMG/M
3300026095|Ga0207676_11196201Not Available753Open in IMG/M
3300026116|Ga0207674_10011113All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula10125Open in IMG/M
3300026121|Ga0207683_10128655Not Available2277Open in IMG/M
3300026557|Ga0179587_10230365All Organisms → cellular organisms → Bacteria1181Open in IMG/M
3300027821|Ga0209811_10117781Not Available966Open in IMG/M
3300027854|Ga0209517_10020448All Organisms → cellular organisms → Bacteria6174Open in IMG/M
3300027896|Ga0209777_10057147All Organisms → cellular organisms → Bacteria → PVC group → Lentisphaerae → unclassified Lentisphaerota → Lentisphaerae bacterium RIFOXYA12_FULL_48_113489Open in IMG/M
3300027905|Ga0209415_10011576All Organisms → cellular organisms → Bacteria14831Open in IMG/M
3300029911|Ga0311361_10028949All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula9775Open in IMG/M
3300029913|Ga0311362_10096191Not Available3964Open in IMG/M
3300029914|Ga0311359_10576295Not Available836Open in IMG/M
3300029953|Ga0311343_10721324Not Available828Open in IMG/M
3300029955|Ga0311342_10919852Not Available658Open in IMG/M
3300029982|Ga0302277_1158075Not Available922Open in IMG/M
3300030020|Ga0311344_11132618Not Available597Open in IMG/M
3300030507|Ga0302192_10221334Not Available810Open in IMG/M
3300031057|Ga0170834_102536017Not Available804Open in IMG/M
3300031753|Ga0307477_10820066Not Available617Open in IMG/M
3300031813|Ga0316217_10001441All Organisms → cellular organisms → Bacteria30888Open in IMG/M
3300031947|Ga0310909_11322176Not Available579Open in IMG/M
3300032783|Ga0335079_10114737All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Methanomicrobia3044Open in IMG/M
3300032783|Ga0335079_11642699Not Available630Open in IMG/M
3300032893|Ga0335069_10003668All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula23081Open in IMG/M
3300032893|Ga0335069_11540849Not Available714Open in IMG/M
3300033402|Ga0326728_10000161All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia269868Open in IMG/M
3300033402|Ga0326728_10193092All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2073Open in IMG/M
3300033402|Ga0326728_10292879Not Available1495Open in IMG/M
3300033412|Ga0310810_10059718All Organisms → cellular organisms → Bacteria4630Open in IMG/M
3300033412|Ga0310810_10061934All Organisms → cellular organisms → Bacteria4533Open in IMG/M
3300033475|Ga0310811_10798669Not Available881Open in IMG/M
3300033823|Ga0334837_005749All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia6585Open in IMG/M
3300033823|Ga0334837_013886Not Available3365Open in IMG/M
3300034125|Ga0370484_0033321Not Available1235Open in IMG/M
3300034282|Ga0370492_0006993All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → Pedosphaera → Pedosphaera parvula4558Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil30.18%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Peatland6.31%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil6.31%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.96%
BogEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Bog3.60%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog3.15%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.15%
FenEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Fen3.15%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland1.80%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.80%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.80%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.80%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.80%
SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Soil1.80%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.80%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.80%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.35%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil1.35%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil1.35%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.35%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.35%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.35%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil0.90%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil0.90%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.90%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.90%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.90%
BogEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Bog0.90%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere0.90%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.90%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.45%
FreshwaterEnvironmental → Aquatic → Freshwater → Lentic → Hypolimnion → Freshwater0.45%
Freshwater Lake SedimentEnvironmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment0.45%
FreshwaterEnvironmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater0.45%
WetlandEnvironmental → Aquatic → Marine → Wetlands → Sediment → Wetland0.45%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.45%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.45%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.45%
Arctic Peat SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil0.45%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil0.45%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil0.45%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil0.45%
Prmafrost SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Prmafrost Soil0.45%
PalsaEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Palsa0.45%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost0.45%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.45%
Switchgrass RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere0.45%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.45%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.45%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.45%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere0.45%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001213Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly)EnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005333Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-3 metaGHost-AssociatedOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005466Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005577Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2Host-AssociatedOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005618Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005836Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBBEnvironmentalOpen in IMG/M
3300005994Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009029Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 1 DNA2013-189EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009177Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaGHost-AssociatedOpen in IMG/M
3300009519Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_150EnvironmentalOpen in IMG/M
3300009615Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_4_100EnvironmentalOpen in IMG/M
3300009700Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_4_PS metaGEnvironmentalOpen in IMG/M
3300009760Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_100EnvironmentalOpen in IMG/M
3300010341Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM2EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011423Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT119_2EnvironmentalOpen in IMG/M
3300011444Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300013138 (restricted)Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo ? kab_022012_12mEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300014151Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_60_metaGEnvironmentalOpen in IMG/M
3300014156Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin01_60_metaGEnvironmentalOpen in IMG/M
3300014158Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaGEnvironmentalOpen in IMG/M
3300014159Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin10_60_metaGEnvironmentalOpen in IMG/M
3300014162Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_30_metaGEnvironmentalOpen in IMG/M
3300014200Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_30_metaGEnvironmentalOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300014490Permafrost microbial communities from Stordalen Mire, Sweden - 611E1M metaGEnvironmentalOpen in IMG/M
3300014493Permafrost microbial communities from Stordalen Mire, Sweden - 712S2M metaGEnvironmentalOpen in IMG/M
3300014494Permafrost microbial communities from Stordalen Mire, Sweden - 712E3D metaGEnvironmentalOpen in IMG/M
3300014495Permafrost microbial communities from Stordalen Mire, Sweden - 712P3M metaGEnvironmentalOpen in IMG/M
3300014496Permafrost microbial communities from Stordalen Mire, Sweden - 711E1D metaGEnvironmentalOpen in IMG/M
3300014498Permafrost microbial communities from Stordalen Mire, Sweden - 812E2M metaGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014502Permafrost microbial communities from Stordalen Mire, Sweden - 612E3M metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014838Permafrost microbial communities from Stordalen Mire, Sweden - 812S3M metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014839Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017929Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_100EnvironmentalOpen in IMG/M
3300017931Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_17_100EnvironmentalOpen in IMG/M
3300017935Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_40EnvironmentalOpen in IMG/M
3300018005Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_17_150EnvironmentalOpen in IMG/M
3300018009Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_20_40EnvironmentalOpen in IMG/M
3300018013Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_16_100EnvironmentalOpen in IMG/M
3300018020Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_100EnvironmentalOpen in IMG/M
3300018026Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_100EnvironmentalOpen in IMG/M
3300018033Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_13_10EnvironmentalOpen in IMG/M
3300018040Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_150EnvironmentalOpen in IMG/M
3300018047Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_10_10EnvironmentalOpen in IMG/M
3300018057Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_8_150EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018083Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300019788Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (PacBio error correction)EnvironmentalOpen in IMG/M
3300019880Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a1EnvironmentalOpen in IMG/M
3300019887Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1a1EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021411Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3c2EnvironmentalOpen in IMG/M
3300021413Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1c1EnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300022526Peat soil microbial communities from Stordalen Mire, Sweden - 717 E1 10-14EnvironmentalOpen in IMG/M
3300023088Peat soil microbial communities from Stordalen Mire, Sweden - 717 S2 30-34EnvironmentalOpen in IMG/M
3300024179Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36EnvironmentalOpen in IMG/M
3300025878Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE PermafrostAB12-D (SPAdes)EnvironmentalOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026078Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026116Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026121Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027854Peat soil microbial communities from Weissenstadt, Germany - SII-2010 (SPAdes)EnvironmentalOpen in IMG/M
3300027896Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies -HBP12 HB (SPAdes)EnvironmentalOpen in IMG/M
3300027905Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)EnvironmentalOpen in IMG/M
3300029911III_Bog_N2 coassemblyEnvironmentalOpen in IMG/M
3300029913III_Bog_N3 coassemblyEnvironmentalOpen in IMG/M
3300029914III_Bog_E2 coassemblyEnvironmentalOpen in IMG/M
3300029953II_Bog_E3 coassemblyEnvironmentalOpen in IMG/M
3300029955II_Bog_E2 coassemblyEnvironmentalOpen in IMG/M
3300029982Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - III_Bog_N3_1EnvironmentalOpen in IMG/M
3300030020II_Bog_N1 coassemblyEnvironmentalOpen in IMG/M
3300030507Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Bog_E3_2EnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031813Freshwater microbial communities from Trout Bog Lake, Wisconsin, USA - 1anoAEnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033402Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB31MNEnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M
3300033823Peat soil microbial communities from Stordalen Mire, Sweden - 714 S3 30-34EnvironmentalOpen in IMG/M
3300034125Peat soil microbial communities from wetlands in Alaska, United States - Sheep_creek_tus_01_15EnvironmentalOpen in IMG/M
3300034282Peat soil microbial communities from wetlands in Alaska, United States - Eight_mile_03D_16EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
INPgaii200_100598122228664022SoilMKFRILLAVFLASISTVAHSETFIGPTTSTNRLLVASNSAIIITTTLGNFTNSTHIVLQGYQFNQSYFAPLESGNSYALAGPAELIFTNPAVATFYRITNSAIFSQSIGNDPIGISIPTNKTMRLFGVPDSVNASFSRPGTLSVNFNLEPNRPAEFTGPGTLFLASGVFPPLGKFISYFFAEDGFVLPNQRALAGPSGSFAIMVEKSFDLNSWSPVLLENTSDAPHAFYRLRMQR
INPhiseqgaiiFebDRAFT_10141976423300000364SoilMNPRNSLSLFFAFVSTFAQGETFIGPTTTTNRLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYAIAGPAELIFTNPVAITFYRIVNSAIRSQGIGNDPIGISIPTNKTMRLFGVPAPVNASFSRLGTGSVNFTLQPNQPAEFTGPGTLFLASGVFPPSGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDLNTWAPVLLETTSDAPQAFYRLRTQR*
INPhiseqgaiiFebDRAFT_10169186933300000364SoilMNXTGAIALSVLLISQFAFAETFVGPTTPANHLLVASNSAIIITTTLGDFTNSTQVILNEFPFQLDYFAPLENSTTYAVAGPAELVFSNAVVITYYQITNSTIHTQWIANDPIAITIASNKTLRLFGVPTAVNASFSRSNSFVSFTLKPNRPAEFTGPGVLALSSGVFPPYAKFISYFVAEDGFVLPDQRAISGPSGSFAVTLEKSADLKTWSPALLLNTSDEMKAFYRLRIER*
INPhiseqgaiiFebDRAFT_10534053723300000364SoilMKPKFAIALYILLAFPLAHGETFIGPTTVTNRLLVPSDSAIVITATFGDFTTSTQLAQGAGGLPRPLNYFTPLENGTTYALAGPAELVFSNAVLFTFYRVTNSGILTQSIANDPIGITIAANKTMQLFGVPAAVNASFLRPESSPVSFTLEPGKPAEFTGPGSLILSSGVPAPSIKFISYFIAEDGFALPDEGAIAGPSGSYTIMVEKSFDLNTWSPVLLKNTSNPDKAFFRLRIQR*
JGI1027J11758_1243565413300000789SoilMKPKFAIALYILLAFPLAHGETFIGPTTVTNRLLVPSDSAIVITATFGDFTTSTQLAQGAGGLPRPLNYFTPLENGTTYALAGPAELVFSNAVLFTFYRVTNSGILTQSIANDPIGITIAANKTMQLFGVPAAVNASFLRPESSPVSFTLEPGKPAEFTGPGSLILSSGVPAPSIKFISYFIAEDGFALPDEGAIAGPSGSYTIMVEXSFDLNTWSPVLLKNTSNPDKAFFRLRIQR*
JGI1027J11758_1281497933300000789SoilAVFLASISTVAHSETFIGPTTSTNRLLVASNSAIIITTTLGNFTNSTHIVLQGYQFNQSYFAPLESGNSYALAGPAELIFTNPAVATFYRITNSAIFSQSIGNDPIGISIPTNKTMRLFGVPDSVNASFSRPGTLSVNFNLEPNRPAEFTGPGTLFLASGVFPPLGKFISYFFAEDGFVLPNQRALAGPSGSFAIMVEKSFDLNSWSPVLLENTSDAPHAFYRLRMQR*
JGIcombinedJ13530_10053796513300001213WetlandSLMKSLGMIALWLGVGVRIACGETFIGPTSATNRLLVPTNSAIIITATFGDFTNSTQVQLGDGSPFTQSYFAPLANGTAYALAGPGELIFSNAMLFTFYRLTNSAIYSQGIANDPIGITIASNKTMHLFGVPGPVNASFSRESSAGGGALSFTLEPNRPAEFSGPGTLFLNSGVFFPYAKFISYFIEEDGFALPGLRSIAGPSGSFAISVERSVDLRSWSPVLMQNVSDPANAFYRLRIER*
Ga0062385_1050045213300004080Bog Forest SoilSTLAKGETFIGPTTATNRLLVSSNSAIIITTTLGDFTNSTYIALGVDGYLFPLNYFAPLENGSEYALAGPAELVFSNAALITYYVVTNSTISTQTIATDPISIVIASNQTFHLFGVPAPVNVSFAQGGSRSVSFALLPNQPAEFSGPGGLTLGGGVFPPYTKFISYFIEEDGFTLPNQRAVAGPTGSYAITVEKSFDLNIWSPVLLATTSDANQAFYRLKIQR*
Ga0062389_10326604413300004092Bog Forest SoilLENFLIMKPRVAIILCLALVSTLAQGETFIGPTTTTNRLLVSSNSAIIITATFGDFTNSTQVAQGVGGSPSPLNYFAPLENGTEYALAGPAELVFSNAALITYYVLTNSAITTQIIEQDPIAITIASNQTLHLFGVPAAVNASFARAGSSTVNFALVPNQPAEFSGPGGLLLSSGVFPKFISFFREEDGFTIPNQRAIAGP
Ga0062595_10045536913300004479SoilMNPGISLSLFFALISMLAEGETFIGPTTATNRLLVPTNSAIIITTSLGNFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVAITFFRIVNSAIRSQGIGYDPIGISIPTNKTMRLFGVPAPVNASFSRVGRGSVSFTLQPNQPAEFTGPGTLFLNSGVFPPSGKFISYFFAEDGFVLPNQRGIAGPSGSYAIMVEKSFDLNSWSPVLFETTSDAPQAFYRLRTQR*
Ga0066685_1038939723300005180SoilQRLESDCCMMNSKFATALFIASISTLAHGETFIGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELGGFQFSQNYFAPLENGGAYALAGPGELVFSNAVLISYYRVTNAAIRTQVILNDPIGIAIGTNQTMRLFGVPAPVNASFSRPGSGFVGFTLEPNQPAEFTGPGELALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSFAIMVEKSVDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0066678_1029268023300005181SoilMKPRTALALFIVLVSALAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALSEDGFPFSQNYFAPLENGSSYALAGPAELIFSNAVLITYYQVTNSAIRTQFIANDPIVIPIASYKTMRLFGVPAPVNAAFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0065705_1003637113300005294Switchgrass RhizosphereLRSGCERGTFVGTGSLRYGDSRVTTCVAVTKKRLVNVLIMKPIXTLALFIVLVSTLAXGETFIGPTTATNRLLVASNSAIIITATLGDFTNSTQLAQGESGFPFALNYFAPLENGNTYALAGPAELVFSNAVLFTFYRMTNSAIFTQSIANDPIVIQIASNKTMRLFGVPATVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFALPDQRAIVGPSGSYAIIVEKSFDLNRWSPVLLKNTSDADKAFFRLRIQR*
Ga0065707_1004043313300005295Switchgrass RhizosphereELTRPLRSGCERGTFVGTGSLRYGDSRVTTCVAVTKKRLVNVLIMKPIXTLALFIVLVSTLAXGETFIGPTTATNRLLVASNSAIIITATLGDFTNSTQLAQGESGFPFALNYFAPLENGNTYALAGPAELVFSNAVLFTFYRMTNSAIFTQSIANDPIVIQIASNKTMRLFGVPATVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFALPDQRAIVGPSGSYAIIVEKSFDLNRWSPV
Ga0070677_1034603713300005333Miscanthus RhizosphereMNTKWSIALCLVFTSTLAQGETFIGPTTATNRLLVPSNSAIIIAATLGDFTNSMRLAQGQGGSPFPLDYFAPLENGNTYALAGPSELVFSNAALFTFYRVTNAAIFTQSIANDPIGRSIASNQTIRLFGVPATVNANFVRPGSDPISFTLEPGKPAEFTGPGSLSLSSGVFPPYIKFISYFIAEDGFVLPDQRVVTGPSGSFSIMVERSLDLNLWSPVLLKNTSDANKAFFRIRIQR*
Ga0070678_10013385423300005456Miscanthus RhizosphereMNTKWSIALCLVFTSTLAQGETFIGPTTATNRLLVPSNSAIIIAATLGDFTNSMRLAQGQGGSPFPLDYFAPLENGNTYALAGPSELVFSNAALFTFYRVTNAAIFTQSIANDPIGRSIASNQTIRLFGVPATVNANFVRPGSAPISFTLEPGKPAEFTGPGSLSLSSGVFPPYIKFISYFIAEDGFVLPDQRVVTGPSGSFSIMVERSLDLNLWSPVLLKNTSDANKAFFRIRIQR*
Ga0070685_1078955813300005466Switchgrass RhizosphereFIGPTTATNRLLVQSNSAIVITATFGDFTNSTQVALGEGSFAFPLNYFAPLEHGNTYALAGPAELIFSNAVLISYYPVTNSAIFTQFIANDPVGISIASNKTMRLFGVTAPVSASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPNQRAIAGPTGSYAITVEKSFDLKAWSPVILENTSDANKAFYRLRIQR*
Ga0073909_1009383213300005526Surface SoilMKPIHIIALFLGVSVRLACAETFIGPTTATNRLLVPGNSAIIITTTLGDFTNSTQVVLGGGAPFTQSYFVPLQNGTVYAVAGPAELLFSNAVLFTFFRLTNSAIHSQGIANDPIGISIPANKTMHVFGVPAEVNASFVRPAAAGGGSLSFKLEPNAPAEFTGPGTLSLNSGVFPPYAKFISYFFDEDGFTLSDMRTISGPSGSFAISVERSVDLQGWSPVMLQNTSEPTKAFYRLRIQR*
Ga0070735_1002367253300005534Surface SoilMKLGTAIALFVVFVSTLAQAETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTQLAYSGSQFPLNYFAPLGNGSAYALAGPMELVFSKAALITYYVVTNSAILTQTIGNDPITITIASNKTMRLFGVPALVNASFSRPGSGSVSFTLGPDQSAEFTGPGALSLNSGVLPPYVKFISYFFAEDGFVLPNQRAIAGPSGSYAITLEKSFDLNTWSPILLANTTDANQAFYRLTIQH*
Ga0070730_1046580913300005537Surface SoilSCRLYMLPHPCCVDTTEASSTVAAMKPISFIALVLAVSLRFAGAETFIGPTTATNRLLVPTNSAIIITATFGDFSNSTQVVLFGGPAFTQSFAPLEHGAVYALAGPSELIFSNAVLFTFFRLTNSAIHSQGIANDPIGILIPTNKTMHLFAVPAEVNAGFMRPDGANLNFTLEPSAPAEFTGPGTLSLNSGVLPPAARFISYFFDEDGFTLPDMRIIAGPTGSFAISVEKSLDMRSWSPVLLQNTSDPTQAFYRLRIQR*
Ga0066697_1048776213300005540SoilMKSSPALALFIVLVSALAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEDGFAFSQNYFAPLENGGSYALAGPAELIFSNAVIISYYQVTNSAIRTQSIAYDPVGISIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPAGKFISYFIAEDGFTLPNQRAIAGPTGSYAITVEKSFDLNTWWPAFLQNT
Ga0066695_1015067333300005553SoilMKSSTALALFMVLVSTLAHGETFIGPTTATNRLLVPSNSAVIITATFGDFTNSTQVALGEGGFPFSQNYFAPLENGSSYALAGPAELIFSNAALITFYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFSPYSKFISYFIAEDGFSLPSQRAIA
Ga0066698_1057375613300005558SoilGYGTFSPDNTSHRTMKTRLVIALLIASVSTFARGETFIGPTTSTNRLLVQSNSAIIVTTTFGDFTNSTQVVLDGFQFLQSYFAPLENGSSYALAGPAELIFSNAVVITYYRITNSAIRTQSIGNDPIPISIASNKTMRLFGVPAPVNASFSRPEGGFVSFILEPNRPAEFTGPGTLALNSGVFPPYSKFISYFFAEDGFALPNQRYISGPNGSYAISVEKSFDLNTWSPVLLENTAEENKAFYRLRIQR*
Ga0066705_1013719023300005569SoilMKPRTALALFIVLVSTLAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNAVLISYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0068857_100000452133300005577Corn RhizosphereMKPRILFALLFALISRLVHGETFVGPTTGTNRLLVPSNSAIVITTTLGDFTNSTQVVLQDYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSSIHSQGIGNDPIGIQIPTNKTMRLFGVSAPVNASFSRPGSGFVSFILEPNHPAEFTGPGTLSLNSGVFPPMGKFISYFIAEDGFIVPNQRAMAGPSGSYSIMVEKSFDMIAWSPVLLENTADMPQAFYRLRTQH*
Ga0066706_1057877813300005598SoilMKSSTALALFMVLVSTLAHGETFIGPTTATNRLLVPSNSAVIITATFGDFTNSTQVALGEGGFPFSQNYFAPLENGSSYALAGPAELIFSNAALITFYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFSPYSKFISYFIAEDGFILPSQRAITGPTGSYAITVEKSFDLNTWSPALLANTADPNKAFYRLRIQR*
Ga0068864_10025773923300005618Switchgrass RhizosphereMNPKNVIAVLCAFIAMLAHAETFVGPTTATNRLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR*
Ga0068864_10130756013300005618Switchgrass RhizosphereSRFAFAVIGKRLENASIMKSRTAIALFIASVSTLALGETFIGPTTATNRLLVQSNSAIVITATFGDFTNSTQVALGEGSFAFPLNYFAPLEHGNTYALAGPAELIFSNAVLISYYPVTNSAIFTQFIANDPVGISIASNKTMRLFGVTAPVSASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPNQRAIAGPTGSYAITVEKSFDLKAWSPVILENT
Ga0066903_10004622433300005764Tropical Forest SoilMKPAVEIAAAIALFSAPALGETFVGPTTSTNRLLVGTNSAIIITSTFGDFTNSTQVALGGFQFTQSYFAPLETGSAYALAGPAELIFSNAVLISYYRITNSAIFTQSIGNDPIGIPIASNKTMRFFSVPAPVNANFSGSGNGSVSFTLEPNRPAEFTGPGILALNSGVFPPYAKFVSYFIAEDGFVMPDQRPIVGPSGSFAVIIEKSADLETWSPVVMQNTSDELKAFYRLRIAR*
Ga0066903_10026379623300005764Tropical Forest SoilMKPQGFLTLFFVLISTFSYAETFIGPTTATNRLVVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSAIRSQGIANDPIGISIPTNKTMRLFGVPALVNASFSRLGSGSVSFTLEPNHPAEFTGPGTLFVNSGVFPPLGKFISYFIAEDGFVLPNQRAIAGPSGSYAIMVERSFDLTTWSPVLMEDTSDASQAYYRLRTQR*
Ga0066903_10101720723300005764Tropical Forest SoilMRLSCYETPRISRICLCGGIHTRHGETIIGPTTATNRLLVPTNSAIIITTTFGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVAITFYRIVNSAIRSQGIGNDPIGISIPTNKTMRLFGVPAPVNASFSRIGTGSVSFTLQPNQPAEFTGPGTLFLNSGVIPPSGKFISYFFAEDGFVLPNQRAIAGPTGSYAIMVEKSFDLNSWSPVLMENTSDALQAFYRLRTQR
Ga0074470_1025898933300005836Sediment (Intertidal)MKTKTSIALCAAFVATLANGETFLGPTTTTNRLLVPTNSAIIVTAMFGDFTNSTVVVLGGLPFTQEFAPLANGGSYALAGPAELVFSNATVITYYQVTNSAIRSQWIGYDPVGIPIASNKTIRLFGVPAPVNASFSRADGGFVSFKLEPNRPAEFTGPGVLALNSGVFPPEGKFISYFIAEDGFVIPNQRATVEPTGSFAIMVEKSFDLNSWSPALLENTVEANRAFYRLRIQR*
Ga0066789_1036673113300005994SoilIGPTTATNRLLVPGNSAIIITATFGDFTNSTRVAQGGGAAFPMSYFAPLQNGTEYALAGPAELVFSNVALITYYQLTNSAIITQTIANDPIPITIASNQTMRLFGVPASVNANFGRPGGGSVGFTLEPCQPAEFTGPGTLSLNSGVLPPYTKFISYFIAKDGFTIPNQRFIAGPTGSFAVTVEKSSDLNTWSPVLLENTT
Ga0066665_1050927913300006796SoilMKSTTALALFVVLVSTLAHGETFIGPTTATNRLLVPSNSAVIITATFGDFTNSTQVALGEGGFPFSQNYFAPLENGSSYALAGPAELIFSNAVLITYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAAFSRPGSGFVSFTLEPNRPAEFTGPGSLALSSGVFFPYSKFISYFIAEDGFTLPGERAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0075424_10206877413300006904Populus RhizosphereVLLVSTLAQSETFIGPTTATNRLLVASNSAIIITATFGDFTNSTQVALGEGSPFPLNYFAPLENGTAYALAGPAELIFSNAVLISFYQMTNSAISTQFIANDPIVIPIATNKTMRLFGVPAPVNAAFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVLFPYSKFISYFIAEDGFALPDQRAIVGPSGSYAIMVEKSL
Ga0099791_1036826813300007255Vadose Zone SoilKTQNAFVMRPRTSIVLFLALVSTLSHAETFIGPTTSTNHLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVAITFYRIMNSAIRTQDIGYDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLSLNSGVFPPLGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDLNSWSPVLLETTS
Ga0066710_10075941413300009012Grasslands SoilMKPRTAIALFVAFAATLAHGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVVLGGFQFTQSYFAPLENGNSYALAGPAELIFSNSVLITYYQLTNSAIRTQSIGNDPIGISIASNRTMRLFGVPAPVNASFSRPGNGFVSFTLEPNRPAEFTGPGTLALNSGVFLPYGKFISYFIAEDGFTLPNQRAIVGPSGSYSIM
Ga0066710_10102332623300009012Grasslands SoilMFRHGRFDADFGVDDVIVGNYCAWMKSAEPIALILALTSTDVFGETFIGPTTATNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSSFAPLETGATYALAGPAELVFSNAVLITYYQITNSTIRTQSIGNDPVPISIASNKTMRLFGVPAPVNANFSGPGNSSVSFTLEPNRPAEFTGPGILHLNSGVFHPYGKFISYFVAEDGFALPDQRAIAGPSGSFAITVEKSADLKSWSPVLMQNTSDVAKAFYRLRIER
Ga0066710_10134785213300009012Grasslands SoilITAAFGDFTNSTQVVLDGFQFLQSYFAPLENGSTYALAGPAELIFSNAVVITYYRITNSAIRTQSIGNDPIPISIASNKTMRLFGVPAPVNASFSRPEGGFVSFILEPNRPAEFTGPGTLALNSGVFPPYSKFISYFFAEDGFALPNQRYISGPNGSYAISVEKSFDLNAWSPVLLENTAEENKAFYRLRIQR
Ga0066710_10142910623300009012Grasslands SoilMNSRTAISLFIASVSTLAEGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVALGEGGFPYPQNYFAPLENGNAYALAGPAELIFSNAVLISYYQVTNSAIFTQYIANDPVGISIASNKTMRLFGVTAPVNASFSRPGSSFVSFTLEPNRPAEFTGPGTLMLGSGVLPPYIKFISYFIAEDGFALPNQRVIAGPSGSYAITVEKSFDLK
Ga0066710_10235077913300009012Grasslands SoilMKSSTALALFMVLVSTLAHGETFIGPTTATNRLLVPSNSAVIITATFGDFTNSTQVALGEGGFPFSQNYFAPLENGSSYALAGPAELIFSNAALITFYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFLPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDPNKAFYRLRIQR
Ga0066793_1059932313300009029Prmafrost SoilMLVPTNSAIIITATFGDFTNSTQLELGGGVPFSQSYFAPLENGASYALAGPAELIFSNAVLFTFYRLTNSAIYSQVILNDPIGIPIASNKTMRVFGVPAPVSASYERPISAGGGALSFTLEPNHPAEFTGPGTLSLNSGEFFPYAKFISYFIEEDGFAMPGQRAMAGPSGSFAISVEKSVDLNTWSPVLMENTSDPVKAYY
Ga0099829_1065884113300009038Vadose Zone SoilMMNSKLATALFIASISMLARGETYVGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVALDGSPFTQSYFAPLENGSAYALAGPAELIFSNAVLITYYRVTNSAIRTQVILNDPITLSLGTNQTMRLFGVPAPVNASFSRPGTGFVAFTLEPNRPAEITGPGILALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0099830_1060076413300009088Vadose Zone SoilMKPRIAVSLLIASVSTLAHGETFIEPTSATNRLLVPSNSAIIITATFGDFTNNTQVVLDGFQFMQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIANDPIGIAIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPSAKFISYFIAEDGFTLPDQRAIAGPTGSYAITVEKSFDLNTWSPAFLQNTSDENRAFYRLRIQH*
Ga0066709_10048572713300009137Grasslands SoilTMKTRLVIALLIASVSTFARGETFIGPTTSTNRLLVQSNSAIIITAAFGDFTNSTQVVLDGFQFLQSYFAPLENGSTYALAGPAELIFSNAVVITYYRITNSAIRTQSIGNDPIPISIASNKTMRLFGVPAPVNASFSRPEGGFVSFILEPNRPAEFTGPGTLALNSGVFPPYSKFISYFFAEDGFALPNQRYISGPNGSYAISVEKSFDLNAWSPVLLENTAEENKAFYRLRIQR*
Ga0066709_10056728223300009137Grasslands SoilMFRHGRFDAGFGVDDVIVGNYCAWMKSAEPIALILALTSTDVFGETFIGPTTATNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSSFAPLETGATYALAGPAELVFSNAVLITYYQITNSTIRTQSIGNDPVPISIASNKTMRLFGVAAPVNASFSGPGNRSISFTLEPNRPAEFTGPGILDLNSGVFPPYGKFISYFVAEDGFALPDQRAIAGPSGSFAITVEKSADLKSWSPVLMQNTSDVVKAFYRLRIER*
Ga0105248_1111061913300009177Switchgrass RhizosphereIARLARFQYQPYCLSSETTAALSRKRHEIVFIMKPRTAIALFIALVSTPAPGETFIGPTTSSNRLLVPSNSALIITATFGDFTNNTQVALGEGGLPFLQSYFAPLENGNSYALAGPAELIFSNAALISFYQVTNSAIRTQFIANDPIVIPIAANKTMRLFGVPAPVPAGFSRPGSGFVSFTLEPNQPAEFTGPGNLQLSSGVLFPYSKFISYFIAEDGFTLPNQRVIAGPSGSYVITVEKSFDLNTWSPVLLQNTSDADKAFYRLRIQR*
Ga0116108_118945313300009519PeatlandTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMGNTADPV
Ga0116103_103938223300009615PeatlandMKPICLVAVSLAVAAQFARGETFIGPTTATNRLLVPTNSAIIVTATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEK
Ga0116217_1020292723300009700Peatlands SoilFIGPTTATNRLLVPANSAIIITTTLGDFTNSTQVQLGGGGSFTQPSFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPIGIAIPSNQTMHIFGVPAPVNASFVRQNSQGGGALSFVLAPNSPAEFTGPGTLFLSSGVLPPNAKFISYFIDQGGTPLPGQQAVAGPTGSFAITVDKSVDLSTWLPIWMQTASDPASAFYRLRIQR*
Ga0116131_111633213300009760PeatlandMRPIICAIALFLALAATPACGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGYPFTQSFFAPLEHGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPLGIPLASNQTMHVFGAPAPVNASFVRQSSEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFFPNAKFISYFIDQDGYALPGQQAVAGPTGSFAISLDK
Ga0116131_115320513300009760PeatlandATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEKSVDLKTWSPVLLQNNSDPANAYYRLRIQR*
Ga0074045_1025924123300010341Bog Forest SoilGLLAEAEMFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLGDGGAPIIQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAIYSQGIANDPIGISIASNKTMHVFGVPAPVNASFNRPISAGGGSLSFMLEPNSPAEFTGPGTLLLNSGVLPPYAKFISYFIAEDGFVMPGQRVIAGPSGSFAISVEKSIDLNTWSPVLMENTSDPVQAYYRLRIQR*
Ga0074044_1007448123300010343Bog Forest SoilMKSICMIALFIALAGLLAEAEMFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLGDGGAPIIQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAIYSQGIANDPIGISIASNKTMHVFGVPAPVNASFNRPISAGGGSLSFMLEPNSPAEFTGPGTLLLNSGVLPPYAKFISYFIAEDGFVMPGQRVIAGPSGSFAISVEKSIDLNTWSPVLMENTSDPVQAYYRLRIQR
Ga0074044_1053752913300010343Bog Forest SoilMKSKIPTILCIAVVSGFAYGETFIGPTTSTNRLLVSSNSAIIITATLGDFTNSTYVAFGSSAPEPQIYFAPLESGNVYALAGPAELVFSNAAVITYYQLTNSAIFTQFVANDPIGIPIASNQTMRLFGVPAAVNASFSRDGSSAGCTLEPNQPAEFTGPGTLTLSSGVLPPYGKFISYFIEQNGFTLPDQTVMAGPSGSYTVTVEKSSDMTVWSPVLLENTTDAQNAFYRLKIQR*
Ga0126377_1124745513300010362Tropical Forest SoilVAHGGIGVGLHQRIKGPNVHIMKLISAIALFTLVSSSVQSETFIGPTTETNRLLVSSNSAIVITALFGEFTNSTQVVLGGFQFMQSYFAPLLIGNTYALAGPAELIFTNSAVVTYYQLTNSTIITQSIGNDPIVITIASNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNVAAAFSGPGDLALNSGVFPPFSKFISYFIEEDGFTLPDRRAIVGPSGSFAISVEKSFDLSAWSPVLLETSSDDSKAFYRLRIQR*
Ga0134124_10009044133300010397Terrestrial SoilMVANHALRTAAGRRDCNRCASWPPSLGLWVASRFAFAVIGKRLENASIMKSRTAIALFIASVSTLALGETFIGPTTATNRLLVQSNSAIVITATFGDFTNSTQVALGEGSFAFPLNYFAPLEHGNTYALAGPAELIFSNAVLISYYPVTNSAIFTQFIANDPVGISIASNKTMRLFGVTAPVSASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPNQRAIAGPTGSYAITVEKSFDLKAWSPVILENTSDANKAFYRLRIQR*
Ga0134124_1072209813300010397Terrestrial SoilTNIFFLMNTKFSIALCIVFASTLAQGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSMRLAQGQGGFPYPLNYFAPLENGNTYALAGAAELVFSNAALFTFYRVTNSAIFTQSIANDPIGLSIVSNRTMRLIGVPATVNASFVRPGSSPVSFTLEPGKPAEFTGPGSLSLSSGVFAPYIKFISYFIEEDGFVLPDQRVVAGPSGSFSVLVEKSLDLNLWSPVLLKSTSDADKAFFRLRIQR*
Ga0126383_1068504623300010398Tropical Forest SoilMAAFALFSAQALGETFVGPTTSANRLLVATNSAIIITSTFGDFTNSTQVVLEGFQFTQSYFAPLETGGTYALAGPAELIFSNAVLITYYRITNSAIFTQSIANDPIGIPIASNKTMRLFGVPAPVNASFLGPGNGSVSFTLEPNRPAEFTGPGVLALNSGVFPPYAKFVSYFIAEDGFVMPDQRPIVGPSGSFAVIIEKSADLETWSPVLMENTSDEVKAFYRLRIAR*
Ga0134121_1028604023300010401Terrestrial SoilMKPRTALAFFIVLISTLAHGETFIGPTTASNRLLVASNSAIIVTATFGDFTNSTQVALAGSKFALPYFAPLENGSSYALAGPAELIFANTVLISYYQMTNSAISTQFIANDPVVIPIASNKTMRLFGVPASVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPFSKFISYFIAEDGFTLPGQRAIAGPTGSYAITVEKSFDLNTWSPVLLANTSDEKEVFYRLRIQR*
Ga0137392_1076456613300011269Vadose Zone SoilARGETYVGPTTSTNRLLVPSNSAIVITATLGDFTNGTQVALDGSPFTQSYFAPLENGSAYALAGPAELIFSNAVLITYYRVTNSAIRTQVILNDPITLSLGTNQTMRLFGVPAPVNASFSRPGTGFVAFTLEPNRPAEITGPGILALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137392_1094426713300011269Vadose Zone SoilYPTQRAYFGIATNSTVAQFRRVAFHRKRRDNSLTMKPRIAVSLLIASVSTLAHGETFIGPTSATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFLFMQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIANDPIGIAIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPSAKFISYFIAEDGFTLPDQRAIAGPTGSYAI
Ga0137393_1035267723300011271Vadose Zone SoilMKPGSVIAFCMALGSAQAFSETFVGPTTSTNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSYFAPLETGGVYALAGPAELIFSNAVLITYYQITNSAIRTQSIGNDPIPISIASNKTVRLFGVPAPVNASFSASGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPYGKFISYFIAEDGFALPDQRAIVGPSGSLAIIIEKSANLKTWLPVLMQNTSDDVKAFYRLRIER*
Ga0137436_107514813300011423SoilMKPRITLALFLVLVSTLGQGETFIGPTTANNRLLVASNSAIIITATLGDFTNSTQLAQGESGFPFALNYFAPRENGNTYALAGPAELVFSNAVLFTFYRMTNSAIFTQSIANDPIVIQIASHNTMRLFGVPATVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPCSKFISYFIAEDGFALPDQRAIVGPSGSYAIIVEKSFDVNMLSPVLLKNTSDADKAFFRLRIQR*
Ga0137463_108927223300011444SoilMNPKNVISVLCAFIAMLAHAETFVGPTTATNRLLVPSNSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVPFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALVGPSGSYAIMVEKSFDLNNWTPVLLENTADANRAFYRLRIQR*
Ga0137389_1070087713300012096Vadose Zone SoilMKPRIAVSLLIASVSTLAHGETFIGPTSATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFLFMQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIANDPIGIAIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPSAKFISYFIAEDGFTLPDQRAIAG
Ga0137388_1003719933300012189Vadose Zone SoilMKPRIAVSLLIASVSTLAHGETFIGPTSATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFLFMQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIANDPIGIAIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPSAKFISYFIAEDGFTLPDQRAIAGPTGSYAITVEKSFDLNTWSPAFLQNTSDENRAFYRLRIQH*
Ga0137364_1033929513300012198Vadose Zone SoilMKSSTALALFVVLVSTLAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNVALITFYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFVSYFIAEDGFTLPSQPAIAGPTGSYAITVEKSFDLNTWSPALLANTSDPNKAFYRLRIQRQ*
Ga0137383_1068851313300012199Vadose Zone SoilMKPRTALALVIVVVSTLAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNAVLISYYEVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALL
Ga0137383_1077375513300012199Vadose Zone SoilTNRLLVPSNSAIVITATLGDFTNSTQVELSGFQFSQNYFAPLENGGAYALAGPGELVFSNAVLISYYRVTNSAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFVRPGTGSVAFTLEPNRPAEFTGPGELALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNNWSPVLLQNTADDNRAFYRLRIQR*
Ga0137365_1045946613300012201Vadose Zone SoilTFIGPTTATNRLLVPSNSAIIITTTLGDFTNSTQVVLDGFQFMQSYFAPLDNGNSYALAGPAELIFSNAVIISYYQVTNSAIRTQSIANDPVGISIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPAGKFISYFIAEDGFTLPNQRAIAGPTGSYAITVEKSFDLNTWWSAFLQNTSDENRAFYRLRIQH*
Ga0137399_1081144413300012203Vadose Zone SoilPIGAIASFLLLVSKLALGETFIGPTTPTNRLLVASNSAIIITTTLGNFTNSTQVVLGGSPIALDYFAPLESGGSYALAGPAELVFSNAVVITYYLVTNSAIHTQGIANDPISISIASNKTMHLFGVPTTVNASFSRSGSFVSFALEPNRPAEFTGPGALALSSGVFPPYDKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKSADLKTWSPVLMQNTSDPVKAFYRLRIER*
Ga0137374_1021634623300012204Vadose Zone SoilMKPRTAIALFIVLVSTLAHGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVALGEGGFPFPQNYFAPLENGNAYALAGPAELIFSNVVLISYYQVTNSAIFTQFVANDPIGISIASNKTMRLFGVTAPVKASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPDQRAIAGPSGSYAITVEKSFDLKAWSPVILKNTSDPNRAFYRLQIQR*
Ga0137362_1030805813300012205Vadose Zone SoilMMNSKLATALLIASISALARGETFIGPTTSTNRLLVPNNSAIVITATLGDFTNSTQVELGGFQFSQSYFAPLENGGAYALAGPAELIFSNAVLITYYRVTNSAIRTQVILNDPITLSIGSNQTMRLFGVPAPVAASFVRPGTGFVGFTLEPNRPAEITGPGTLALNSGVFFPYAKFISYFIAEDGFALPNQRFIAGPSGSYAISVEKTVDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137362_1038810013300012205Vadose Zone SoilMRPRISIVLFLALVSTLSHGETFIGPTTTTNRLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSAIRTQDIGYDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLFLNSGVFPPLGKFISYFFAEDGFVLPNQRAIAGPTGSYAIMVEKSFDLNTWSPVLLENTSDAPQAFYRLRTQR*
Ga0137362_1080713423300012205Vadose Zone SoilATNRLLVPSNSAIVIIATFGDFTNSTQVALAGFQFIQSYFAPLENGGSYALAGPAELIFSNAVVISYYQVTNSAIHTQSIGNDPIGILIASNKTMRLFGAPAPVNASFSRPGSGFVSFTLDPNRPAEFTGPGTLSLNSGVFLPYGKFISYFIAEDGFAMPSQRAIAGPSGSYAISVEKSFDMNTWSPVLLENTSDAGQAFYRLRIQR*
Ga0137362_1095556413300012205Vadose Zone SoilGKQDGGAPSSVKLGIHLAAKALPPRKGFVIMTRFAFHRKRRENPLVMKQRIAGSLLMAFVSALAHGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFQFVQSYFAPLESGNLYALAGPAELIFSNAVVISYYQITNSAIRTQSIGNDPIGISIASNKTVRLFGVPAVVNASFSRSGNGFVSFTLEPNRPAEFTGPGTLSLNSGVLPPSAKFISYFIAEDGFALPDQRAIAGPT
Ga0137380_1009259623300012206Vadose Zone SoilMMNSKFAIALFIASISTLAHGETFIGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELGGFQFSQNYFAPLENGGAYALAGPGELVFSNAVLISYYRVTNAAIRTQVILNDPIGISIGTNQTMRLFGVSAPVNASFSRPGSGFVGFTLEPNQPAEFTGPGELALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSFAIMVEKSVDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137380_1065028413300012206Vadose Zone SoilMNSKLAIALFIASISTLAHGETFVGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELSGFQFSQNYFAPLENGGAYALAGPGELIFSNAVLISYYRVTNSAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFSRPGTGFVAFTLEPNRPAEITGPGTLALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137380_1091425813300012206Vadose Zone SoilSTLAHGETFIGPTTAANRLLVSRNSAIIITATFGDFTNSTQVALGEGGFPFPQNYFAPLENGSSYALAGPAELIFSNAVLITYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0137379_1003775453300012209Vadose Zone SoilMMNSKLATALFIASISTLAHGETFVGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELSGFQFSQNYFAPLENGGAYALAGPGELIFSNAVLISYYRVTNSAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFSRPGTGFVAFTLEPNRPAEITGPGTLALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137379_1041646723300012209Vadose Zone SoilMKPRTALALFIVLVSTLAHGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNAVLISYYEVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0137379_1091909713300012209Vadose Zone SoilMFFIMKPRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGYQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRTLSIGNDPIGISIGSNQTMRLVGLPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFDLNIWSPVLLENTSDATRSFYRLRI
Ga0137379_1130637113300012209Vadose Zone SoilAVSTRVAFSWEEADNSTMKPQVAVSLLISLVSTFAHGETFIGPTSATNRLLVPSNSAIIITATFGDFTNSTHVVLDGFQFMQSYFAPLENGNSYALAGPAELIFSNAVIISYYQVTNSAIRTQSIANDPIGISIASNKTMRLFGVPAVVNASFSRSGSGSVSFTLEPSRPAEFTGPGTLSLSSGVLPPSAKFISYFIAEDGFTLPDQRAI
Ga0137378_1003254523300012210Vadose Zone SoilMNSKLAIALFVASISTLAHGETFIGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELGGFQFSQNYFAPLENGGVYALAGPGEIVFSNAVLFTYYRVTNAAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFSRPGTGFVDFTLEPNRPAEISGPGTLALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSFAISVEKTVDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137378_1023152413300012210Vadose Zone SoilMKPRTALALFIVLVSTLAHGETFIGPTTATNRLLVPRNSAIIITATFGDFTNSTQVALGEGGIPFPQNYFAPLENGSSYALAGPAELIFSNAVLISYYEVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQR*
Ga0137377_1003358723300012211Vadose Zone SoilMKPRIAVSFLIALVSTLARGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFQFMQSYFAPLDNGNSYALAGPAELIFSNAVIISYYQVTNSAIRTQSIANDPVGISIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPAGKFISYFIAEDGFTLPNQRAIAGPTGSYAITVEKSFDLNTWRPAFLQNSSDENRAFYRLRIQH*
Ga0137387_1010045023300012349Vadose Zone SoilMMNSNLATALFIASISTLAHGETFIGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELGGFQFSQNYFAPLENGGAYALAGPGELVFSNAVLISYYRVTNSAIRTQVILNDPIGISIGTNQTMRLFGVSAPVNASFSRPGSGFVGFTLEPNQPAEFTGPGELALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSYAISVEKTFDLNNWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137372_1038419823300012350Vadose Zone SoilMNSKLAIALFVASISTLAHGETFIGPTTSTNRLLVPSNSAIVITATLGDFTNSTQVELGGFQFSQNYFAPLENGGVYALAGPGEIVFSNAVLFTYYRVTNAAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFSRPGTGFVDFTLEPNRPAEISGPGTLALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAG
Ga0137386_1035012513300012351Vadose Zone SoilTNRLLVPSNSAVVITATLGDFTNSTQVELGGFQFSQSYFAPLENGGVYALAGPGEIVFSNAVLFTYYRVTNAAIRTQVILNDPIGISIGTNQTMRLFGVPAPVNASFSRPGSGFVGFTLEPNQPAEFTGPGELALNSGVFFPYAKFISYFIAEDGFAMPNQRFIAGPSGSFAIMVEKSVDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137386_1080919713300012351Vadose Zone SoilVSVSTFARAETFIGPTTSTNRLLVQSNSAIIITAAFGEFTNSTQVVLDGFQFAQSYFAPLENGSTYALAGPAELIFSNGVVITYYQITNSAIRTQSIGNDPIPISIASNKTMRLFGVPAPVNASFSRPEGGFVSFILEPNRPAEFTGPGTLALNSGVFPPYSKFISYFFAEDGFALPNQRYISGPNGSYAISVEKSFDLNAWSPVLLENTAEENKAFYRLRIQR
Ga0137366_1009574523300012354Vadose Zone SoilMKSSTALAVFIVLVSALAQGETFIGPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEDGFPFSQNYFAPLENGSSYALAGPAELIFSNVVLISYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFAVPAPVNAAFSRPGSGFVSFTLELNRPAEFTGPGSLALSSGMFFPYSKFISYFIAEGGFTVPSQRAIAGPTGSYAITVEKSFDLNTWSPALLANTSDANKAFYRLRIQH*
Ga0137366_1010825623300012354Vadose Zone SoilMKPKSAIALCIVLASMLAHGETFIGPTTVTNRLLVPSNSAIVITATLGDFTNSMRLAQGESGSSFALNYFAPLGNGDTYALAGPAELVFSNAALFTFYRVTNSAIFTQSIGNDPIGITIASNKTMRLFGVPATVNAGFLRPGSSPVSFTLEPGAPAEFTGPGSLSLSSGVFAPYIKFISYFIAEDGFALADQRAIAGPSGSYAIMVEKSLNLNTWSPVLLKNTADADQAFFRLRIQR*
Ga0137375_1071330423300012360Vadose Zone SoilMKPRTAIALFIVLVSTLAHGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVALGEGGFPFPQNYFAPLENGNAYALAGPAELIFSNVVLISYYQVTNSAIFTQFVANDPIGISIASNKTMRLFGVTAPVKASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPDQRAIAGPSGSYAITV
Ga0137360_1001306333300012361Vadose Zone SoilMKPRIAVSLLIASVSTLAHGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFQFMQSYFAPLENGNAYALAGPAELIFSNAVVISYYQVTNSAIRTQSIANDPIGITIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVFPPSAKFISYFIAEDGFTLPDQRAIAGPTGSYAITVEKSFDLNTWSPAFLQNTSDENRAFYRLRIQH*
Ga0137360_1010763723300012361Vadose Zone SoilMRPRISIVLFLALVSTLSHGETFIGPTTTTNRLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSAIRTQDIGYDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLFLNSDVFPPLGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDLNTWSPVLLENTSDAPQAFYRLRTQR*
Ga0137360_1040229223300012361Vadose Zone SoilMFFIMKSRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGYQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRTLSIGNDPIGISIGSNQTMRLVGLPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFDLNIWSPVLLENTSDATRSFYRLRIAR*
Ga0137360_1084599213300012361Vadose Zone SoilMMNSNLAIALFLGSISTFAHGETFIGPTTSTNRLLVPNNSAIVITATLGDFTNSTQVELGGFQFSQSYFAPLENGGAYALAGPAELIFSNAVLITYYRVTNSAIRTQVILNDPITLSIGSNQTMRLFGVPAPVAASFVRPGTGFVGFTLEPNRPAEITGPGTLALNSGVFFPYAKFISYFIAEDGFALPNQRFIAGPSGSYAISVEKKVD
Ga0137361_1011125723300012362Vadose Zone SoilMMNSNLAIALFLGSISTFAHGETFIGPTTSTNRLLVPNNSAIVITATLGDFTNSTQVELGGFQFSQSYFAPLENGGAYALAGPAELIFSNAVLITYYRVTNSAIRTQVILNDPITLSIGSNQTMRLFGVPAPVAASFVRPGTGFVGFTLEPNRPAEITGPGTLALNSGVFFPYAKFISYFIAEDGFALPNQRFIAGPSGSYAISVEKTFDLNTWSPVLLQNTSDDNRAFYRLRIQR*
Ga0137390_1122799213300012363Vadose Zone SoilSTVAQFRRVAFHRKRRDNSLTMKPRIAVSLLIASVSTLAHGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFLFMQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIANDPIGIAIASNKTMRLFGVPAPVSASFSRLGNGSVSFTLEPSRPAEFTGPGTLSLSSGVLPPAAKFISYFIAEDGFTLPDQRAIAGPSGSYTITVEKSFDLN
Ga0137358_1018682113300012582Vadose Zone SoilMPPGGKQDGGAPSSVKMRIHLAAKALPPRKGFVIMTRFAFHRKRRENPLVMKQRIAGSLLMAFVSALAHGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFQFMQSYFAPLESGNLYALAGPAELIFSNAVVISYYQITNSAIRTQSIGNDPIGISIASNKTVRLFGVPAVVNASFSRSGNGFVSFTLEPNRPAEFTGPGTLSLNSGVLPPSAKFISYFIAEDGFALPDQRAIAGPTGSYAITVEKSFDLSTWAPAFLQNTSDEKQAFYRLRIQH*
Ga0137398_1029995413300012683Vadose Zone SoilYLDSVPFQSRERQKLEIVSMKPRILLAFICALISTLAHAETFIGPTTATNRLLVPTNSAIIISTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPAVITFYRIMNSAIRTQDIGYDPIGIPIPTNKTMRLFGVPAPVNAAFFRPGSAYVTFTLEPNRPAEFTGPGTLSLNSGVFPPLGKFISYFFAEDGFVLPNQRVIAGPSGSFAIMVEKSFDLNSWSPVLLENTSDAPQAFYRLRTQR*
Ga0137397_1010965113300012685Vadose Zone SoilTNRLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFTPLESGNSCALAGPAELIFTNPVVITFYRIMNSAIRTQDIGYDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVSFTLEPNRPAEFTGPGTLFLNSGVFPPSGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDLNSWSPVLLETTSDAPQAFYRLRTQR*
Ga0137397_1012550123300012685Vadose Zone SoilLDTILAVSETRLENIVLMKPKPAIALFIALVSTLAHGETFIGPTTAANRLLVPSNSAIVITATFGDFTNSTQVVLGGFQFTQSYFAPLENGDSYALAGPAELIFSNAVVISYYQVTNSAIRTQSIGNDPIGIAIALNRTMRLFSVPAPVSASFSRPGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPYGKFVSYFIAEDGFTMPNQRAIAGPSGSFAIMVEKSFDLNTWSPVLLQNTSDANKAFYRLRIQR*
Ga0137394_1000809623300012922Vadose Zone SoilMKSIGAIASFLLLVSKFALGETFIGPTTPTNRLLVASNSAVIITTTLGNFTNSTQVVLGGSPIALDYFAPLESGGSYALAGPAELVFSNAVVITYYLVTNSAIHTQGIANDPISISIASNKTMHLFGVPTTVNASFSRSGSFVSFALEPNRPAEFTGPGALALSSGVFPPYDKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKSADLKTWSPVLMQNTSDPVKAFYRLRIER*
Ga0137394_1004909023300012922Vadose Zone SoilMHETATHPSSITAESPARKKKPHPRAHCRGKEKTVECFIMKPRTAIALLIALVSRLAQGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTEVVLNGFAFSQLFAPLQSGSSYALAGPAELIFSNAVAITYYQMTNSAIRTQGIANDPIGLPIASNKTMRLFGVPAPVNASLSRPGSGYVSFTLVPNQPAEFTGPATLSLSSGVFFPNGRFISYFIAEDGFVMPNQRAVAGPSGSHAIMVEKSFDLNTWSPVLLENTADADRAFYRLRIQR*
Ga0137394_1012627523300012922Vadose Zone SoilMKPGSVIAFCMALGSAQAFSETFVGPTTSTNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSYFAPLETGGVYALAGPAELIFSNAVLITYYQITNSAIRTQSIGNDPIPISIASNKTVRLFGVPAPVNASFSASGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPYGKFISYFIAEDGFALPDQRAIVGPSGSLAIIIEKSADLKTWSPVLMQNTSDDLKAFYRLRIER*
Ga0137394_1059240013300012922Vadose Zone SoilMRPRTSIVLFLALVSTLSHAETFIGPTTSTNHLLVPTNSAIIITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVAITFYRIMNSAIRTQDIGYDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLSLNSGVFPPLGKFISYFFAEDGFVLPNQRAIAGPTGSYAI
Ga0137359_1000712823300012923Vadose Zone SoilMKPGSVIAFCMALGSAQAFSETFVGPTTSTNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSYFAPLETGGVYALAGPAELIFSNAVLITYYQITNSAIRTQSIGNDPIPISIASNKTVRLFGVPAPVNASFSASGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPDGKFISYFIAEDGFALPDQRAIVGPSGSLAIIIEKSADLKTWSPVLMQNTSDDVKAFYRLRIER*
Ga0137359_1005195033300012923Vadose Zone SoilMFFIMKSRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGYQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRTLSIGNDPIGISIGSNQTMRLVGLPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFNLDTWSPVLLENTSDATRAFYRLRIAR*
Ga0137359_1038114223300012923Vadose Zone SoilMKPGTALGLFIAFISTLSAGETFIGPTTATNHLVIPTNSAIVITATFGDFTNSTQVALTGIQFTLNYFAPLEYGNTYALAGPAELIFSNAVVISYYRLTNSAIYTQWVGNDPIGIPIASNKTMRLFSVPTPVNASFSRQEGGFVSFTLEPNRPAEFTGPGTLALNSGVFPPMGKFVSYFIAEDGFTLSGQRTISGPSGSYAIAVEKSYDLNTWSPVLLENTSDPTKAFYRLRIQR*
Ga0137404_1001598513300012929Vadose Zone SoilMKPQFAIAPLIAFISILAHSETFVGPTTADNRLLIASNSAVIITATFGDFTNSTQVVLSGFQFTQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIGNDPIGISIASNKTVRLFGVPAVVNASFSRSGNGFVSFTLEPNRPAEFTGPGTLSLNSGVLPPSAKFISYFIAEDGFALPDQRAIAGPTGSYAITVEKSFDLSTWAPAFLQNTSDEKQAFYRLRIQH*
Ga0137404_1005089043300012929Vadose Zone SoilMFRHGRFDADFGVDDVIVGNYCSWMKSAAPIVLILALASTDVFGETFIGPTTATNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSSFAPLETGATYALAGPAELVFSNAVLITYYQITNSTIRTHSIGNDPVPISIASNKTMRLFGVPAPVNANFSGPGNSSVSFTLEPNRPAEFTGPGILDLNSGVFPPNGKFISYFVAEDGFALPDQRVIAGPSGSFAIMVEKSADLKSWSPVLMQNTSDVVKAFYRLRIER*
Ga0137404_1016743313300012929Vadose Zone SoilVKPIASLASLLLLVSKFALGETFIGPTTPTNRLLVASNSAVIITTTLGNFTNSTQVVLGGSPIALDYFAPLENGGSYALAGPAELVFSNAVTITYYAVTNSAIRTQFIANDPISISIASNKTMHLFGVPTTVNASFSRSGSFVSFALEPNRPAEFTGPGALALSSGVFPPYAKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKSADLKTWSPVLMQNTS
Ga0137404_1103926213300012929Vadose Zone SoilRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGLQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRSLWIGNDPIGISIGSNQTMRLVGLPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFNLDTWSPVLLENTSDATRAFYRLRIAR*
Ga0137407_1085506913300012930Vadose Zone SoilMLARGETFIGPTTSTNRILVPSNSAIVITATFGDFTNSTQVVLAGYQFTQSYFAPLENGNSYALAGPAELIFSNAVVISYYQITNSAIRTQSIGNDPIGISIASNKTVRLFGVPAVVNASFSRSGNGFVSFTLEPNRPAEFTGPGTLSLNSGVLPPSAKFISYFIAEDGFALPDQRAIAGPTGSYAITVEKSFDLSTWTPAFLQNTSDEKQAFYRLRIQH*
Ga0137407_1093016113300012930Vadose Zone SoilQRLTRQLFFIMKPRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGYQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRSLWIGNDPIGISIGSNQTMRLVGVPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFDLNIWSPVLLENTSDATRSFYRLRIAR*
Ga0137410_1011246423300012944Vadose Zone SoilMKQRIAIFFLMALVSTVTRGETFIGPTSPTNRLLVPSNSAIIITTTLGDFTNSTEVVLDGFQFAQNYFAPLENGNSYALAGPAELIFSNAVVISYYQVTNSAIRTQSIANDPIGITIASNKTMRLFGVPAPVSASFSRAGNGFVSFTLEPSRPAEFTGPGTLSLSSGVLPPAAKFISYFIAEDGFTLPDQRAIAGPTGSYAITVEKSFDLNTWSPAFLQNTADENRAFYRLRIQH*
Ga0137410_1013624523300012944Vadose Zone SoilMKPGSVIAFCMALGSAQAFSETFVGPTTSTNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSYFAPLETGGVYALAGPAELIFSNAVLITYYQITNSAIRTQSIGNDPIPISIASNKTVRLFGVPAPVNATFSASGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPYGKFISYFIAEDGFALPDQRALVGPSGRLAIIIEKSADLKTWSPVLMQNTSDDVKAFYRLRIER*
Ga0137410_1026619923300012944Vadose Zone SoilMKPIGAIASFLLLVSKLALGETFIGPTTPTNRLLVASNSAVIITTTLGNFTNSTQVVLGGSPIALDYFSPLENGGSYAFAGPAELVFSNAVVITYYLVTNSAIHTQWIANDPISISIASNKTMHLFGVPTTVNASFSRSDSFVSFALEPNRPAEFTGPGALALSSGVFPPYAKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKTADLKTWSPVLMQNTSDPVKAFYRLRIER*
Ga0134077_1042732713300012972Grasslands SoilFVALVSTLAFGETFIGPTTATNRLLVSSNSAIVITATFGNFTNSTQVALGEGGFPFSQNYFAPLENGNAYALAGPAELIFSNVVLISYYQVTNSAIFTQFIANDPIGISIASNKTMRLFGVTAPVNASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYYKFISYFIAEDGFALPDQRAIGGPSG
(restricted) Ga0172371_1055839613300013138FreshwaterFMKPICLIPLLLGVAIHAETFLGPTTATNRLEVPTNSAIIITATFGDFTNSTQVRMGDGAPFILSYFAPLQNGTVYALAGPAELIFSNAVLITFYRLTNSAIHTQGIANDPIGIPIASNRTMHLFGVPAEVNATFTRIDGGSLSFALEPNQPAEFTGPGTLFLNSGVLPPSAKFISYFFDEDGFTLPDLRAISGPSGSFAVSVEKSLDLQTWSPILLQNATDPTLAFYRLRIQR*
Ga0157374_1036747023300013296Miscanthus RhizosphereMNPKNVISVLCAFIAMLAHAETFVGPTTATNRLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR*
Ga0181539_112774713300014151BogMRYIIRAIVLFSGLAATLAHGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQIDGAFAFTQSSFAPLQNGTVYAIPGPGELIFSNAMLFTFYRLTNSGILFQGIANDPIGVPIASNATMHLFGVPAPVNASFVRQSGAGGGALSFVLQPNQPAEFTGPGILFLNSGVFPPLAKFITYFIAQAGSSLPGQQAITAPTGSYAITVDQSVDLNTWSPIWMGTTSSPAAAFYRFRIQR
Ga0181518_1003721613300014156BogMKTFRVIALFIGLGGLLAEAETFIGPTTSTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGCFAISVEKSVDLNTWSPVLMGNTADPVKAYYRLQIQR
Ga0181521_1000950913300014158BogTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMGNTADPVKAYYRLQIQR*
Ga0181530_1001121253300014159BogMKPICTIALILGLAVKLAYGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTQVQLGTGTPFIQSYFAPLESGTVYALAGPAELIFSNTVLFTFYRLTNSAIYSQGIANDPIGISIASNKTMHVFGVPACVNASFTRPTSAGGGSLSFTLEPNHPAEFTGPGTLFFNSGVFFPYAKFISYFFDEDGFTLPDLRTIAGPSGSFAISVEKSVDLQSWSPVLLQNTSDPIKAFYRLQIQH*
Ga0181530_1034133113300014159BogGKKRDNIIIMKTRTVITLFISAASILSHAETFIGPTTPTNRLLVSGNSAIIITATFGNFTNSTTVAIGGTPIPQNYFAPLANGTSYALAGPAELVFSNAVVITYYQVTNASIFTQSIANDPITIPIATNKTMRLFGVWAPVNASFSTPGSGSVSFILEPDQPAEFTGPGTLSLNSGVFPPYGKFISYFIAEDGFAIPNQRFIAGPSGSYAITVEKSFDLNTWSPVLLGNTSDDANAFYRLRIQH*
Ga0181538_1000407883300014162BogMKTFRVIALFIGLGGLLAEAETFIGPTTSTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMGNTADPVKAYYRLQIQR
Ga0181526_1004041413300014200BogMKTFRVIALFIALGGLLAEAETFIGPTTSTNRLLVPTNAAIIITTTLGDFPNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMGNTA
Ga0163163_1004154723300014325Switchgrass RhizosphereMNPKNVIAVLCAFIAMLAHAETFVGPTTATNRLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVAFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRAIVGPSGSYAIIVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR*
Ga0182010_1035327223300014490FenMKLICSIALFIALAATPAKSETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQLELGGGVPFTQSYFAPLENGAIYALAGPAELIFSNAVLFTFYRLTNSAIYSQGILNDPIGIPIASDKTMRVFGVPAPVNASFERPISAGGGALSFILEPNRPAEFTGPGTLSLNSGVFFPYAKFISYFIAEDGFTLPDQRAIAGPSGSFAISVE
Ga0182016_1002602063300014493BogMKLGIAIAFFVVSNSTLSPAETFIGPSTVSNRLLVASNAAIIITSIFGDFTNSTVVAQGDTQIPQTYFAPLESGSSYALAGPAELIFSNAVAITYYQLTNASIFTQSIANDPIGITVASNKTMRLFSVWAPVNASFSVPGTGSVSFVLQPNQPAEFTGPGTLALNSGVLPPYGKFISYFYAEDGFAIPDERFIAGPTGSFAITVEKSFDLNKWSPVLLGNTSDATSAYYRLRIQH*
Ga0182017_1037492313300014494FenMKLICSIALFIALAATPAKSETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQLELGGGVPFSLSYFAPLENGAIYALAGPAELIFSNAVLFTFYRLTNSAIYSQGILNDPIGIPIASNKTMRVFEVPAPVNASFERPISAGGGALSFILEPNRPAEFTGPGTLSLNSGVFFPYAKFISYFIAEDGFTLPDQRAIAGPSGSFAISVERSVDLNTWSPVLLH
Ga0182015_1006687823300014495PalsaMKLGTAIALLIASASTLSHAETFVGPTTPANHLLVCSNSAIIITATFGDFTNSTAVAIGGAQIPQNYFAPLANGNSYALAGPAELIFSNAVAITYYQVTNSSIFTQGIANDPIGIQIATNKTMRLFSVWAPVNASFSRPGSGSVSFVLEPDQPAEFTGPGTLFLNSGVFPPYAKFISYFIAEDGFTIPNQRFIAGPTGSYAVTVEKSFDLNTWSPVLLGNTSDATNAYYRLRVQH*
Ga0182011_1001918823300014496FenMKLICSIALFIALAATPAKSETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQLELGGGVPFTQSYFAPLENGAIYALAGPAQLIFSNAVLFTFYRLTNSAIYSQGILNDPIGIPIASDKTMRVFGVPAPVNASFERPISAGGGALSFILEPNRPAEFTGPGTLSLNSGVFFPYAKFISYFIAEDGFTLPDQRAIAGPSGSFAISVERSVDLNTWSPVLLHNTSDPAKAYYRLRIQH*
Ga0182019_1054297113300014498FenTATFGDFTNSTQLELGGGVPFSLSYFAPLENGAIYALAGPAELVFSNAVLFTFYRLTNSAIYSQGILNDPIGIPIASNKTMRVFGVPAPVNASFERPISAGGGALSFILEPNRPAEFTGPGTLSLNSGVFFPYAKFISYFIAEDGFTLPDQRAIAGPSGSFAISVERSVDLNTWSPVLLHNTSDPAKAYYRLRIQH*
Ga0182024_1146313613300014501PermafrostMKLGTPLALFIASASTLSYAATFVGPTTTTNHLLVSSNSAIIITATLGNFTNSTAVIVGGGTFPQNYFAPLASGDSYALAGPAELVFSNAVAITYYQVTNSAILTQGVANDPIGIQIATNQTMRLFGVWAPVNASFSGPAGFVSFTLEPDQPAEFTGPGTLFLNSGVLPPYIKFISYFIAENGFAIPNEPFIAGPTGS
Ga0182021_1018084223300014502FenMKLICSIALFIALAATPAKSETFIGPTTSTNRLLVPTNSAIIITATLGDFTNSTQLELGGGVPFTQSYFAPLKNGAIYALAGPAELIFSNAVLFTFYRLTNSAIYSQGILNDPIGIPIASDKTMRVFGVPAPVNASFERPISAGGGALSFILEPNRPAEFTGPGTLSLNSGVFFPYAKFISYFIAEDGFTLPDQRAIAGPSGSFAISVERSVDLNTWSPVLLHNTSDPAKPYYRLRIQH*
Ga0182030_10011390123300014838BogMAGPAAHGETFIGPTTATNRLLVPTNSAIIITATLGDFTNSTQIQVNGGFAFTQSYFAPLQNGTVYAVPGPAELIFPNPMLFTFYRLTNSGILFQGIANDPIGVPVATNTTIHLFGVPSPVGASFTRATTASGGTLSFTLQPNQPAEFTGPGTLYLNSGVLPPLGTFISYFVAAPGCILPGQQAILGPTGNYAITLDQSADLNTWSPIWMETTSNPSRAFYRLRVQR*
Ga0182027_1066849223300014839FenMKPICLVAVSLAVAAQLARGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTKVQLDGGTPFTQSYFAPLENGTAYALAGPAELIFSNTVLFTFYRLTNTAIYTQGIANDPIGIPIASNKTMHLFGVPAPTSASFTRLNSDGGGTLSFTLAPNAPAEFSGPGTLFLNSGVFPPYAKFISYFYEEDGFAIPGQRAIAGPSGSFAISVEKSGDLKSWSPVLLENSSDPANAYYRLRIQR*
Ga0157379_1060149223300014968Switchgrass RhizosphereMNPKNVIAVLCAFIAMLAHAETFVGPTTATNRLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVAFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR*
Ga0137418_1036852923300015241Vadose Zone SoilMKPGSVIAFCIALGSAQAFSETFVGPTTSTNRLLVASNSAIIITATFGDFTNSTQVVLGGFQFTQSYFAPLETGGVYALAGPAELIFSNAVLITYYQITNSAIRTQSIGNDPIPISIASNKTVRLFAVPAPVNASFSASGSGFVSFTLEPNRPAEFTGPGTLALNSGVFPPYGKFISYFIAEDGFALPDQRAIVGPSGSLAIIIEKSADLKTWSPVLMQNTSDDVKAFY
Ga0137409_1068418523300015245Vadose Zone SoilMKPIGAIASFLLLVSKLALGETFIGPTTPTNRLLVASNSAVIITTTLGNFTNSTQVVLGGSPIALDYFAPLENGGSYAFAGPAELVFSNAVVITYYLVTNSAIHTQWIANDPISISIASNKTMHLFGVPTTVNASFSRSGSFVSFALEPNRPAEFTGPGALALSSGVFPPYAKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKSADLKTWTPAIMQNTSDPVKAFYRLRIER*
Ga0137403_1001172533300015264Vadose Zone SoilMFFIMKSRTPIALFIAIVSTLAYGETFIGPTTATNRLLIPSNSAIIITATFGDFTNSTQVVLGGYQFAQSYFAPLENGNSYALAGPAELIFSNAVLITYYQLTNSAIRSLWIGNDPIGISIGSNQTMRLVGLPAPMNASFHLGSGWANFTLQPNQPAEFTGPGELTLANGVPPAYGKFISYFIAEDGFVLPNQRAIVGPSGSYAVTVEKSFNLDTWSPVLLENTSDATRAFYRLRIAR*
Ga0137403_1002961753300015264Vadose Zone SoilMKQRIAGSLLMAFVSALAHGETFIGPTTATNRLLVPSNSAIIITATLGDFTNSTQVVLDGFQFMQSYFAPLESGNLYALAGPAELIFSNAVVISYYQITNSAIRTQSIGNDPIGISIASNKTVRLFGVPAVVNASFSRSGNGFVSFTLEPNRPAEFTGPGTLSLNSGVLPPSAKFISYFIAEDGFALPDQRAIAGPTGSYAITVEKSFDLSTWTPAFLQNTSDEKQAFYRLRIQH*
Ga0137403_1012850533300015264Vadose Zone SoilVKPIASLASLLLLVSKFALGETFIGPTTPTNRLLVASNSAVIITTTLGNFTNSTQVVLGGSPIALDYFAPLENGGSYALAGPAELVFSNAVTITYYAVTNSAIRTQFIANDPISISIASNKTMHLFGVPTTVNASFSRSGSFVSFALEPNRPAEFTGPGALALSSGVFPPYAKFISYYIAEDGFALPDQRAISGPSGSFAVTVEKSADLKTWSPVLMQNTSDPVKAFYRLRIER*
Ga0132258_1187187413300015371Arabidopsis RhizosphereMKPRILLALFLASISWLVQGETFIGPTTTTNRLLVPTNSAIIITTTLGDFTNSTQIVLQGYRFTQSYFAPLESGNSYAVAGPAELVFTNPVVITFYRIMNSAIRSQSIGNDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLSLASGVFPPFGKFISYFFAEDGFILPNQRAIAGPSGSYAIMVEKSFDLTSWSPVLLETTSDAPQAFYRLRTQR*
Ga0132258_1272247923300015371Arabidopsis RhizosphereMKTKLSITLWIILPSILAQGETFIGPTTATNRLLVSSNSAIIITATLGDFTNSTSLAQGQGGSPFPLPYFAPLESGDTYALAGPAELVFSNTALFTFYRMTNSAIFTQSIANDPIGVSIASNKTIRLFGVPATVNANFVRPGSSPVSFTLEPGKPAEFTGPGSLSLSSGVLPPYIKFISYFIAEDGFALPDQRIAAGPSGSYSVMVEKSLDLNLWSPVLLHNTSDSDKAFFRLRIQR*
Ga0132255_10009277853300015374Arabidopsis RhizosphereMKHQIWLTFILAVIPTLAHGETFIGPTSATNRLLVPTNSAIVITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFTNPVAITFFRIVNSAIRSQGIGYDPIGISIPTNKTMRLFGVPAPVSASFSRLGTGSVSFTLQPNQPAEFTGPGTLFLNSGVFPPSGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDMNSWSPVLLETTSDAPQAFYRLRTQR*
Ga0132255_10315980113300015374Arabidopsis RhizosphereTNSAIIITTTLGDFTNSTQIVLQGYRFTQSYFAPLESGNSYAVAGPAELVFTNPVVITFYRIMNSAIRSQSIGNDPIGISIPTNKTMRLFGVPAPVNAAFSRPGSGYVTFTLEPNRPAEFTGPGTLSLASGVFPPFGKFISYFFAEDGFVLPNQRAIAGPSGSYAIMVEKSFDLTSWSPVLLETTSDAPQAFYRLRTQR*
Ga0187849_101591933300017929PeatlandLTGAEPRVVLYLMRPIICAIALFLALAATPACGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGYPFTQSFFAPLEHGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPLGIPLASNQTMHVFGAPAPVNASFVRQSSEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFFPNAKFISYFIDQDGYALPGQQAVAGPTGSFAISLDKSVDLNTWSPVWMQTASDSAKAFYRLRIQH
Ga0187877_100764833300017931PeatlandMRPIICAIALFLALAATPACGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGYPFTQSFFAPLEHGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPLGIPLASNQTMHVFGAPAPVNASFVRQSSEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFFPNAKFISYFIDQDGYALPGQQAVAGPTGSFAISLDKSVDLNTWSPVWMQTASDSAKAFYRLRIQH
Ga0187877_100921933300017931PeatlandVRTATDLCPVDKRSDGISVFPMKPICLVAVSLAVAAQFARGETFIGPTTATNRLLVPTNSAIIVTATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEKSVDLKTWSPVLLQNNSDPANAYYRLRIQR
Ga0187848_1017942723300017935PeatlandTLAGAETFIGPTTPTNRLLVPTNSAVIITATFGDFTNSTQVQLGGGGSFTQPSFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPIGIAIPSNQTMHVFGVPAPVNASFVRQASEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFPPNAKFISYFIDQGGNALPGQQAIAGPTGSFAITVDKSVDLNTWLPIWMQTASDPASAFYRLRIQR
Ga0187878_110377013300018005PeatlandMKSICMITLFIALAGTLAYGETFIGPSTSTNRLLVPTNSAIIITGTFGEFTNSTQVLLGDGGTAFIQSHFAPLENGTVYALAGPGELIFSNAVLFTFYRLTNSAIYSLGIANDPIGIPVASNKTMHVFGVPGPVNASFSRPDSAGGGTLTFMLEPNQPAEFTGPGTLYLNSGVLPPYTKFISYFIAEDGFAMPGQRAIAGPSGSFAISVEKSVDLNTWSPVLMENTADPVKAFYRLRIER
Ga0187884_1027116713300018009PeatlandALAASLADGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGYSSTQSFAPLGNGTVYALAGPAELIFSNAVLFTFYRLTNSAIVSQGIANDPIGIPIASNNTMHVFAVPAPVNASFVRLSSDGGGALSFVLAPNSPAEFTGPGTLFLNSGVLPPNARFISYFIDQGGYALPGQQAIAGPTGNFAISLDKSVDLNTWSPIWMGTASDPAKAFYRLRIQR
Ga0187873_105875133300018013PeatlandMKPICLVAVSLAVAAQFARGETFIGPTTATNRLLVPTNSAIIVTATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEKSADMNTWSPVLLGNTSDPINAFYRLRIQH
Ga0187861_1030500613300018020PeatlandTATNRLLVPTNSAIIVTATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEKSVDLKTWSPVLLQNNSDPANAYYRLRIQR
Ga0187857_10002027103300018026PeatlandLTGAEPRVVLYLMRPIICAIALFLALAATPACGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLAGGYPFTQSFFAPLEHGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPLGIPLASNQTMHVFGAPAPVNASFVRQSSEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFFPNAKFISYFIDQDGYALPGQQAVAGPTGSFAISLDKSVDLNTWSPVWMQTASDSAKAFYRLRIQH
Ga0187857_1003284333300018026PeatlandMKPICLVAVSLAVAAQFARGETFIGPTTATNRLLVPTNSAIIVTATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIAKDPISIPITSNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNMPAEFSGPGTLFLNSGVFPPYAKFLSYFMEEDGFALPDQRAIAGPSGSFAISVEKSVDLKTWSPVLLQNNSDPANAYYRLRIQR
Ga0187867_1001528643300018033PeatlandMKTFRVIALFIALGGLLAEAETFIGPTTSTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMGNTADPVKAYYRLQIQR
Ga0187862_1057128713300018040PeatlandQIQLDGFVFTQSSIAPLQNGTVYAIPGPAELIFSNAMLFTFYRLTNSGIQLQGIANDPIGIPIASNATMHVFGVPAPVNASFIRDSSAGGGSLSFTLQPNQPAEFTGPGTLNLNSGVFPPLSKFITYVIAQAGSLVPGGQAVAGPTGSYAITLDQSPDLNSWSPVWMGTTSSPAAVFYRVRIQR
Ga0187859_1019918013300018047PeatlandMKTFRVIALFIALGGLLAEAETFIGPTTSTNRLLVPTNAAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGALSFMLEPNSPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSP
Ga0187858_1034003713300018057PeatlandTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGYPFTQSFFAPLEHGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPLGIPLASNQTMHVFGAPAPVNASFVRQSSEGGGALSFVLAPNSPAEFTGPGTLFLNSGVFFPNAKFISYFIDQDGYALPGQQAVAGPTGSFAISLDKSVDLNTWSPVWMQTASDSAKAFYRLRIQH
Ga0184618_1026656213300018071Groundwater SedimentQGDKNSRKLPSRGFLSYAPTTATNRLLVPSNSAIIITATFGDFTNSTQVALGEGGFPFRQNYFAPLESGSSYALAGPAELIFSNVVLITYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAAFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFTLPSQRALAGPTGSYAITVEKSFDLNTWSPALLANISDANKAFYRLRIQR
Ga0184628_1000517413300018083Groundwater SedimentTLALFIVLVSTLAQGETFIGPTTATNRLVVPSNSAIIITTTLGDFTNSVGLAQGQGGYPFPLDYFAPLESGNTYALAGPSELVLSNVALFTFYRLTNAAIFTQSIGNDPVGVSIASNRTMRLFGVPTRVNANFVRPGSSPVSFTLEPGKPAEFTGPGSLSLSSGVFASYIKFISYFIAEDGFALPDQRVVTGPTGSYSVMVEKSLDLNSWTPVLLKSTSDADHAFFRLRIER
Ga0184628_1003404513300018083Groundwater SedimentAGRVSGLHSKAAGPAWLRSSSDGITISVAVRVIRLKNVLIMKSRTAITLFIVSVSTLAQGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVAFGEGGIPFPQNYFAPLENGNAYALAGPAELIFSNAALISYYLVTNSAIFTQYIANDPVGISIASNKTMRLFGVTAPVNVSFSRPGSSFVSFTLEPNRPAEFTGPGTLMLSSGVLPPYIKFISYFIAEDGFALPNQRVIAGPSGSYAITVEKSFDLKAWSPVILENTSDANKAFYRLRIQRERGLYNRRPELEAAMSISLHFVRGRRRASEADCY
Ga0184629_1007920423300018084Groundwater SedimentMKPIITLALFIVLVSTLAQGETFIGPTTATNRLLVASNSAIIITATLGDFTNSTQLAQGESGFPFALNYFAPLENGNTYALAGPAELVFSNAVLFTFYRVTNSAIFTQSIANDPIVIQIASNKTMRLFGVPATVNAGFSRPGSSFVSFTLEPNRPAEFTGPGSLQLSSGVFFPYSKFISYFIAEDGFALPDQRAIAGPSGSYAIIVEKSFDLNMWSPVLLKNTSDADKAFFRLRIQR
Ga0182028_142729133300019788FenVDKRSDGISVFPMKPIRLVAVSLAVAAQLARGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTKVQLDGGTPFTQSYFAPLENGTAYALAGPAELIFSNTVLFTFYRLTNTAIYTQGIANDPIGIPIASNKTMHLFGVPAPTSASFTRLNSDVGGTLSFTLAPNAPAEFSGPGTLFLNSGVFPPYAKFISYFYEEDGFAMPGQRPSPARAEALPLAWRSPGI
Ga0193712_107154813300019880SoilMKPITALALFIELVSALAHGETFIGPTTDTNRLLVPGNSAIIITATFGDFTNSTRVALGEVGFPFSLNYFAPLENGSSYALAGPAELIFSNAVLITYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFPPYSKFISYFIAEDGFTLPSQQVMAGPTGSHAITVEKSFDLNTWSPALLANTSDANRAFYRLRI
Ga0193729_100722433300019887SoilMPSSARIGVALIAKMTEGALIMKARTAMALFIGLTSSFTHGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVALADFQFTQSYFAPLEIGNSYALAGPAELIFSNAVVITYYQVTNSAIRTQSIGNDPIGIPIATNKTMRLFGVPAPVNASFSRPGSGSLSFILEPNQPAEFTGPGTLALSSGVFPPYGKFISYFFAEDGFTLPNQRAIAGPSGSYAVTVEKSFDLNTWLPVLLENTSDANKAFYRLRIQR
Ga0193728_101985553300019890SoilMPSSARIGVALIAKMTEGALIMKARTAMAPFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQVALADFQFTQSYFAPLEIGNSYALAGPAELIFSNAVVITYYQVTNSAIRTQSIGNDPIGIPIATNKTMRLFGVPAPVNASFSRPGSGSLSFILEPNQPAEFTGPGTLALSSGVFPPYGKFISYFFAEDGFTLPNQRAIAGPSGSYAVTVEKSFDLNTWLPVLLENTSDANKAFYRLRIQR
Ga0193730_106101813300020002SoilRHLRSFNMKPGIAIALFIALVSTLARGETFIGPTTTTNRLLVPSNSAIVITATFGDFTNSTEVVLDGFQFTQSYFAPLENGNAYALAGPAELIFSNAVLISYYQITNSAIRSQSIANDPIGISIASNKTMRLFSVPASVNASFSRPGSSFVPFTLEPNRPAEFTGPGTLTLNSGVFPPYSKFISYFIAEDGFTMPNQRAIAGPAGSYAITVEKSFDLNTWSPVLLQNTSDANKAFYRLRIQP
Ga0193755_117407313300020004SoilKQRIAIFFLMALVSTVTRGETFIGPTSPTNRLLVPSNSAIIITPTLGDFTNSTQVVLDGFQFAQNYFAPLENGNSYALAGPAELIFSNAVVISYYQVTNSAIRTQSIANDPIGITIASNKTMRLFGVPAPVSASFSRAGNGFVSFTLEPSRPAEFTGPGTLSLSSGVLPPAAKFISYFIAEDGFTLPDQRAIAGPTGSYAITVEKSFDLTTW
Ga0210380_1009678023300021082Groundwater SedimentMNAKFPIALFLVLASTLAQGETFIGPTTATNRLVVPSNSAIIITTTLGDFTNSVGLAQGQGGYPFPLDYFAPLESGNTYALAGPSELVLSNVALFTFYRLTNAAIFTQSIGNDPVGVSIASNRTMRLFGVPTRVNANFVRPGSSPVSFTLEPGKPAEFTGPGSLSLSSGVFASYIKFISYFIAEDGFALPDQRVVTGPTGSYSVMVEKSLDLNSWTPVLLKSTSDADHAFFRLRIER
Ga0193719_1015579623300021344SoilISLEAVFIMKPRTAVTLFVALVSTLAHGESFIGPTTATNRLVVPTNSAIIITATFGDFTNSTRVALGEGSFPFSLNYFAPLENGSSYALAGPAELIFSNTVLITYYQVTNSAIRTQFIANDPIVIPIASNKTMRLFGVPAPVNAGFSRPGSGFVSFTLEPNRPAEFTGPGSLQLSSGVFPPYSKFISYFIAEDGFTLPSQQVMAGPTGSHAITVEKSFDLNTWSPALLANTSDANRAFYRLRIQR
Ga0193709_1000086103300021411SoilMNPRPAIAIFVALISTLAHAETFIGPTTATNRLLVPSNSAIVITTTLGDFTNSTQVVLQGYQFTQSYFAPLESGNSYALAGPAELIFSNAVVITFYRVMNSAIHSQGIANDPIGIQVPTNKTMRLFGVPEPVNASFSRPGGGFASFILQPNQPAEFTGPGTLSLNSGVFPPYSKFISYFIAEDGFVLPNQQAIAGPSGSYTIMVEKSFDMNTWSPVLMENTDDAPQAYYRLRTQR
Ga0193750_101805923300021413SoilMKPRTLTALFIALVSTPAHGETFIGPTTATNRLLVPSNSAIVITATFGDFTNSTQIVLDGFQFTQSYFAPLENGNSYALAGPAELIFSNAVLITYYQVTNSAIRTQSIGNDPIGIMIATGKTMRLFSVPGPVNASFSRPGSGFVSFTLEPNRPAEFTGPGTLALNSGVFLPYSKFVSYFIAEDGFTILNQRAIAGPSGSYAITVEKSFDLNTWSPVLLENTSDANHAFYRLWIQR
Ga0210394_1002066113300021420SoilMKPTCAIALLAVTARLACGGTFIGPTTSTNRLLIPTNSAIIITAIFGDFTNSTQIQFDGGFSFTQPDFAPLESGTTYALAGPAELICSNTVLLTFYPLTNSAIRTQVILNDPIGIPIASNKTMHLFGVPASVSASFSRLDSTGGGALSFTLEPNSAAEFTGPGTLYLNSGVFFPYAKFISYFIAEDGIVLPGQRAMAGPSGSFAISVEKSLDLMNWSPVLMGNSSDPTKAFYRLQIQR
Ga0224533_101016013300022526SoilLVPTNSAIIITTTFGDFTNSTQVQLGDGGTPFTQSYFAPLENRTVYAVAGPAELIFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIDSNKTMHVFGVPAPVNASFERPISAGGGTLGFVLAPNGPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMENTSDPVKAYYRLQIQR
Ga0224555_100632783300023088SoilMKPRTAIVLFIALVSAFAHGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTQLAYAGDQFPLNYFAPLENGSAYALAGPMELIFSNAVLITYYQVTNSAILTQTIGNDPIPITIGSNQTVRLFGVPVSVNASFSRSGSGSVSFTLEPNQPAEFSGPGTLSLSSGVLPPYVKFISYFIAEDGFAIPNQRAIAGPSGSYAITVEKSFDLNTWSPVLLENTSDANQAFYRLKIQR
Ga0247695_102956513300024179SoilLCPLVPALAWGDTFIGPTTATNRLLVASNSAIIITTTLGDFTNSTQVLLGPGGVPFTQSYFAPLESGNSYAVAGPAELIFSNSVLFTFFRVTNSNIRSQGIANDPIGVVVPTNKTMHLFGVPGPVNASFQNSVTGRSISFTLQPNQPAEFCGPGILSLNSGVFPPYAKFISYYFAEDGFAMPGQRAIAGPTGSFGIIVEQSADLNAWSPVLLQNTSDATQAYYRLRIAR
Ga0247695_103899213300024179SoilRQRARAGNLYRQPLIDTPKVKSSVSGMKPIHIIALFLGVSVRLACAETFIGPTTATNRLLVPGNSAIIITTTLGDFTNSTQVVLGGGAPFTQSYFVPLQNGTVYAVAGPAELLFSNAVLFTFFRLTNSAIHSQGIANDPIGISIPANKTMHVFGVPAEVNASFVRPAAAGGGSLSFKLEPNAPAEFTGPGTLSLNSGVFPPYAKFISYFFDEDGFTLSDMRTISGPS
Ga0209584_1009475513300025878Arctic Peat SoilMKTFSLIALFMALGGLLAEGETFIGPTTSTNRLLVPTNSAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPFEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGTLSFMLEPNGPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRTIAGPSGSFAISVEKSVDLNTWSPVLMENTADPVKAYYRLQIQR
Ga0207711_1126127113300025941Switchgrass RhizosphereIALVSTPAPGETFIGPTTSSNRLLVPSNSALIITATFGDFTNNTQVALGEGGLPFLQSYFAPLENGNSYALAGPAELIFSNAALISFYQVTNSAIRTQFIANDPIVIPIAANKTMRLFGVPAPVPAGFSRPGSGFVSFTLEPNQPAEFTGPGNLQLSSGVLFPYSKFISYFIAEDGFTLPNQRVIAGPSGSYVITVEKSFDLNTWSPVLLQNTSDADKAFYRLRIQ
Ga0207702_1123635313300026078Corn RhizosphereMKPRILFALLFALISRLVHGETFVGPTTGTNRLLVPSNSAIVITTTLGDFTNSTQVVLQDYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSSIHSQGIGNDPIGIQIPTNKTMRLFGVSAPVNASFSRPGSGFVSFILEPNHPAEFTGPGTLSLNSGVFPPMGKFISYFIAEDGFIVPNQRAMAGPSGSYSIMVEKSFDMIAWSPVLLENTADMPQAF
Ga0207676_1078501523300026095Switchgrass RhizosphereATNHLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR
Ga0207676_1119620113300026095Switchgrass RhizospherePPSLGIWVASRFAFAVIGKRLENASIMKSRTAIALFIASVSTLALGETFIGPTTATNRLLVQSNSAIVITATFGDFTNSTQVALGEGSFAFPLNYFAPLEHGNTYALAGPAELIFSNAVLISYYPVTNSAIFTQFIANDPVGISIASNKTMRLFGVTAPVSASFSRPGSSFVSFTLEPNRPAEFTGPGILSLNSGVLPPYFKFISYFIAEDGFALPNQRAIAGPTGSYAITVEKSFDLKAWSPVILENTS
Ga0207674_1001111383300026116Corn RhizosphereMKPRILFALLFALISRLVHGETFVGPTTGTNRLLVPSNSAIVITTTLGDFTNSTQVVLQDYQFTQSYFAPLESGNSYALAGPAELIFTNPVVITFYRIMNSSIHSQGIGNDPIGIQIPTNKTMRLFGVSAPVNASFSRPGSGFVSFILEPNHPAEFTGPGTLSLNSGVFPPMGKFISYFIAEDGFIVPNQRAMAGPSGSYSIMVEKSFDMIAWSPVLLENTADMPQAFYRLRTQH
Ga0207683_1012865523300026121Miscanthus RhizosphereMNTKWSIALCLVFTSTLAQGETFIGPTTATNRLLVPSNSAIIIAATLGDFTNSMRLAQGQGGSPFPLDYFAPLENGNTYALAGPSELVFSNAALFTFYRVTNAAIFTQSIANDPIGRSIASNQTIRLFGVPATVNANFVRPGSAPISFTLEPGKPAEFTGPGSLSLSSGVFPPYIKFISYFIAEDGFVLPDQRVVTGPSGSFSIMVERSLDLNLWSPVLLKNTSDANKAFFRIRIQR
Ga0179587_1023036513300026557Vadose Zone SoilMKSRSTVALFIAFISILAHSETFVGPTTADNRLLIASNSAVIITATFGDFTNSTQVVLSGFQFTQSYFAPLENGNSYALAGPAELVYSNTVLISYYVVTNSAIRTQSIGNDPIGIAIPPNKTMRLFGVPAPVNASFSRPGSGFVSLTLEPNRPAEFTGPGTLAVNSGVFFPYSKFISYFIAEDGFSMPNQRSIAGPSGSYAITVERSFDLNTWSPVLLQNTSDANKAFYRLRIQP
Ga0209811_1011778113300027821Surface SoilRLACAETFIGPTTATNRLLVPGNSAIIITTTLGDFTNSTQVVLGGGAPFTQSYFVPLQNGTVYAVAGPAELLFSNAVLFTFFRLTNSAIHSQGIANDPIGISIPANKTMHVFGVPAEVNASFVRPAAAGGGSLSFKLEPNAPAEFTGPGTLSLNSGVFPPYAKFISYFFDEDGFTLSDMRTISGPSGSFAISVERSVDLQGWSPVMLQNTSEPTKAFYRLRIQR
Ga0209517_1002044843300027854Peatlands SoilMPRTERTVKGMRSLVRAITLFSGLAATLAGAETFIGPTTATNRLLVPANSAIIITTTLGDFTNSTQVQLGGGGSFTQPSFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPIGIAIPSNQTMHIFGVPAPVNASFVRQNSQGGGALSFVLAPNSPAEFTGPGTLFLSSGVLPPNAKFISYFIDQGGTPLPGQQAVAGPTGSFAITVDKSVDLSTWLPIWMQTASDPASAFYRLRIQR
Ga0209777_1005714723300027896Freshwater Lake SedimentMKPICLVAVSLAVAAQLACGETFIGPTTATNRLLVPASSAIIITATFGDFTNSTQVQLDGGTPFTQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNTAIYTQGIANDPIGIPIASNKTMHLFGVPAPTSASFTRLDSAGGGTLSFTLAPNTPAEFSGPGTLFLNSGVFPPYAKFISYLMEEDGFALPDQRAIAGPSGSFAISVEKSVDLKTWSPVLLQNSSDPANAYYRLRIQR
Ga0209415_10011576123300027905Peatlands SoilMRSLVRAITLFSGLAATLAGAETFIGPTTATNRLLVPANSAIIITTTLGDFTNSTQVQLGGGGSFTQPSFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAILSQGIANDPIGIAIPSNQTMHIFGVPAPVNASFVRQNSQGGGALSFVLAPNSPAEFTGPGTLFLSSGVLPPNAKFISYFIDQGGTPLPGQQAVAGPTGSFAITVDKSVDLSTWLPIWMQTASDPASAFYRLRIQR
Ga0311361_1002894933300029911BogMKPTTVVIICLALASILAHGETFIGPTTATNRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0311362_1009619143300029913BogPTTATNRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0311359_1057629513300029914BogCLALASILAHGETFIGPTTATNRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0311343_1072132413300029953BogRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0311342_1091985213300029955BogMKPTTVVIICLALASILAHGETFIGPTTATNRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYA
Ga0302277_115807513300029982BogPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0311344_1113261813300030020BogVVIICLALASILAHGETFIGPTTATNRLIVPANSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAI
Ga0302192_1022133413300030507BogNSAIIITATFGDFTNSTQVVLGAGGNPLLEKNFSPLENGSTYALAGPAELIFTNLALITYYQVTNSAIITQFVGNDPIGFPIASNTTIRLFNVPAPIGAIFSRPGNSTVNFILEPNQPAEFTGPGTLSLYGDLPPYGQFISYFIEQDGFTIPNERVIAGPSGSYAITVEKSLDLNTWTPVLLANTSDPVQAFYRMKIQR
Ga0170834_10253601723300031057Forest SoilFIGPTSSTNHLLIPTNSAIIITTTLGDFTNSTHVVIQGYQLNLSYFAPLESGNSYALAGPAELIFTNPAVATFYRITNSAIFSQSIGNDPIGISIPTNKTMRLFGVPDSVNASFSRPGTVSVSLTLQPNRPAEFTGPGTLFLNSGEIPPLGKFISYFFAEDGFVLPNQHAIAGPSGSFAIMVEKSFDLNSWSPVLLDTTSDAPQAFYRLRTQR
Ga0307477_1082006613300031753Hardwood Forest SoilKLGTAIALFIASGSTLSHAATFVGPTTATNRLIVSSNSAIIITATLGDFTNSTVVAIDGDQIPQNYFAPLENGSSYALAGPAEFIFSNAVAITYYQVTNSSIFTQSIANDPIGIPIATNKTMRLFGVWAPVNASFSRPGGGFVSFTLEPNQPAEFTGPGTLYLNSGVFPPYGKFISYFLAEDGFAIPDQRFIAGPTGSFAITVEK
Ga0316217_10001441273300031813FreshwaterMIYACALFCFSDEAETFIGPTTSTNRLQIPTNSAIIITATFGDFTNSTQIRIGGGIAFTQSYFAPLENGAVYAVPGPAELIFSNAMLFTFYRLTNSGILFQGIGNDPIGIAIPSNSTMRLFGVPASVNATFERQGRDGGGGLSFTLQPNQPAEFTGPGTLFLNSGVLPPSGKFISYFMAENGSILPGEQGIAGPTGSFAITVDKSVDLNTWSPIWMQNPYAPSRAFYRLRVQR
Ga0310909_1132217613300031947SoilAPLACGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQLAFGGGAPTPLSYFAPLGNGTVYALAGPAELIFSNTMLFTFYRLTNSAIYTQGIANDPIGVNIATNKTLRLFGVWATVNASFSNPGGGSVSFMLEPGRPAEFTGPGTLFLNSGELPPYAKFISYFIEEDGFTLPDQRAIAGPAGSFAISVEK
Ga0335079_1011473743300032783SoilMKLICISALLLGVAAHGETFIGPTTATNRLVVPTNSAIIITATFGDFTNSTRVQIGGGTPFIQGYFAPLGNGTVYALAGPAELIFSNAVLITFYRLTNSAIYTQGIANDPIGIPIASNKTMHVFGVPAEVNATFTLQDGGSLSFALEPNQPAEFTGPGTLFLNSGVLPPYAKFISYFFDEDGFTLPDLRAIAGPSGSFAVMVEKSVDLQSWSPVLLQNATDSAKAFYRLRIQR
Ga0335079_1164269913300032783SoilVPANSAIIITTTLGDFTNSTQVQLGGGAPFAQSCFAPLEQGACYALAGPAELIFSNAVLFTFFRLTNSAIYTQGIANDPIGIPVASNKTMHLFGVPAPVSGSFTRPASSGAGGGAVSFTLEPNAPAEFTGPGTLFLNSGVFYPYAKFISYFIDEDGFTLPDQRTIVGPSGTFGISVEKSVDLNTWVPVLMQTASDPTRAFYRLRLQH
Ga0335069_10003668143300032893SoilMKPILSTAVTLGLAATIAYGETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLGSGAPFLQSYFAPLDSGTVYAVAGPAELIFSNAVLFTFYRLTNSAIYSQGIANDPIGLPVASNSTMHLFGVPAPVNATFTQPPSAGGGSLSFTLEPNQPAEFTGPGTLFLNSGVLPPSAKFISYFFDENGFTLPDARAVAGPSGSYAVSVEKSADLQSWSPVLLGTTTDPAKSFYRLRIQH
Ga0335069_1154084913300032893SoilSGLRFLVKPIHAIVLFLGAAATLAYGETFVGPTTATNRLLVPTNSAIIITATFGDFTNSTQVQLGAGTPFVQSYFAPLENGTVYALAGPAELIFSNAVLFTFYRLTNSAIYSQGIANDPVGISIASNKTMHVFGVPASVNASFTRPPSAGGGSLSFTLEPTQPAEFTGPGTLFLNSGVLPPYAKFISYFFDEDGFTLPDLRAVAGPSGSFAISVEKSVDLQSWSPVLLQNVSDPVKA
Ga0326728_10000161563300033402Peat SoilMKPIRTIVLILSAAATLASGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTQVQLGSGTPFIQSYFAPLESGTVYALAGPAELIFSNAVLFTFYRLTNSAIYSQGIANDPIGIPIASTKTMHVFGVPACVNASFTSLTSAGGSLSFMLEPNHPAEFTGPGTLFLNSGVFFPYAKFISYFFDEDGFTLPDLRAIAGPSGSFAISVEKSVDLQSWSPVLLQNTSDPIKAFYRLQIQH
Ga0326728_1019309233300033402Peat SoilMRTIICAIALFVALAPTLAYSETFIGPTTSTNRLLVPTNSAIIITATFGDFTNSTQVQLDGGSPFTQSFAPLGNGTVYALAGPAELIFSNAVLFTFYRLTNSAIVSQGIANDPIGIPIASNTTMHVFAVPAPVNASFVRQSSDGGGALSFVLAPNSPAEFTGPGTLFLNSGVFPPNARFISYFIDQGGYALPGQQAIAGPTGSFAISLDKSVDLNTWSPIWMDTASDPAKAFYRLRVQR
Ga0326728_1029287923300033402Peat SoilMLSQAGTFIGPTTPTNRLLISSNSAIIITATFGNFTNSTTVAAGGNQIPQIYFAPLSSGDSYALAGPAELIFSNAVVITYYQVTNSSIFTQSIANDPIPISIPTNKTMRLFGVWAPVNASFSAPGIGSVSFTLEPAQPAEFTGPGTLALNSGVFPPYGKFISYFIAEDGFAIPNQRFIAGPTGSYAISVEKSFDLTTWSPVLLGNTSDATNAYYRLRIQH
Ga0310810_1005971843300033412SoilMKARILFALFFACISALAHGETFLGPTTSTNHLLIATNSAIIITTTVGDFTNSTHVVLQGFEFNQSYFAPLESGNSYAIAGPAELIFTNPVAISFYRIINSAIHSQSIGNDPIGISIPTNKTMRLFGVPASVNASFSRPGAPSVSFTLEPNRPAEFTGPGTLFLNNGGLVPPLGKFISYFFAEDGFVLPNQRAIAGPSGSFTIMVEKSFDLNSWSPVLLENTSDAPQAFYRLRTQR
Ga0310810_1006193433300033412SoilMNPKNVIAVLCAFIAMLAHAETFVGPTTATNRLLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR
Ga0310811_1079866913300033475SoilLVPANSAIILTTMLGDFTNSTQVVLGGFAFAQAFAPLETGNSYALAGPAELIFSNAAVITYYQVTNSAIRSQGIANDPIGIQIATNKTMRLFGVPAPVNASFSRPGSGFVSFTLEPNQPAEFTGPGSLTLSSGVFFPNGKFISYFMVEDGFVLPNQRALAGPSGSYAIMVEKSFDLNNWTPVLLENTVDANRAFYRLRIQR
Ga0334837_005749_5665_63843300033823SoilMKPIYLIALILSAAATLASGETFIGPTTATNRLLVPTNSAIIITATFGDFTNSTQVQLGSGTPFIQSYFAPLESGTVYALAGPAELIFSNAVLFTFYRMTNSAIYSQGIANDPIGIPIASNKTMHVFGVPACVNASFTRPTSAGGGSLSFMLEPNHPAEFTGPGTLFLNSGVFFPYAKFISYFFDEDGFTLPDLRTIAGPSGSFAISVEKSVDLQSWSPVLLQNTSDPVKAFYRLQIQH
Ga0334837_013886_1068_17903300033823SoilMKSIYVMALFIGLGGPLAEGETFIGPTTSTNRLLVPTNSAIIITTTFGDFTNSTQVQLGDGGTPFTQSYFAPLENRTVYAVAGPAELIFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIDSNKTMHVFGVPAPVNASFERPISAGGGTLGFVLAPNGPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMENTSDPVKAYYRLQIQR
Ga0370484_0033321_586_12123300034125Untreated Peat SoilLLVPTNSAIIITTTLGDFTNSTQVQLGDGGTPFIQSYFAPLEYGTVYAVAGPAELVFSNAVLFTYYRLTNSAIYSQGIANDPIGIPIASNKTMHVFGVPAPVNASFERPISAGGGTLSFMLEPNGPAEFTGPGTLFLNSGVFFPYAKFISYFIAEDGFTLPNQRAIAGPSGSFAISVEKSVDLNTWSPVLMENTADPVKAYYRLQIQR
Ga0370492_0006993_858_15683300034282Untreated Peat SoilMKPTTVVIICLALVSILARGETFIGPTTATNRLIVPANSAIVITATLGNFTNSTQVDLGTGGAPLLEKYFAPLENGSTYALAGPAELIFTNTALITYYQVTNSAIFNQFVGNDPIGFQIASNTTIRLFNVPAPIGGIFSRPGSGVVNFTLEPNQPAEFTGPGTLSLYGDVPPYGQFITYFVEQDGFSIPNQRAIAGPSGSYAVTVEKSLDLNTWTPVLLANTSDPSQAFYRLKIQH


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.