NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F070497


Overview

Basic Information
Family ID F070497
Family Type Metagenome / Metatranscriptome
Number of Sequences 123
Average Sequence Length 224 residues
Representative Sequence MHTPKLIFALFLAVPIGLAAQTNANDDVFTISVAAPTSAKDVQVRYFLNGDPTVQQSSSIAKPNEQQIMVKTGSQGKPAKGFRAIVFAPGCQFATIQADDLAAGTRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKAKMETDGTFALELPDFSGDPLWTNLSHNATLMLFLVDASTGEHLAQLSAPGALSRKGSLKVAASYPAQIQFAVR
Number of Associated Samples 87
Number of Associated Scaffolds 123

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 46.34 %
% of genes near scaffold ends (potentially truncated) 46.34 %
% of genes from short scaffolds (< 2000 bps) 72.36 %
Associated GOLD sequencing projects 73
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.77
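
The percentages in the Quality Assessment block can be converted back into approximate gene counts. The sketch below assumes each percentage was computed over all 123 family sequences, which the page does not state explicitly. The dictionary keys and the `percent_to_count` helper are illustrative names, not part of NMPFamsDB.

```python
# Family size taken from "Number of Sequences" in Basic Information.
FAMILY_SIZE = 123

# Percentages copied from the Quality Assessment block.
# Assumption: each one uses all family sequences as its denominator.
quality_percentages = {
    "valid_rbs_motif": 46.34,
    "near_scaffold_end": 46.34,
    "short_scaffold": 72.36,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of genes for a reported percentage."""
    return round(percent / 100 * total)


gene_counts = {
    name: percent_to_count(percent)
    for name, percent in quality_percentages.items()
}

for name, count in gene_counts.items():
    print(f"{name}: ~{count} of {FAMILY_SIZE} genes")
```

Under that assumption, 46.34 % corresponds to 57 genes and 72.36 % to 89 genes.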


Most Common Taxonomy
Group Bacteria (55.285 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (17.073 % of family members)
Environment Ontology (ENVO) Unclassified (19.512 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (52.846 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 10.94%, β-sheet: 36.98%, Coil/Unstructured: 52.08%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with PDB biological assemblies

PDB ID | Structure Name | Biol. Assembly | TM-score
1oxy | CRYSTALLOGRAPHIC ANALYSIS OF OXYGENATED AND DEOXYGENATED STATES OF ARTHROPOD HEMOCYANIN SHOWS UNUSUAL DIFFERENCES | 1 | 0.50227
1ll1 | HYDROXO BRIDGE MET FORM HEMOCYANIN FROM LIMULUS | 1 | 0.50185
1lla | CRYSTAL STRUCTURE OF DEOXYGENATED LIMULUS POLYPHEMUS SUBUNIT II HEMOCYANIN AT 2.1ANGSTROMS RESOLUTION: CLUES FOR A MECHANISM FOR ALLOSTERIC REGULATION | 1 | 0.50168
3wky | CRYSTAL STRUCTURE OF HEMOLYMPH TYPE PROPHENOLOXIDASE (PROPOB) FROM CRUSTACEAN | 1 | 0.50038
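
All four structural matches sit just above a TM-score of 0.5. A value of about 0.5 is a commonly used rule of thumb for two structures sharing the same overall fold. The page itself does not state which cutoff it applies, so the threshold in this sketch is an assumption and the variable names are illustrative.

```python
# Assumption: 0.5 is a conventional rule-of-thumb cutoff, not a value
# documented on this page.
FOLD_THRESHOLD = 0.5

# (PDB ID, TM-score) pairs copied from the structural matches table.
pdb_hits = [
    ("1oxy", 0.50227),
    ("1ll1", 0.50185),
    ("1lla", 0.50168),
    ("3wky", 0.50038),
]

# Hits at or above the assumed cutoff.
same_fold_candidates = [pdb_id for pdb_id, tm in pdb_hits if tm >= FOLD_THRESHOLD]

# Best-scoring hit and how far it clears the cutoff.
best_id, best_tm = max(pdb_hits, key=lambda hit: hit[1])
margin = best_tm - FOLD_THRESHOLD

print(same_fold_candidates)
print(best_id, f"{margin:.5f}")
```

The margins are small, so these matches are better read as weak structural similarity to hemocyanin-like proteins than as confident fold assignments.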



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 123 Family Scaffolds
PF01058 | Oxidored_q6 | 14.63
PF07739 | TipAS | 6.50
PF03483 | B3_4 | 3.25
PF13411 | MerR_1 | 3.25
PF01425 | Amidase | 2.44
PF08241 | Methyltransf_11 | 1.63
PF01833 | TIG | 1.63
PF01625 | PMSR | 0.81
PF00210 | Ferritin | 0.81
PF00378 | ECH_1 | 0.81
PF00005 | ABC_tran | 0.81
PF02897 | Peptidase_S9_N | 0.81
PF07366 | SnoaL | 0.81
PF03476 | MOSC_N | 0.81
PF08240 | ADH_N | 0.81
PF14026 | DUF4242 | 0.81
PF12833 | HTH_18 | 0.81
PF00535 | Glycos_transf_2 | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 123 Family Scaffolds
COG0377 | NADH:ubiquinone oxidoreductase 20 kD subunit (chain B) or related Fe-S oxidoreductase | Energy production and conversion [C] | 14.63
COG1740 | Ni,Fe-hydrogenase I small subunit | Energy production and conversion [C] | 14.63
COG1941 | Coenzyme F420-reducing hydrogenase, gamma subunit | Energy production and conversion [C] | 14.63
COG3260 | Ni,Fe-hydrogenase III small subunit | Energy production and conversion [C] | 14.63
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 2.44
COG0225 | Peptide methionine sulfoxide reductase MsrA | Posttranslational modification, protein turnover, chaperones [O] | 0.81
COG1505 | Prolyl endopeptidase PreP, S9A serine peptidase family | Amino acid transport and metabolism [E] | 0.81
COG1770 | Protease II | Amino acid transport and metabolism [E] | 0.81
COG3217 | N-hydroxylaminopurine reductase subunit YcbX, contains MOSC domain | Defense mechanisms [V] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 55.28 %
Unclassified | root | N/A | 44.72 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000156|NODE_c0709695 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4625
3300002906|JGI25614J43888_10173296 | Not Available | 581
3300003321|soilH1_10107235 | Not Available | 1274
3300004800|Ga0058861_11173896 | Not Available | 899
3300004803|Ga0058862_11490019 | Not Available | 575
3300005169|Ga0066810_10125342 | Not Available | 592
3300005332|Ga0066388_100112090 | All Organisms → cellular organisms → Bacteria | 3226
3300005332|Ga0066388_101157429 | Not Available | 1317
3300005434|Ga0070709_10348493 | All Organisms → cellular organisms → Bacteria | 1093
3300005434|Ga0070709_11110963 | Not Available | 633
3300005436|Ga0070713_101781095 | Not Available | 598
3300005437|Ga0070710_10188402 | All Organisms → cellular organisms → Bacteria | 1296
3300005439|Ga0070711_100008651 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6245
3300005439|Ga0070711_100390327 | All Organisms → cellular organisms → Bacteria | 1128
3300005439|Ga0070711_101792864 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae → Xanthomonas → Xanthomonas citri group → Xanthomonas citri | 538
3300005526|Ga0073909_10034855 | All Organisms → cellular organisms → Bacteria | 1747
3300005526|Ga0073909_10133758 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → unclassified Pirellulales → Pirellulales bacterium | 1018
3300005529|Ga0070741_10003361 | All Organisms → cellular organisms → Bacteria | 39974
3300005532|Ga0070739_10171671 | Not Available | 1131
3300005537|Ga0070730_10028774 | All Organisms → cellular organisms → Bacteria | 4216
3300005537|Ga0070730_10156502 | All Organisms → cellular organisms → Bacteria | 1542
3300005575|Ga0066702_10067030 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1974
3300005575|Ga0066702_10112204 | Not Available | 1573
3300005614|Ga0068856_101166298 | Not Available | 787
3300005764|Ga0066903_100369218 | All Organisms → cellular organisms → Bacteria | 2336
3300005764|Ga0066903_101476360 | All Organisms → cellular organisms → Bacteria | 1282
3300005764|Ga0066903_102164253 | Not Available | 1072
3300005764|Ga0066903_104663104 | Not Available | 730
3300006028|Ga0070717_10121277 | All Organisms → cellular organisms → Bacteria | 2240
3300006028|Ga0070717_10526378 | Not Available | 1070
3300006028|Ga0070717_10585629 | Not Available | 1012
3300006028|Ga0070717_10665861 | Not Available | 945
3300006041|Ga0075023_100009813 | All Organisms → cellular organisms → Bacteria | 2456
3300006173|Ga0070716_100239532 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1229
3300006175|Ga0070712_100320359 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1260
3300006175|Ga0070712_100583889 | Not Available | 944
3300006800|Ga0066660_11019066 | Not Available | 663
3300006854|Ga0075425_101063347 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Salinispora → Salinispora tropica | 922
3300006871|Ga0075434_100060023 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 3782
3300006904|Ga0075424_100040599 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 4843
3300006904|Ga0075424_101071394 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Salinispora → Salinispora tropica | 859
3300006954|Ga0079219_10247801 | Not Available | 1057
3300006954|Ga0079219_10513354 | Not Available | 844
3300006954|Ga0079219_10639787 | Not Available | 790
3300007076|Ga0075435_100352039 | Not Available | 1262
3300009088|Ga0099830_10248909 | All Organisms → cellular organisms → Bacteria | 1406
3300009143|Ga0099792_10051468 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 2009
3300010048|Ga0126373_12573624 | Not Available | 567
3300010159|Ga0099796_10005863 | All Organisms → cellular organisms → Bacteria | 3100
3300010358|Ga0126370_11821398 | Not Available | 590
3300010361|Ga0126378_10000337 | All Organisms → cellular organisms → Bacteria | 35346
3300010361|Ga0126378_10045250 | All Organisms → cellular organisms → Bacteria | 4062
3300010364|Ga0134066_10368123 | Not Available | 538
3300010371|Ga0134125_10387368 | All Organisms → cellular organisms → Bacteria | 1550
3300010376|Ga0126381_100127253 | All Organisms → cellular organisms → Bacteria | 3308
3300010396|Ga0134126_10911469 | Not Available | 988
3300010398|Ga0126383_10007642 | All Organisms → cellular organisms → Bacteria | 7622
3300011120|Ga0150983_13134742 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1780
3300011269|Ga0137392_10284032 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1366
3300011271|Ga0137393_10128124 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 2101
3300011431|Ga0137438_1118417 | Not Available | 809
3300012202|Ga0137363_10141778 | All Organisms → cellular organisms → Bacteria | 1879
3300012206|Ga0137380_10130959 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia | 2290
3300012683|Ga0137398_10219901 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1258
3300012917|Ga0137395_10012068 | All Organisms → cellular organisms → Bacteria | 4759
3300012917|Ga0137395_10344663 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1061
3300012924|Ga0137413_10504802 | Not Available | 890
3300012924|Ga0137413_10729788 | Not Available | 755
3300012929|Ga0137404_10985254 | Not Available | 771
3300012929|Ga0137404_10987759 | Not Available | 770
3300012929|Ga0137404_11067354 | Not Available | 740
3300012951|Ga0164300_10611667 | Not Available | 645
3300012955|Ga0164298_10050570 | All Organisms → cellular organisms → Bacteria | 1995
3300012955|Ga0164298_10963794 | Not Available | 627
3300012957|Ga0164303_10051398 | All Organisms → cellular organisms → Bacteria | 1829
3300012957|Ga0164303_10126305 | All Organisms → cellular organisms → Bacteria | 1314
3300012960|Ga0164301_10016156 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3200
3300012960|Ga0164301_10067003 | Not Available | 1928
3300012960|Ga0164301_10709741 | Not Available | 758
3300012971|Ga0126369_10787226 | All Organisms → cellular organisms → Bacteria | 1033
3300012984|Ga0164309_10289815 | Not Available | 1175
3300012985|Ga0164308_10051351 | All Organisms → cellular organisms → Bacteria | 2668
3300012986|Ga0164304_10039243 | All Organisms → cellular organisms → Bacteria | 2497
3300012986|Ga0164304_10062473 | All Organisms → cellular organisms → Bacteria | 2080
3300012987|Ga0164307_10132034 | All Organisms → cellular organisms → Bacteria | 1618
3300012987|Ga0164307_10329713 | Not Available | 1099
3300012987|Ga0164307_10387033 | Not Available | 1026
3300012988|Ga0164306_10005226 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 6308
3300012988|Ga0164306_10371335 | All Organisms → cellular organisms → Bacteria | 1066
3300012989|Ga0164305_10739069 | Not Available | 809
3300015053|Ga0137405_1295102 | All Organisms → cellular organisms → Bacteria | 1297
3300015242|Ga0137412_10558888 | Not Available | 870
3300015242|Ga0137412_10758004 | Not Available | 716
3300015264|Ga0137403_10005092 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 15296
3300015371|Ga0132258_11624380 | All Organisms → cellular organisms → Bacteria | 1631
3300018468|Ga0066662_10520079 | Not Available | 1090
3300019999|Ga0193718_1011200 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1968
3300021170|Ga0210400_10006675 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis | 9656
3300021170|Ga0210400_10531903 | Not Available | 970
3300021560|Ga0126371_10079412 | All Organisms → cellular organisms → Bacteria | 3215
3300021560|Ga0126371_10171284 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 2248
3300023056|Ga0233357_1006092 | Not Available | 1207
3300025544|Ga0208078_1006587 | All Organisms → cellular organisms → Bacteria | 2864
3300025898|Ga0207692_10355988 | Not Available | 903
3300025898|Ga0207692_11028985 | Not Available | 544
3300025906|Ga0207699_11420994 | Not Available | 513
3300025916|Ga0207663_11179670 | Not Available | 616
3300025928|Ga0207700_10482337 | All Organisms → cellular organisms → Bacteria | 1096
3300025928|Ga0207700_11522060 | Not Available | 593
3300026304|Ga0209240_1043020 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1710
3300026319|Ga0209647_1088119 | All Organisms → cellular organisms → Bacteria | 1512
3300026527|Ga0209059_1039747 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 2100
3300026527|Ga0209059_1061132 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 1638
3300027583|Ga0209527_1005915 | All Organisms → cellular organisms → Bacteria | 2527
3300027773|Ga0209810_1047659 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2287
3300027821|Ga0209811_10024320 | All Organisms → cellular organisms → Bacteria | 1997
3300027862|Ga0209701_10631815 | Not Available | 563
3300027903|Ga0209488_10039027 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 | 3476
3300027910|Ga0209583_10011257 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2653
3300030991|Ga0073994_12261820 | Not Available | 619
3300031231|Ga0170824_123017639 | Not Available | 659
3300031954|Ga0306926_11475697 | Not Available | 786
3300031962|Ga0307479_11179676 | Not Available | 730
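
Each scaffold identifier in the table above joins a numeric prefix to a scaffold name with `|`; the prefix matches a Taxon OID in the Associated Samples table. The sketch below splits an identifier into those two parts and flags scaffolds shorter than the 2000 bp threshold named in the Quality Assessment block. The `Scaffold` type and `parse_scaffold` helper are hypothetical names for illustration, and only three rows from the table are included.

```python
from typing import NamedTuple

# Threshold taken from "% of genes from short scaffolds (< 2000 bps)".
SHORT_SCAFFOLD_BP = 2000


class Scaffold(NamedTuple):
    taxon_oid: str   # numeric prefix before the "|"
    name: str        # scaffold name after the "|"
    length_bp: int   # scaffold length in base pairs

    @property
    def is_short(self) -> bool:
        """True when the scaffold is below the short-scaffold threshold."""
        return self.length_bp < SHORT_SCAFFOLD_BP


def parse_scaffold(identifier: str, length_bp: int) -> Scaffold:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts."""
    taxon_oid, name = identifier.split("|", 1)
    return Scaffold(taxon_oid, name, length_bp)


# Three rows copied from the Associated Scaffolds table.
rows = [
    ("3300000156|NODE_c0709695", 4625),
    ("3300002906|JGI25614J43888_10173296", 581),
    ("3300005529|Ga0070741_10003361", 39974),
]

scaffolds = [parse_scaffold(identifier, length) for identifier, length in rows]
short_fraction = sum(s.is_short for s in scaffolds) / len(scaffolds)

for s in scaffolds:
    print(s.taxon_oid, s.name, s.length_bp, "short" if s.is_short else "long")
```

Applied to all 123 rows, the same check would give a short-scaffold fraction comparable to the 72.36 % reported above, assuming that figure counts scaffolds the same way.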




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 17.07%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 16.26%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.45%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 7.32%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 6.50%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.06%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.06%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.25%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.25%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.44%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.63%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.63%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.63%
Host-Associated | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated | 1.63%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.81%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.81%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.81%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.81%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.81%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.81%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.81%
Sugar Cane Bagasse Incubating Bioreactor | Engineered → Solid Waste → Grass → Composting → Bioreactor → Sugar Cane Bagasse Incubating Bioreactor | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000156 | Sugar cane bagasse incubating bioreactor microbial communities from Sao Carlos, Brazil, that are aerobic and semianaerobic | Engineered
3300002906 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm | Environmental
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300004800 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-1 (Metagenome Metatranscriptome) | Host-Associated
3300004803 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-2 (Metagenome Metatranscriptome) | Host-Associated
3300005169 | Soil and rhizosphere microbial communities from Laval, Canada - mgHPA | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005532 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1 | Environmental
3300005537 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005614 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C6-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011431 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2 | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012955 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MG | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental
3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental
3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019999 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a1 | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300023056 | Soil microbial communities from Shasta-Trinity National Forest, California, United States - GEON-SFM-MS2 | Environmental
3300025544 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample 53-2 deep-072012 (SPAdes) | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026527 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes) | Environmental
3300027583 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M2 (SPAdes) | Environmental
3300027773 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen14_06102014_R1 (SPAdes) | Environmental
3300027821 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300030991 | Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental

Geographical Distribution
[Interactive map of the geographical distribution; powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
NODE_070969553300000156Sugar Cane Bagasse Incubating BioreactorMFAVAQTSDQDIFTITVKPPTSPQNVQVRYFLSGDSGLQQAGSIAKPDNNQIVIKTGVEDKSAKGFKAIVFSPGCEFATINANDLTASTRQADFQCQKLDTTSLHGKADISNFTGKQLQVEALYMCHWAGKFFGVPSLAISPFSVGKAKVESDGTFAVDLPNFSSDPLWSSLSHSATLTFVLVDVANGQRLGELTAPSDLSHGKALKIAASYPAEIPFSVQ*
JGI25614J43888_1017329613300002906Grasslands SoilSSQDIFTISVAAPTSPKEVQVRYFLSGDPALQQSSSIAKPDEARIVVKTGVDGKPAKGFRAIVFAPGCQFTTIKADDLSAASRQAEFQCQKLSTTPLHGRADISRFADRPLQVEALYVCGWAGQFFGVPSLAISPFSVGKAKVESDGTFAFELPDFTGDPLWASLSHNATLTFSLVDASNGAHLARLSAPREL
soilH1_1010723513300003321Sugarcane Root And Bulk SoilMRQYGFTALFLVLGLPVFAAGQNNSSQDVFTITVAAPTSPQDVQVRYALSGDPTVQQASSVARPDDNRILVETTIAGKPAKGFRAIVYSPGCQFTTINVNDLASSTRQAQFECQKLSNTTLHGKADISRFSGKQLRLEALYVCRWAGQFFGIPSLSISPFSVATAKVEQDGSFAVDLPDFSADPSWIRLSHNATLTLVLVDAANGEHLARLSAPRDLARGGGLKIAASYPAEIEFGVR*
Ga0058861_1117389613300004800Host-AssociatedLAAQTSSNENIFTISVAEPISPRDVQVRYFLSGDPAIQQSSSIAKPDDNRIVVKTGVEGKPARGFRAIVYAPGCQFVTISADDLAASTRQSDFQCQKLSTTPLHGRADISKFEGQQLQVEALYVCGWAGQFFGMPGLAISPLSVGKARVENDGTFAMELPDFSADPLWSRFSHNATLMFFLVDPDTGAPLARLSAPRDLSRGGALKVAPSYPAEVPFAVRSQTAPTNVKSSK*
Ga0058862_1149001913300004803Host-AssociatedFLALPIGLAAQTSSNENIFTISVAEPISPRDVQVRYFLSGDPAIQQSSSIAKPDDNRIVVKTGVEGKPARGFRAIVYAPGCQFVTISADDLAASTRQSDFQCQKLSTTPLHGRADISKFEGQQLQVEALYVCGWAGQFFGMPGLAISPLSVGKARVENDGTFAMELPDFSADPLWSRFSHNATLMFFLVDP
Ga0066810_1012534213300005169SoilFLNGDPTVQQSSSIANPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTLPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGSFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIEFAVRQ*
Ga0066388_10011209043300005332Tropical Forest SoilSPKDVQVRYFLSGDPAVQQSGSIARPDDDRIVVKTGVEGRPARGIRAIVFSPGCQFATFSADDLASSSRQADFHCQKLATTPLHGKADISRFAGKELQVEALYVCGWAGQFFGVPGLAISPISLAKAKVENDGTFAIELPDFGSDPSWTSLSHNATLMFVLVDGSTGAHLARLSAPRDISRRGSLKVAATYPAQIDFTVR*
Ga0066388_10115742923300005332Tropical Forest SoilMRTRVFAITLLVSLSVCLAAQTSSSEDVFTISVAAPTSPKDVQVRYFLSGDPAVQQSGSIARPDDDRIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSSSTRQADFHCQKLATMPLHGKADISRFEGKDLQVEALYVCGWAGQFFGVPGLAISPISLAKAKVESDGTFAMELPDFSSDPSWTALSHNATLMFFLVDSATGAQLARLSAPRDISRRGSLKVAGSYPGEITFVINAQKR*
Ga0070709_1034849323300005434Corn, Switchgrass And Miscanthus RhizosphereMHKPGLIFALFLAVPICVAAQTNASDDVFTISVAAPTSARDVQIRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRKGALKVAASYPAQIEFAVR*
Ga0070709_1111096313300005434Corn, Switchgrass And Miscanthus RhizosphereMKMHTPKLIFALFLAVPIGLAAQTGASDDVFTISVAAPTSAKDVQVRYFLNGDPTMQQSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFTVGKAKVESDGTFALELPDFSGDPLWKDLSHNATLMLSL
Ga0070713_10178109513300005436Corn, Switchgrass And Miscanthus RhizosphereISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALGRRGSLKVAASYP
Ga0070710_1018840223300005437Corn, Switchgrass And Miscanthus RhizosphereAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0070711_10000865153300005439Corn, Switchgrass And Miscanthus RhizosphereMHKARLIFALFLAVPICVAAQTNASDDVFTISVTAPTSAKDVQVRYFLNGDPTVQQSSSIANPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTLPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGSFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIEFAVRQ*
Ga0070711_10039032723300005439Corn, Switchgrass And Miscanthus RhizosphereMQKPGLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRKGALKVAASYPAQIEFAVR*
Ga0070711_10179286413300005439Corn, Switchgrass And Miscanthus RhizosphereFTISVAAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDEKIVVKTGVEGVSAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTNLSHNATLMLFLVDAQTGQQ
Ga0073909_1003485513300005526Surface SoilMTMRRLQFITALFLVLPLCVTAQTNSSDDVFTISVAAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISF
Ga0073909_1013375813300005526Surface SoilMRTLRFISAFFLVLPLVAAAQTNAADDVFTISVTAPVSPKDVQVRYFLNGDPVVQHSSSTAKPDDEKIVVKTGVEGVSAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLPNTALRGRADVSRFSGKDLQVAAVYVCGWAGQFFGVPGIAISPFAVGKAKVENDGSFAVEIPDFGGDPLWTSLSHNATLMFFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEIQFSVR*
Ga0070741_10003361233300005529Surface SoilMSKLGVIALLLVSMFAVAQTSDQDVFTITVKSPTSPQDVQVRYFLSGDSGLQQAGSVAKPNDNQIVIKTGVEDKSAKGFRAIVFSPGCEFATINANDLTASTRQADFQCQKLDTTSLHGKADISNFAGKQLQVEALYMCHWAGKFFGVPNLAISPFSLGKAKVENDGTFAVDLPNFSSDPLWNNLSHNAALTFVLVDAANGQRLAQLSAPRDLSHGNALKIAASYPPEISFAVR*
Ga0070739_1017167123300005532Surface SoilMMRKLVFALFLFPVFLPAQEASPDNVFTINVAAPTSSKDVQVRYLLNGNPAVMQSSSSAKPDDNQIVIKTDVEGKAAKSFRAIVFSPGCQFAVINADDLSTSNRQAQFQCQKLTDTSLHGKVDTSQFSGRDLQVEAMYVCNWAGQFFGVRGLSISPFTAGKAKVDKDGAFAMDLPDFSTDPLWNNLSHKATLMFVLVDAANGQQLAALTPPNNLAREGSLKVAASYPAEVEFTVQNQR*
Ga0070730_1002877423300005537Surface SoilMHTPKLIFALFLAVPIGLAAQTNANDDVFTISVAAPTSAKDVQVRYFLNGDPTVQQSSSIAKPNEQQIMVKTGSQGKPAKGFRAIVFAPGCQFATIQADDLAAGTRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKAKMETDGTFALELPDFSGDPLWTNLSHNATLMLFLVDASTGEHLAQLSAPGALSRKGSLKVAASYPAQIQFAVR*
Ga0070730_1015650223300005537Surface SoilMHTPKLLFALFLAVPIGLAAQTNANDDVFTISVAAPTSAKDVQVRYFLNGDPTVQHSTSIARPDEQRITVKTASQGQPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTIPLSGKADISRFSGRDMQVEALYVCGWAGRFFGMPGLAISPFSVGKAKMETDGTFALELPDFSGDPLWTNLSHNATLTLFLVDASTGEHLAQLSAPGALSRKGSLKVAASYPAQIQFAVR*
Ga0066702_1006703023300005575SoilMRKSGFIFAFLLALPALAMAQANSSQDVFTITVASPTSPKDVQVRYFLNGDPAAQQSSSIAQPDDNKIVVKTEVTGKSAKSFRAIIYSPGCQFSTISADDLSASTRQAEFQCQKLSTTPLHGKADISRFTGKQLQVEALYVCRWAGQFFGVPGLAISPFSVAKTKVENDGSFALDLPDFSSDPLWSNLSHNATLTLVLIDGSNGERLGRLSAPRDLSRGSGLKVAASYPAEIAFAVR*
Ga0066702_1011220433300005575SoilMRKYALLSFFLALPVLVAAQADPNQDVFAITVAAPTSPRDVQVRYALSGNPSVQQASSVARPEDNRILVETSVAGKPAKGFRAILFAPGCQLSTISVDDLSAGTRQAQFECQKLSTTPLHGKADVSRFSGKQLQVEALYVCRWAGQFFGVPGLAISPFSVTSAKVGEDGAFALDLPNFAADPLWNNLSHNATLTLVLVDSANGERLAHFTAPHDLSRGSSLKIAASYPAEIEFTVR*
Ga0068856_10116629823300005614Corn RhizosphereSVAGPTSAKDVQVRYFLNGDPAVQQSSSIAKPDEHRITVKTGTQAQPAKSFRAIIFAPGCQFATIQADDLAAGNRQADFQCQKLATVPLSGKADISRFSGRDMQVEALYVCGWAGQFFGMRGLAISPFSVGKAKMETDGTFALELPDFSADPLWKDLSHNATLTLFLVDVSTGEHLAQLSAPSALSRKGALKVAASYLAQIQFAASF*
Ga0066903_10036921823300005764Tropical Forest SoilMRTRGFVFALFLALPVCLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLNGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLASNSRQAEFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLVKTKVDADGTFAMELPDFTSDPSWTSLSHNATLMFFLVDSATGAQVARLSAPRDLSRRGSLKVASSYPDEITFVVNSQSR*
Ga0066903_10147636013300005764Tropical Forest SoilMRKFGLLLTLLFVSIFAAAQSNSSPDVFTITVAPPTSPQNIQVRYFMRGDPAVQQSSSVATPGDGKIVVDTTVAGKQAKGFRGIVFAPGCQLATISVDDLASSSREAEFRCQKLSTTALHGKTDVSRFSGKQLQVEALYVCRWAGQFFGVPNFSISPFALGKTKVSEDGSFAFDLPDFAADPLWNSLSHNATITLVLVDASTGEHLARLAGPRDLSKGSSLKVAASYPAEIEFTVR*
Ga0066903_10216425323300005764Tropical Forest SoilAQTNSADDVFTISVAAPVLPKDVQVRYYLNGDPVVQQSSSVAKPDDERIVVKTGVEGVSAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADISRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLTLFLVDAQTGEQLAQLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0066903_10466310413300005764Tropical Forest SoilMRTRGFALVLFLALSVFLSAQTTSSENVFTIGVAAPTSPQDVQVRYFLSGDPAVQQSGSIARPDDDRIVVKTGVEGKPARGIRAILFSPGCQFATISADDLASSSRQADFRCQKLATVPLHGKADISRFEGKDLQVEALYVCAWAGQFFGVPGLSISPLTLAKAKVEKDGTFAMDLPDFPRDPLWTNLSHNATLMFFLVDGASGAQLARLSAPRDISRRGSLKVA
Ga0070717_1012127723300006028Corn, Switchgrass And Miscanthus RhizosphereMRRLQFITALFLALPLCVTAQTNSSDDVFTISVAAPVSQKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0070717_1052637813300006028Corn, Switchgrass And Miscanthus RhizosphereMHKSRLIFALFLALPIALAAQANPSDDVFTISVAPPISTKDVQVRYLLNGDPTVQQSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTVPLSGKADISRFSGRDMQVEALYVCAWAGQFFGTPGLAMSPFSVGKAKVENDGTFALELPDFSADPLWKNLSHNATLTLFLVDVSTGEHLAQLSAPAALSRKGALKVAASYPAQIEFAVQQ*
Ga0070717_1058562923300006028Corn, Switchgrass And Miscanthus RhizosphereMHKPGLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPIKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMRVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGALKVAASYPAQIQFAIQ*
Ga0070717_1066586113300006028Corn, Switchgrass And Miscanthus RhizosphereAPVSPQDVQVRYFLNGDAAVQQSSSIAKPDESKIVVKTGVEGKPARSFRAIVYSPGCQFATVKADDLSNSTRQAQFQCQKLATSPLHGRADVSNFAGKDLQVEVLYVCGWAGQFFGVPNLAISPLSVGKTKMESDGTFAFDLPDFSGDPLWASLSHNATLMFYLVDASNGAHLARLSAPRDLSRKGALKVAASYPAEIQFSVR*
Ga0075023_10000981333300006041WatershedsMGLAAQTSSNENIFTISVAEPTSPKDVQVRYFLSGDPAVQQSSSIAKPDDNRIVVKTGVEGKPARGFRAIVYAPGCQFVTISADDLAASTRQSDFQCQKLSTTPLHGRADIAKFAGRDLQVEALYVCGWAGQFFGMPGLAISPLSVARARVENDGSFAMELPDFSADPLWGSLSHNATLMFFLVDPASGAHLARLSAPRDLSRGGALKVAPSYPAEIPFAVRSQSAPTNSKSSK*
Ga0070716_10023953233300006173Corn, Switchgrass And Miscanthus RhizosphereARDVQIRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGALKVAASYPAQIQFAIQ*
Ga0070712_10032035923300006175Corn, Switchgrass And Miscanthus RhizosphereMHKSRLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGALKVAASYPAQIQFAIQ*
Ga0070712_10058388913300006175Corn, Switchgrass And Miscanthus RhizosphereMRRLQFITALFLVLPLCVTAQTNSSDDVFTISVAAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0066660_1101906613300006800SoilMRKPGLIFAFLLALQALAVAQTNSSQDVFTITVASPTSPKDVQVRYFLNGDPAAQQSSSIAQPDDNKIVVKTEVTGKSAKSFRAIIYSPGCQFSTISADDLSASTRQAEFQCQKLSTTPLHGKADISRFTGKQLQVEALYVCRWAGQFFGVPGLAISPFSVAKTKVENDGSFALDLPDFSSDPLWSNLSHNATLTFVLIDGSNGERLGRL
Ga0075425_10106334713300006854Populus RhizosphereVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSANSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLARLSAPRDVSRRGSLKVASSYPDEITFVVNSQSR*
Ga0075434_10006002313300006871Populus RhizosphereMRMRGFVFALFLALPVFLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSANSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLA
Ga0075424_10004059933300006904Populus RhizosphereMRMRGFVFALFLALPVFLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSANSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLARLSAPRDVSRRGSLKVASSYPDEITFVVNSQSR*
Ga0075424_10107139413300006904Populus RhizosphereMADQKVVIMRTGGFVLALFLALPVCLAAQTSSSNDDVFTIAVAAPTSPKDVQVRYLLSGDPVVRQSGSIAKPDDNQIVVKTGFEGRPAHGIRAIVFSPGCQFATISADNLASNSRQADFHCQKLATMPLHGKADVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFSRDPLWTSLSHNATLMFFLVDGATGAQLARLSAPRDISRRGSLKVASSYPDEITFV
Ga0079219_1024780123300006954Agricultural SoilMYKPRLIFALFLAVPICVAAQTNASDDVFTISVAAPISARDVQIRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFSGRDMQVEALYVCGWAGQFFGMRGLAISPFSVGKAKMETDGTFALELPDFSADPLWKDLSHNATLTLFLVDVSTGEHLAQLSAPSALSRKGALKVAASYPTQIQFAVQQ*
Ga0079219_1051335413300006954Agricultural SoilMRRLYWITALFLALPLCVTAQPNSADDVFTISVGAPVSPKDVQVRYYLNGDPVVQQSSSVAKPDDERIVVKTGVEGVSAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLTLFLVDAQTGEQLAQLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0079219_1063978713300006954Agricultural SoilMMKMHLPRLIFALFLAVPIGLAAQTNANDDVFTISVAAPTSAKDVQVRYFLNGDPSVSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQYATIQADDLAAGNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRKGALKVAASYPAQIEFAVR*
Ga0075435_10035203923300007076Populus RhizosphereAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSANSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALHVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLARLSAPRDVSRRGSLKVASSYPDEITFVVNSQSR*
Ga0099830_1024890923300009088Vadose Zone SoilLGDNRLGKENTMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLHGKVDVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGAFAFELPDFSADPLWANLAHIASLMFFLVDASNGEHLARLTA
Ga0099792_1005146823300009143Vadose Zone SoilLGDNRLGKENTMRTYRLFFVLSLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAPFDCQKLTTTPLHGKVDVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPVEIQFGIK*
Ga0126373_1257362413300010048Tropical Forest SoilDNQDIFTITVASPTSPRDVQVRYFLIGDSAVQQAGSIARPNDNQIVIKTGVEDKSARGFRAIVFSPGCELGTISADDLASSTRKADFECQKLASSSLHGKTDLAEFAGKQLQVETLYQCNWAGQFFGVSGLEISPFSVAKAKVESDGSFAVDLPDFTSDPLWGHLSHNATLMFVLVDAANGERLATLS
Ga0099796_1000586323300010159Vadose Zone SoilMRTYRLFFVLSLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAPFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0126370_1182139813300010358Tropical Forest SoilDIFTVNVAAPTAPQDVQVRYFLSGDPAVQQAGSVAKPESSRIVVKTSVEGKSAAGIRAIVFAPGCQFFTVSADNLGASTRQADFQCQKLATTQLHGKVDVSRFAGKELQLETLYMCSWAGQFFGVPALSISPFSLGKTKVDAEGAFALELPDFASDPQWQKLTSNATLTFMLVDKATGQRLARLSAPGELSRRGAL
Ga0126378_10000337273300010361Tropical Forest SoilMSKLGFLLLLLPVLAPAQVDNQDIFTITVASPTSPRDVQVRYFLSGDSAVQQAGSIARPNDNQIVIKTGVEDKSARGFRAIVFSPGCELGTISADNLASSTRKADFECQKLASSSLHGKTDLAEFAGKQLQVETLYQCNWAGQFFGVSGLEISPFSVAKAKVESDGSFAVDLPDFTSDPLWGHLSHNATLMFVLVDAANGERLATLSAPSDLSRRGALKIAHSYPAEIPFTVRSQSR*
Ga0126378_1004525033300010361Tropical Forest SoilMRRFGFIALLLLFSVFAAAQTEKQDVFTIEIASPTAPQNVQVRYFLSGDPALQQAGSIAKPNDNQIVIKTGVEGKSARGFRAIVFSPGCQYATINADDLASSTRQADFQCQKLATTPLHGKADITRFAGKELQVEALYQCNWAGQFFGVAGLAISPFSVGKSKVESDGTFAVDLPDFSNDPLWSNLSHNATLILVLMDANGERLGRLSAPRSVSRGGALKIAPNYPAEIPFTIRSQVN*
Ga0134066_1036812313300010364Grasslands SoilKDVQVRYFLNGDPTVQQSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIRADDLAAGNRQADFQCQKLATVPLSGKADISRFAGRDMQVQALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGAFALELPDFSGDPLWKDLSHNATLMLFLVDASTGEQLARLSAPRELSR
Ga0134125_1038736813300010371Terrestrial SoilMSRLGVIALLLVSMFAVAQTSDQDIFTITVKPPTSPQDVQVRYFLSGDSGLQQAGSVAKPENNQIVIKTGVEDKSAKGFKAIVFSPGCEFATINANDLTASNRQADFQCQKLDTTSLHGKADISNFTGKQLQVEALYMCHWAGKFFGVPSLAISPFSVGKTKVESDGTFAVDLPNFSSDPLWSSLSHSATLTFVLVD
Ga0126381_10012725333300010376Tropical Forest SoilMRRFGFIALLLLLPVFAAAQTENQDVFTIEIASPTAPQNVQVRYFLSGDPALQQAGSIAKPNDNQIVIKTGVEGKSARGFRAIVFSPGCQFATINADDLASSTRQADFQCQKLATTPLHGKADITRFAGKELQVEALYQCNWAGQFFGVAGLAISPFSVGKSKVESDGTFAVDLPDFSNDPLWSNLSHNATLILVLMDANGERLGRLSAPRSVSRGGALKIAPNYPAEIPFTIRSQVN*
Ga0134126_1091146913300010396Terrestrial SoilMSRLGVIALLLVSMFAAAQTSDQDIFTITVKPPTSPQNVQVRYFLSGDSGLQQAGSVAKPENNQIVIKTGVEDKSAKGFKAIVFSPGCEFATINANDLTASNRQADFQCQKLDTTSLHGKADISNFTGKQLQVEALYMCHWAGKFFGVPSLAISPFSVGKAKVESDGTFAVDLPNFSSDPLWSSLSHSATLTFVLVDVANGQRLGELTAPSDLSHGKALKIAASYPAEIPFSVQ*
Ga0126383_1000764253300010398Tropical Forest SoilMDVTNMYKSRLIFALFLAAPIALAAQTNPSDDVFTISVAAPTSAKDVQVRYFLNGDPTVQQASSIAKPDEQRITVTTGTQGKPAKGFRAIVFAPGCQFATIQADNLAAGNRQADFQCQKLSTVPLSGKADISRFSGRDMQVEALYVCGWAGQFFGSPGLAMSPFSVGKAKVESDGTFALELPDFSADPLWKKLSHNATLTLFLVDASTGEHLAELSAPTALSRKGALKVAASYPAQMEFTVQQ*
Ga0150983_1313474213300011120Forest SoilYRFIIALSLALPICVAAQNNSSDDVFTISVVAPVSPKDVQVRYFLNGGSMVQEASSIAKPEDAGIVVKTGASGKTASSFRAIVYSPGCQFATIQADDLSTSPRQVQFQCQKLTTTTLHGRIDAARFAGRDLQVQALYVCGWAGKFFGIPGIAISPFSVTRAHVETDGSFSVELPDFTGDPLWTNLSHNATLMFSLVDARTGEPLAGLTAPSDLSRKGALKVAASYPAEVQFAIR*
Ga0137392_1028403223300011269Vadose Zone SoilLGDNRLGKENTMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLRGKVDVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGAFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTVPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0137393_1012812413300011271Vadose Zone SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLRGKVDVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHIASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0137438_111841723300011431SoilFSICLAAQTSNQDIFTISVAAPTATKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSAEGKQAKGFRAIIFAPGCQFATIKTDDLASSTRQAQFDCQKLTTTPLHGKVDVSRFSGKDLQVEALYVCNWAGPFFSVPGLAISPFSLGKARMESDGTFAFELPDFSADPLWQNMAHNATLMFFLVDASNGEHLARLTAPRDLSRKGSLKVASGYAEEIQFTVR*
Ga0137363_1014177823300012202Vadose Zone SoilMTMRTYRFIIALSLALPICVVAQNNSSDDVFTISVAAPVSPRDVQVRYFLNGGSMVQEASSIAKPEDAGIVVKTGASGKTASSFRAIVYSPGCQFATIQADDLSTSTRQVQFQCQKLTTTTLHGRTDAARFAGRDLQVQALYVCGWAGKFFGIRGIAISPFAVTKAHVETDGSFSVELPDFTGDPLWTNLSHNATLMFSLVDARTGEPLAGLTAPSDLSRKGSLKVAASYPAEVQFAIR*
Ga0137380_1013095923300012206Vadose Zone SoilMKMHKRRLIFALFLAVPICVAAQTNANDDVFTISVAPPTSAKDVQVRYFLNGDPTVQQSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIRADDLAAGNRQADFQCQKLATVPLSGKADISRFAGRDMQVQALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGAFALELPDFSGDPLWKDLSHNATLMLFLVDASTGEQLARLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0137398_1021990113300012683Vadose Zone SoilISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAASPRQAPFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGAFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0137395_1001206823300012917Vadose Zone SoilMRAYRFLIALVLALTALALTAFLPQTNSSQDIFTISVAAPTSPKEVQVRYFLSGDPALQQSSSIAKPDEARIVVKTGVDGKPAKGFRAIVFAPGCQFTTIKADDLSAASRQAEFQCQKLSTTPLHGRADISRFADRPLQVEALYVCGWAGQFFGVPSLAISPFSVGKAKVESDGTFAFELPDFTGDPLWASLSHNATLTFSLVDASNGAHLARLSAPRELSRRGALKVAPGYPAEIVFSVR*
Ga0137395_1034466323300012917Vadose Zone SoilAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAASTRQAPFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVNASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0137413_1050480213300012924Vadose Zone SoilMRTYRLFFVLSLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLRGKADVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0137413_1072978813300012924Vadose Zone SoilFLALPLCVAAQTNSSDDIFTIAVAPPVSPKDVQVRYYLNGDAAVQQSASIAKPDDEKIVVKTGVDGVSAKSFRAIVFAPGCQFATIKADDLASSTRQAPFECQKLSTTALHGKADVSKFTGKDLQVSAVYVCGWAGQFFGVPGIAISPFVVGKGKVENDGTFAVEIPDFSGDPLWTSLSHNATLMFFLVDAQTGQQLGRLSAPRDISRRGSLKVAASYPAEISFIVR*
Ga0137404_1098525413300012929Vadose Zone SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATVQSDNLAASTRQAPFDCQKLTTTPLHGKVDVSRFSGKDLKVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDA
Ga0137404_1098775913300012929Vadose Zone SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGMEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQTAFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDA
Ga0137404_1106735413300012929Vadose Zone SoilSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVVKTGVEGKTAKSFRAIIFAPGCQFATIQSDDLTAGTRQAPFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSLGKAKMETDGSFAFELPDFSADPLWTNLAHTASLTFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK*
Ga0164300_1061166713300012951SoilLNGDPTVQQSTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPGFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0164298_1005057023300012955SoilMRTYRLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLTTTALHGKADISRFSGKDLQVEALYVCSWAGSFFGVPALAISPFSVGKTKVESDGAFAFELPAFSADPLWARMSHNASLIFFLVDASNGEHLARLTAPSNLSRKGSLKIASEYPAEIQFGIK*
Ga0164298_1096379413300012955SoilTSTKDVQVRYFLNGDPAVQQSSSIAKPDEHRITVKTGTQAQPAKSFRAIIFAPGCQFATIQADDLAAGNRQADFQCQKLATVPLSGKADISRFSGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKAKVETDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0164303_1005139823300012957SoilMADQKDVIMRMRGFVLALFLALPVFLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSSNSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLARLSAPRDVSRRGSLKVASSYPDEITFVVNSQSR*
Ga0164303_1012630523300012957SoilMEVMNMHKSRLIFALFLAVPIALAAQTNPSDDVFTISVAAPTSAKDVQVRYFLNGDPTVQQSSSIAKPDEHRITVKTGTQAQPAKSVRDIVFAPRCQFATTQADDLAAGDRQADFQCQQRSTDPVCGKADLSGFSGRDMQVEALYVCGWAGQFFGMRGLAISPFSVGKAKMETDGTFALELPDFSADPLWKNLSHNATLTLFLVDASTGEHLAQLSAPTALSRKGALKVAASYPAQIEFAVQQ*
Ga0164301_1001615633300012960SoilMRTYPLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLAATALHGKADISGFSGKDLQVEALYVCNWAGPFFGVPALSISPFSVGKVKVESDGAFAFELPDFSADPLWARMSHNASLIFFLVDASNGEHLARLTAPSNLSRKGSLKIASEYPAEIQFGIK*
Ga0164301_1006700333300012960SoilVEVMKMHKPRLIFALFLAVPICVAAQTNASDDVFTISVAAPTSARDVHIRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGIPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGTLSRRGSLKVAASYPAQINFIVR*
Ga0164301_1070974113300012960SoilMADQKDVIMRMRGFVLALFLALPVFLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSSNSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFF
Ga0126369_1078722613300012971Tropical Forest SoilDPAVQQSSSVATPGDGKIVVDTTVAGKQAKGFRGIVFAPGCQLATISVDDLASSSREAEFRCQKLSTTALHGKTDVSRFSGKQLQVEALYVCRWAGQFFGVPNFSISPFALGKTKVSEDGSFAFDLPDFAADPLWNSLSHNATITLVLVDASTGEHLARLAGPRDLSKGSSLKVAASYPAEIEFTVR*
Ga0164309_1028981523300012984SoilMHKPGLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0164308_1005135123300012985SoilMRTYRLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDDNRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLTTTALHGRADISRFSGKDLEVEALYVCNWAGPFFGVPALSISPFSVGKAKVESDGAFAFELPDFSADPLWARMSHNASLIFFLVDASNGEHLARLTAPSNLSRKGSLKIASEYPAEIQFGIK*
Ga0164304_1003924323300012986SoilMPKPRLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0164304_1006247323300012986SoilMRTYRLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLTTTALHGKADISGFSGKDLQVEALYVCNWAGPFFGVPALSISPFSVGKEKVESDGAFAFELPDFSADPLWARMSHNASLMFFLVDASNGEHLARLTAPSNLSRKGSLKIASEYPAEIQFGIK*
Ga0164307_1013203423300012987SoilVEVTKMHKPGLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKAKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGE
Ga0164307_1032971323300012987SoilMRTYRLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLAATALHGKADISGFSGKDLQVEALYVCNWAGPFFGVPALSISPFSVGKAKVESDGAFAFELPDFSADPLWARMSHNASLIFFLVDASNGEHLARLTAPSNLSRKGSLKITSEYPAEIQFGIK*
Ga0164307_1038703313300012987SoilMRKYALLSLLPAVFLALPILTAAQANPNPNQDVFAITVAAPTSPQDVQVRYFLSGDSSVQQASSVARPEDNRILVETSVAGKPAKGFRAILFAPGCQLSTISADDLSASPRQAQFQCQKLSTTSLHGKTDVSRFSGKQLQVEALYVCRWAGQFFGVPGLAISPFSVIKGKVGEDGTFALDLPNFAADPLWNSLAHNATLTLVLVDSANGERLAHLAAPRDLSRGSSLKIAASYPQEIEFTVR*
Ga0164306_1000522623300012988SoilMHKPRLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKAKVETDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRRGSLKVAASYPAQIQFAVR*
Ga0164306_1037133523300012988SoilMRTYRLLFALFLVFSICLAAQTSNQDIFTISVAAPTAAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTSVEGKQAKGFRAIIFAPGCQFATVQADDVAAGTRQAQFDCQKLTTTALHGRADISRFSGKDLEVEALYVCNWAGPFFGVPALSISPFSVGKAKVESDGAFAFELPDFSADPLWAKMSHNASLIFFLVDASNGEHLARLTAPSNLSRKGSLKITSEYPAEIQ
Ga0164305_1073906913300012989SoilDVQVRYFLNGDPTVQQSTSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGIPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGTLSRRGSLKVAASYPAQINFIVR*
Ga0137405_129510223300015053Vadose Zone SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGMEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQTAFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGI
Ga0137412_1055888813300015242Vadose Zone SoilMRTYRLFFVLSLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLRGKADVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARFNGQVDLIQHGRLLVVMKRQVSQFNAALDG*
Ga0137412_1075800413300015242Vadose Zone SoilFLALPLCVAAQTNSSDDIFTIAVAPPVSPKDVQVRYYLNGDAAVQQSASIAKPDDEKIVVKTGVDGVSAKSFRAIVFAPGCQFATIKADDLASSTRQAPFECQKLSTTALHGKADVSKFTGKDLQVSAVYVCGWAGQFFGVPGIAISPFVVGKGKVENDGTFAVEIPDFSGDPLWTSLSHNATLMFFLVDAQTGQQLGRLSAPRDISRRGSLKVAASYPAEISFTVR*
Ga0137403_1000509213300015264Vadose Zone SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATVQSDNLAASTRQAPFDCQKLTTTPLHGKVDVSRFSGKDLKVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLK
Ga0132258_1162438013300015371Arabidopsis RhizosphereMADQKDVIMRMRGFVLALFLALPVFLAAQTSPSNDDIFTISVAAPTSPKDVQVRYFLSGDPVVRQSGSIARPDDNQIVVKTGVEGRPARGIRAIVFSPGCQFATISADNLSANSRQADFHCQKLATTPLHGKTDVSGFAGKELQVEALYVCDWAGQFFGVPRLAISPLTLAKTKVEGDGTFAMELPDFTSDPLWTSLSHNATLMFFLVDSATGAQLARLSAPRDLSRRGSLKVASSYPDEITFVVNSQSR*
Ga0066662_1052007913300018468Grasslands SoilLQVLAAAQGNANQDIFTINIAPPTSPQDVQVRYFLSGDPSVQQASVAQPGDNKIAIQTDVAGRPSKGFRAIVYSPGCQFATITADDLSSSTRTADFQCQKLATNTLHGKADVSRFSGKQLQVEALYVCRWAGQFFHVPALSISPFAVTKAKVADDGTFAIDLPDFSADPIWNNLSKNATLIFALVDSANGEHLARLAPPRDISRGSSLKVAASYPAEVEFTVK
Ga0193718_101120013300019999SoilSTMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVVKTGVEGKTAKSFRAIIFAPGCQFATVQSDNLAASTRQAPFDCQKLTTTPLHGKADVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGVK
Ga0210400_1000667533300021170SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVVKTVVEGKTAKSFRAIIFAPGCQFATIQSDNLATSTRQAPFDCQKLTTTPLHGKADVARFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPGNLSRKGSLKIASGYPAEIQFGIK
Ga0210400_1053190313300021170SoilMRTNRLLFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVVKTGVEGKTAKSFKAIIFAPGCQFATIQSDNLAASTRQAPFDCQKLTTTPLHGKADVSRFVGKDLQVEALYVCNWAGPFFGVPGLAISPFSLGKAKMETDGTFAFELPDFSADPLWANLAHNASLMFFLVDTSNGEHLARLTAPSNLSRKGSLKVSSGYPAEIQFGIK
Ga0126371_1007941243300021560Tropical Forest SoilMSKVGVIALLLVSMFAVAQSSDQDIFTITVKSPTSPQDVQVRYFLSGDSGLQQAGSVAKPNENQIVIKTGVEDKSAKGFKAIVFSPGCEFATINANDLTASTRQADFQCQKLDTTSLHGKADISNFAGKQLQVEALYMCHWAGKFFGVSGLAISPFSVGKAKVESDGTFAVDLPNFSSDPLWNNLSHNATLTFVLVDVANGQRLAQLSAPRDLSHGSALKIAASYPAEIPFTVR
Ga0126371_1017128433300021560Tropical Forest SoilMSKLGFLLLLLPVLAPAQVDNQDIFTITVASPTSPRDVQVRYFLSGDSAVQQAGSIARPNDNQIVIKTGVEDKSARGFRAIVFSPGCELGTISADNLASSTRKADFECQKLASSSLHGKTDLAEFAGKQLQVETLYQCNWAGQFFGVSGLEISPFSVAKAKVESDGSFAVDLPDFTSDPLWGHLSHNATLMFVLVDAANGERLATLSAPSDLSRRGALKIAHSYPAEIPFTVRSQSR
Ga0233357_100609233300023056SoilMRTNRLLIALFLTFSVCLVAQTSNQDIFTISVAAPTSAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTGAEGKPAKSFKAIVFAPGCQFATIQADDLAASTRQAPFECQKLNTTPLHGKADASRFAGKDLQVQALYVCNWAGPFFGVPGLAISPLAIGKAKMEADGTFAFELPDFSADPLWANMAHNASLMFFVVDASNGEHLARLTAPASLSRKGSLKVATSYPAEIQFGIK
Ga0208078_100658743300025544Arctic Peat SoilMRTNRLLIALFLAFSVCLLAQTSNQDIFTISVAAPTSAKDVQVRYFLSGDPAVQQSSSIAKPDENRIVVKTGAEGKPAKSFKAIVFAPGCQFATIQADDLAASTRQAPFECQKLNTTPLHGRADASRFAGKELQVQALYVCNWAGPFFGVPGLAISPLAIGKAKMETDGTFAFELPDFSADPLWANMAHNASLMFFLVDASNGEHLARLTAPANLSRKGSLKVATSYPAEIQFGIK
Ga0207692_103559881 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MRRLQFITALFLALPLCVTAQTNSSDDVFTISVAAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFGVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISFTVR
Ga0207692_110289851 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | MSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRNMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPAPL
Ga0207699_114209941 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | AAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPIKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTG
Ga0207663_111796701 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | AAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPVKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLVDASTGEHLAQLSAPGALSRKGSLK
Ga0207700_104823372 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MHKPRLIFALFLAVPICVAAQTNASDDVFTISVAAPMSARDVQVRYFLNGDPTVQESTSIAKPDEQRITVKTGTQAKPIKSFRAIVFAPGCQFATIQADDLAASNRQADFQCQKLSTVPLSGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKVESDGTFALELPDFSGDPLWKNLSHNATLMLSLADASTGEHLAQLSAPG
Ga0207700_115220601 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | LNGDPTVQQSSSIAKPDEQRITVKTGTQAKPAKSFRAIVFAPGCQFATIQADDLAAGNRQADFQCQKLSTVPLSGKADISRFSGRDMQVEALYVCAWAGQFFGAPGLAMSPFSVGKAKVESDGTFALELPDFSADPLWKSLSHNATLTLFLVDVSTGEHLAQLSAPAALSRKGALKVAASYPAQIEFAVQQ
Ga0209240_10430201 | 3300026304 | Grasslands Soil | MRAYRFLIALVLALTALALTAFLPQTNSSQDIFTISVAAPTSPKEVQVRYFLSGDPALQQSSSIAKPDEARIVVKTGVDGKPAKGFRAIVFAPGCQFTTIKADDLSAASRQAEFQCQKLSTTPLHGRADISRFADRPLQVEALYVCGWAGQFFGVPSLAISPFSVGKAKVESDGTFAFELPDFTGDPLWASLSHNATLTFSLVDASNGAHLARLSAPRELSRRGALKVAPGYPAEIVFSV
Ga0209647_10881191 | 3300026319 | Grasslands Soil | VAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATVQSDNLAASTRQAPFDCQKLTTTPLRGKVDVSRFAGKNLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK
Ga0209059_10397472 | 3300026527 | Soil | MRKSGFIFAFLLALPALAMAQANSSQDVFTITVASPTSPKDVQVRYFLNGDPAAQQSSSIAQPDDNKIVVKTEVTGKSAKSFRAIIYSPGCQFSTISADDLSASTRQAEFQCQKLSTTPLHGKADISRFTGKQLQVEALYVCRWAGQFFGVPGLAISPFSVAKTKVENDGSFALDLPDFSSDPLWSNLSHNATLTLVLIDGSNGERLGRLSAPRDLSRGSGLKVAASYPAEIAFAVR
Ga0209059_10611321 | 3300026527 | Soil | MRKYALLSFFLALPVLVAAQADPNQDVFAITVAAPTSPRDVQVRYALSGNPSVQQASSVARPEDNRILVETSVAGKPAKGFRAILFAPGCQLSTISVDDLSAGTRQAQFECQKLSTTPLHGKADVSRFSGKQLQVEALYVCRWAGQFFGVPGLAISPFSVTSAKVGEDGAFALDLPNFAADPLWNNLSHNATLTLVLVDSANGERLAHFTAPHDLSRGSSLKIAASYPAEIEFTVR
Ga0209527_10059154 | 3300027583 | Forest Soil | VAGPTSAKDIQVRYFLSGEPAVQQSSSIAKPDENRIVVKTGVEGKTAKSFRAIVFAPGCQFATIQSDNLAATPRQAAFDCQKLTTTPLHGKADVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHNASLMFFLVDVANGEHLARLTAPSNLSRKGSLKVASGYPAEIQFGIK
Ga0209810_10476592 | 3300027773 | Surface Soil | MMRKLVFALFLFPVFLPAQEASPDNVFTINVAAPTSSKDVQVRYLLNGNPAVMQSSSSAKPDDNQIVIKTDVEGKAAKSFRAIVFSPGCQFAVINADDLSTSNRQAQFQCQKLTDTSLHGKVDTSQFSGRDLQVEAMYVCNWAGQFFGVRGLSISPFTAGKAKVDKDGAFAMDLPDFSTDPLWNNLSHKATLMFVLVDAANGQQLAALTPPNNLAREGSLKVAASYPAEVEFTVQNQR
Ga0209811_100243203 | 3300027821 | Surface Soil | MRRLQFITALFLVLPLCVTAQTNSSDDVFTISVAAPVSPKDVQVRYYLNGDPVVQQSSSMAKPDDERIVVKTGVEGVPAKSFRAIVFAPGCQFATIKADDLSSSTRQAPFECQKLSTTALRGKADVSRFSGKDLQVTALYVCGWAGQFFSVPGISISPFAVGKAKVENDGTFAVEIPDFSGDPLWTSLSHNATLMLFLVDAQTGQQLARLSAPRDISRRGSLKVAASYPAEISFTVR
Ga0209701_106318151 | 3300027862 | Vadose Zone Soil | HGLPPAAHTPVVPSAPGSRVFLCCLRHPDARRRTYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAAFDCQKLTTTPLHGKVDVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGAFAFELPDFSADPLWANLAHTASLMFFL
Ga0209488_100390271 | 3300027903 | Vadose Zone Soil | MRTYRLFFVLSLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVIKTGVEGKTAKSFRAIIFAPGCQFATIQSDNLAAGTRQAPFDCQKLTTTPLHGKVDVSRFGGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGAFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPSNLSRKGSLKVASGYPVEIQFGIK
Ga0209583_100112572 | 3300027910 | Watersheds | MTMRKHGFFIALFLALPMGLAAQTSSNENIFTISVAEPTSPKDVQVRYFLSGDPAVQQSSSIAKPDDNRIVVKTGVEGKPARGFRAIVYAPGCQFVTISADDLAASTRQSDFQCQKLSTTPLHGRADIAKFAGRDLQVEALYVCGWAGQFFGMPGLAISPLSVARARVENDGSFAMELPDFSADPLWGSLSHNATLMFFLVDPASGAHLARLSAPRDLSRGGALKVAPSYPAEIPFAVRSQSAPTNSKSSK
Ga0073994_122618201 | 3300030991 | Soil | ANRLLFALFLAVSVCLVAQTSDQDIFTISVAGPTSAKDIQVRYFLSGDPAVQQSSSIAKPDENRIVVKTGVEGKTAKSFRAIVFAPGCQFATIQSDNLAATARQAAFNCQKLTTTSLHGKADLSRFGGKNLQVEALYVCNWAGPFFGVPGLAISPFSLGKAKMETDGTFAFELPDFSADPLWANLAHNASLMFFLVDVSNGEHLAR
Ga0170824_1230176391 | 3300031231 | Forest Soil | MRTNRLFFALFLAFSVCLVAQTSNQDIFTISVAAPTSAKDIQVRYFLGGDPAVQQSSSIAKPDENRIVVKTEVEGKTAKSFRAIIFAPGCQFATVQSDNLAASTRQAAFDCQKLTTTPLHGKADVSRFAGKDLQVEALYVCNWAGPFFGVPGLAISPFSIGKAKMETDGTFAFELPDFSADPLWANLAHTASLMFFLVDASNGEHLARLTAPS
Ga0306926_114756971 | 3300031954 | Soil | AQANNQDVFTIGVASPTSPQDVQVRYFLSGDPAVQQAGSVAKPDDNQIVVKTGVEGKPARGFRAIVFSPACQLATITADDLSSSTRQADFQCQKLATTPLHGTADVSNFAGKQLQVEILYMCNWAGQFFGVPGLAISPFSVAKAKVESDGSFSVDLPDFAGDPLWANLSHNAVLTFVLVDAANGERLARLSAPHDLARGGALKIARSYPAEIPFAVRSQVR
Ga0307479_111796761 | 3300031962 | Hardwood Forest Soil | MHMPRLIFALFLAVPIGLAAQTNANDDVFTISVAAPTSAKDVQVRYFLNGDPKVQQSTSVARPDEQRITVKTASQGQPAKGFRAIVFAPGCQFATIEADDLAAGNRQADFQCQKLSTVPLLGKADISRFAGRDMQVEALYVCGWAGQFFGMPGLAISPFSVGKGKMETDGTFALELPDFSGDPLWTNLSHNATLMLFLVDVSTGEHLAQLS

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice