NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F092443

Overview

Basic Information
Family ID F092443
Family Type Metagenome
Number of Sequences 107
Average Sequence Length 161 residues
Representative Sequence VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Number of Associated Samples 97
Number of Associated Scaffolds 107

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 25.23 %
% of genes near scaffold ends (potentially truncated) 22.43 %
% of genes from short scaffolds (< 2000 bps) 72.90 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.54

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
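The percentages in the Quality Assessment block can be read back as gene counts, given the 107 sequences listed under Basic Information. The following is a minimal sketch of that arithmetic. The function and variable names are hypothetical (not part of NMPFamsDB), and it assumes each percentage was computed over all 107 family members.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Convert a rounded percentage of `total` back to the nearest whole count."""
    return round(percent / 100 * total)


# "Number of Sequences" from the Basic Information block.
FAMILY_SIZE = 107

# Percentages copied from the Quality Assessment block.
quality_percentages = {
    "valid RBS motif": 25.23,
    "near scaffold end": 22.43,
    "short scaffold (< 2000 bp)": 72.90,
}

# Assumption: every percentage uses FAMILY_SIZE as its denominator.
counts = {
    label: percent_to_count(percent, FAMILY_SIZE)
    for label, percent in quality_percentages.items()
}

for label, count in counts.items():
    print(f"{label}: {count} of {FAMILY_SIZE} genes")
```

Under that assumption the three values correspond to 27, 24, and 78 genes respectively.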

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(22.430 % of family members)
Environment Ontology (ENVO) Unclassified
(37.383 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(49.533 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 23.28%, β-sheet: 17.99%, Coil/Unstructured: 58.73%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.54

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
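The low-confidence rule stated above (pTM < 0.7) and the pLDDT legend bins can be expressed as two small checks. This is only an illustrative sketch: the function names are hypothetical, and the handling of fractional pLDDT values between the integer bin edges (for example 50.5) is an assumption, since the page lists only integer ranges.

```python
# Threshold quoted on the page: models with pTM < 0.7 are flagged as low confidence.
LOW_CONFIDENCE_PTM = 0.7

# pLDDT legend bins from the structure viewer, as (upper bound, label).
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a pTM score falls below the low-confidence threshold."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM must lie in [0, 1]")
    return ptm < threshold


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to its legend bin.

    Assumption: a fractional value goes to the first bin whose upper
    bound is not exceeded, so 50.5 is reported as "51-70".
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie in [0, 100]")
    for upper, label in PLDDT_BINS:
        if plddt <= upper:
            return label
    return PLDDT_BINS[-1][1]  # not reached for valid input


print(is_low_confidence(0.54))  # the pTM-score reported for this family
```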



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 107 Family Scaffolds
PF02965	Met_synt_B12	6.54
PF13211	DUF4019	3.74
PF00672	HAMP	1.87
PF03544	TonB_C	1.87
PF12706	Lactamase_B_2	1.87
PF00753	Lactamase_B	1.87
PF00027	cNMP_binding	0.93
PF13676	TIR_2	0.93
PF02801	Ketoacyl-synt_C	0.93
PF04241	DUF423	0.93
PF04055	Radical_SAM	0.93
PF16450	Prot_ATP_ID_OB	0.93
PF04978	DUF664	0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 107 Family Scaffolds
COG1410	Methionine synthase I, cobalamin-binding domain	Amino acid transport and metabolism [E]	6.54
COG0810	Periplasmic protein TonB, links inner and outer membranes	Cell wall/membrane/envelope biogenesis [M]	1.87
COG2363	Uncharacterized membrane protein YgdD, TMEM256/DUF423 family	Function unknown [S]	0.93



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	100.00 %
Unclassified	root	N/A	0.00 %


Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300000956|JGI10216J12902_100763340	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1603
3300002557|JGI25381J37097_1000207	All Organisms → cellular organisms → Bacteria	7197
3300002916|JGI25389J43894_1018369	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1210
3300003911|JGI25405J52794_10012008	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1662
3300003911|JGI25405J52794_10052718	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	873
3300004114|Ga0062593_101061171	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	837
3300004156|Ga0062589_100821166	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	845
3300005167|Ga0066672_10701087	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	648
3300005167|Ga0066672_10994761	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	513
3300005179|Ga0066684_10592202	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	744
3300005181|Ga0066678_10380436	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	934
3300005293|Ga0065715_10948979	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	560
3300005332|Ga0066388_100183099	All Organisms → cellular organisms → Bacteria	2705
3300005338|Ga0068868_100740116	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	883
3300005339|Ga0070660_101902788	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	507
3300005439|Ga0070711_102011724	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	508
3300005446|Ga0066686_10273733	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1143
3300005454|Ga0066687_10663093	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	620
3300005467|Ga0070706_101598175	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	595
3300005553|Ga0066695_10158991	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1409
3300005554|Ga0066661_10020994	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	3474
3300005764|Ga0066903_100845976	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1645
3300005764|Ga0066903_107848074	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	548
3300005937|Ga0081455_10011787	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	8755
3300006031|Ga0066651_10627258	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	573
3300006034|Ga0066656_10239203	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1165
3300006176|Ga0070765_100025422	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	4509
3300006800|Ga0066660_10120494	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1895
3300009012|Ga0066710_102148680	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	818
3300009137|Ga0066709_100137706	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	3092
3300009137|Ga0066709_100784290	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1379
3300010304|Ga0134088_10139088	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1153
3300010321|Ga0134067_10018026	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2095
3300010323|Ga0134086_10114233	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	963
3300010325|Ga0134064_10168083	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	769
3300010358|Ga0126370_10314687	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium	1248
3300010366|Ga0126379_10020898	All Organisms → cellular organisms → Bacteria	4905
3300010373|Ga0134128_10887774	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	987
3300010376|Ga0126381_100043328	All Organisms → cellular organisms → Bacteria	5470
3300010398|Ga0126383_10240049	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	1770
3300012198|Ga0137364_10174523	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1565
3300012198|Ga0137364_10376130	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1061
3300012202|Ga0137363_10282815	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1357
3300012203|Ga0137399_10795457	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	796
3300012208|Ga0137376_10636151	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	922
3300012285|Ga0137370_10055933	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2129
3300012353|Ga0137367_11061526	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	549
3300012361|Ga0137360_10884401	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	770
3300012582|Ga0137358_10279015	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1135
3300012683|Ga0137398_10600134	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	761
3300012917|Ga0137395_10642590	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	767
3300012918|Ga0137396_10220076	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1398
3300012923|Ga0137359_10265600	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1530
3300012923|Ga0137359_11397549	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	588
3300012927|Ga0137416_11817518	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	557
3300012929|Ga0137404_11915102	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	552
3300012971|Ga0126369_10202784	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1914
3300012989|Ga0164305_11656807	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	573
3300013296|Ga0157374_10127488	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2460
3300013308|Ga0157375_10292089	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1793
3300014150|Ga0134081_10003502	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	3912
3300015077|Ga0173483_10001611	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	7187
3300015241|Ga0137418_11125597	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	557
3300015356|Ga0134073_10009289	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2164
3300015371|Ga0132258_12167142	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	1396
3300015374|Ga0132255_103289088	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	689
3300015374|Ga0132255_103771677	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	644
3300017659|Ga0134083_10086888	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1220
3300018000|Ga0184604_10003456	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2693
3300018027|Ga0184605_10009931	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	3609
3300018051|Ga0184620_10101677	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	887
3300018433|Ga0066667_10114923	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1825
3300018468|Ga0066662_10063723	All Organisms → cellular organisms → Bacteria	2427
3300019356|Ga0173481_10059984	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1341
3300019868|Ga0193720_1005585	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1720
3300019877|Ga0193722_1010366	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2368
3300019877|Ga0193722_1011022	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	2303
3300019878|Ga0193715_1064003	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	779
3300019879|Ga0193723_1008860	All Organisms → cellular organisms → Bacteria	3270
3300019881|Ga0193707_1007317	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	3796
3300019881|Ga0193707_1041656	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1482
3300019887|Ga0193729_1114088	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1014
3300020000|Ga0193692_1099214	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	617
3300020002|Ga0193730_1001885	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → Candidatus Udaeobacter copiosus	5702
3300020006|Ga0193735_1000502	All Organisms → cellular organisms → Bacteria	12445
3300020010|Ga0193749_1032647	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1011
3300020015|Ga0193734_1007929	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1978
3300020015|Ga0193734_1016995	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1374
3300020059|Ga0193745_1054454	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	875
3300020581|Ga0210399_10647973	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	871
3300021344|Ga0193719_10077317	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	1445
3300023058|Ga0193714_1001989	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium	3397
3300025928|Ga0207700_10919531	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	783
3300025930|Ga0207701_10002309	All Organisms → cellular organisms → Bacteria	19818
3300026023|Ga0207677_11907598	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	552
3300026277|Ga0209350_1000885	All Organisms → cellular organisms → Bacteria	14389
3300026308|Ga0209265_1014943	All Organisms → cellular organisms → Bacteria	2415
3300026325|Ga0209152_10010445	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	3229
3300026330|Ga0209473_1140215	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	989
3300026333|Ga0209158_1190782	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	724
3300026552|Ga0209577_10776804	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	536
3300028784|Ga0307282_10277298	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	807
3300028787|Ga0307323_10228515	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	671
3300028828|Ga0307312_10004214	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus	7702
3300028884|Ga0307308_10398511	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	659
3300028906|Ga0308309_10583746	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	969
3300032180|Ga0307471_101433809	All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia	850
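Each scaffold identifier in the table above has the form `<taxon OID>|<scaffold name>`, where the leading number matches a Taxon OID in the Associated Samples table, and each taxonomy is a lineage joined by arrows. A hedged sketch of how one row might be parsed follows. It assumes the three fields (identifier, taxonomy, length) have already been separated, and `ScaffoldRecord` and `parse_scaffold` are hypothetical names, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import List, NamedTuple


class ScaffoldRecord(NamedTuple):
    taxon_oid: str      # sample identifier before the "|"
    scaffold_id: str    # scaffold name after the "|"
    lineage: List[str]  # taxonomy ranks from root to most specific
    length: int         # scaffold length as listed in the table


def parse_scaffold(identifier: str, taxonomy: str, length: str) -> ScaffoldRecord:
    """Split one scaffold row into its components."""
    taxon_oid, scaffold_id = identifier.split("|", 1)
    lineage = [rank.strip() for rank in taxonomy.split("→")]
    return ScaffoldRecord(taxon_oid, scaffold_id, lineage, int(length))


# Example using the second row of the table.
record = parse_scaffold(
    "3300002557|JGI25381J37097_1000207",
    "All Organisms → cellular organisms → Bacteria",
    "7197",
)
print(record.taxon_oid, record.lineage[-1], record.length)
```

Keeping the lineage as a list makes it straightforward to ask how deep a classification goes, for example to separate scaffolds resolved only to Bacteria from those resolved to Verrucomicrobia.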




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	22.43%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	15.89%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	15.89%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	7.48%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	6.54%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	4.67%
Groundwater Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment	2.80%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	2.80%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	2.80%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	2.80%
Tabebuia Heterophylla Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere	2.80%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	1.87%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	1.87%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	1.87%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	1.87%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.93%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere	0.93%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	0.93%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	0.93%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere	0.93%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere	0.93%
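Because the habitat paths are hierarchical, the per-habitat percentages can be rolled up to a coarser level, for example to compare Environmental with Host-Associated samples. The sketch below shows one way to do that. `aggregate_by_level` is a hypothetical helper, and the three rows are a small excerpt of the table used only for illustration, so the printed totals do not describe the whole family.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def aggregate_by_level(rows: Iterable[Tuple[str, float]], depth: int = 1) -> Dict[str, float]:
    """Sum percentages over the first `depth` levels of each habitat path."""
    totals: Dict[str, float] = defaultdict(float)
    for path, percent in rows:
        levels = [level.strip() for level in path.split("→")]
        key = " → ".join(levels[:depth])
        totals[key] += percent
    return dict(totals)


# Excerpt of the habitat table: (taxonomy path, % of family members).
sample_rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil", 22.43),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 2.80),
    ("Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere", 2.80),
]

top_level = aggregate_by_level(sample_rows, depth=1)
second_level = aggregate_by_level(sample_rows, depth=2)

for name, percent in top_level.items():
    print(f"{name}: {percent:.2f}%")
```

Raising `depth` keeps more of the path in the grouping key, so `depth=2` distinguishes Terrestrial from Aquatic within Environmental.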




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300000956	Soil microbial communities from Great Prairies - Kansas, Native Prairie soil	Environmental
3300002557	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm	Environmental
3300002916	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm	Environmental
3300003911	Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1	Host-Associated
3300004114	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5	Environmental
3300004156	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1	Environmental
3300005167	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121	Environmental
3300005179	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133	Environmental
3300005181	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127	Environmental
3300005293	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1	Host-Associated
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005338	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2	Host-Associated
3300005339	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG	Host-Associated
3300005439	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG	Environmental
3300005446	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135	Environmental
3300005454	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136	Environmental
3300005467	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG	Environmental
3300005553	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144	Environmental
3300005554	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110	Environmental
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300005937	Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1	Host-Associated
3300006031	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100	Environmental
3300006034	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006800	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109	Environmental
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300010304	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015	Environmental
3300010321	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015	Environmental
3300010323	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG	Environmental
3300010325	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG	Environmental
3300010358	Tropical forest soil microbial communities from Panama - MetaG Plot_3	Environmental
3300010366	Tropical forest soil microbial communities from Panama - MetaG Plot_24	Environmental
3300010373	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4	Environmental
3300010376	Tropical forest soil microbial communities from Panama - MetaG Plot_28	Environmental
3300010398	Tropical forest soil microbial communities from Panama - MetaG Plot_35	Environmental
3300012198	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG	Environmental
3300012202	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG	Environmental
3300012203	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG	Environmental
3300012208	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG	Environmental
3300012285	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG	Environmental
3300012353	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG	Environmental
3300012361	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG	Environmental
3300012582	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG	Environmental
3300012683	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG	Environmental
3300012917	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG	Environmental
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012927	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012971	Tropical forest soil microbial communities from Panama - MetaG Plot_1	Environmental
3300012989	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG	Environmental
3300013296	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG	Host-Associated
3300013308	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG	Host-Associated
3300014150	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG	Environmental
3300015077	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2)	Environmental
3300015241	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)	Environmental
3300015356	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300015374	Col-0 rhizosphere combined assembly	Host-Associated
3300017659	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG	Environmental
3300018000	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex	Environmental
3300018027	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex	Environmental
3300018051	Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1	Environmental
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300018468	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111	Environmental
3300019356	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S073-202C-2 (version 2)	Environmental
3300019868	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s1	Environmental
3300019877	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1	Environmental
3300019878	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m2	Environmental
3300019879	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2	Environmental
3300019881	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2	Environmental
3300019887	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2	Environmental
3300020000	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3a1	Environmental
3300020002	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1	Environmental
3300020006	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2	Environmental
3300020010	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1s2	Environmental
3300020015	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m1	Environmental
3300020059	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1a2	Environmental
3300020581	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M	Environmental
3300021344	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2	Environmental
3300023058	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m1	Environmental
3300025928	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)	Environmental
3300025930	Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)	Environmental
3300026023	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)	Host-Associated
3300026277	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)	Environmental
3300026308	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes)	Environmental
3300026325	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)	Environmental
3300026330	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)	Environmental
3300026333	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)	Environmental
3300026552	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)	Environmental
3300028784	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121	Environmental
3300028787	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381	Environmental
3300028828	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202	Environmental
3300028884	Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195	Environmental
3300028906	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)	Environmental
3300032180	Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515	Environmental

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10216J12902_1007633402 | 3300000956 | Soil | MCYVMGTRFLLIIALSVGSALSASAKPPAQLDCTIHPWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGSTYGFVVLGYCTKTNITNSAELRNIMTKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYASGKRGQGSAATSSDASVALPGSH*
JGI25381J37097_10002072 | 3300002557 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
JGI25389J43894_10183692 | 3300002916 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVXGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
JGI25405J52794_100120083 | 3300003911 | Tabebuia Heterophylla Rhizosphere | MRSLSVLASLVVFPFLVSAETPAKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVDPKDIAWLRPGSSPVGRLGQVGPTYGFVVLGYCKKANITNSAELRNIMVKLSKLVSDHGGNAISYSKSGTELRFYFLRLEDRIYAAGKRGGGTSTVSGAPVMLPASQ*
JGI25405J52794_100527181 | 3300003911 | Tabebuia Heterophylla Rhizosphere | MRLLSVLASIVAFPFFVLAEPPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKEIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSTELRXIXIRLSKLVSDXGGNAXSYNKSGTXIRFYFLRLEDRIYAAGKRGNGGAL*
Ga0062593_1010611711 | 3300004114 | Soil | MALSVVSALSALAEPPAQLDCTIHSWNEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGALGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLPGSR*
Ga0062589_1008211662 | 3300004156 | Soil | MRTRFLLIMALSVVSALSALAEPPAQLDCTIHSWNEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGALGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLPGSR*
Ga0066672_107010871 | 3300005167 | Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAK
Ga0066672_109947611 | 3300005167 | Soil | VRFLSVLVSFVAFPFLALAEPPVPLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGSTATAP
Ga0066684_105922021 | 3300005179 | Soil | MRTRFLLVIALSVVSALSALAELPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSR*
Ga0066678_103804361 | 3300005181 | Soil | MRTRFLLIIALSVVSARLALAEPPAQLDCTIHSWNEHLKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKTNITNSAELRNIMIKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSGASVALPGSH*
Ga0065715_109489791 | 3300005293 | Miscanthus Rhizosphere | MRTRFLLIIALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNNSGTEMRFYFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR*
Ga0066388_1001830993 | 3300005332 | Tropical Forest Soil | VLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0068868_1007401162 | 3300005338 | Miscanthus Rhizosphere | MRTRFLLIMALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGYCRKANITNSAELRNIMIKLSKLVSDHGGNAISYNNSGTEMRFYFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR*
Ga0070660_1019027881 | 3300005339 | Corn Rhizosphere | MRTRFLLIIALSVVSALSALAEPPAQLDCTIQSWTEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGYCRKANITNSAELRNIMIKLSKLVSDHGGNAISYNNSGTEMRFYFLRLEDRIYAAGKRGQGSA
Ga0070711_1020117241 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MRFLSVLASLAAFPFVVSAEPPVSLDCIIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGNERTA
Ga0066686_102737331 | 3300005446 | Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVP
Ga0066687_106630931 | 3300005454 | Soil | VRVRFLSVLVSFVVFPFLALAEPPVQLDCNIQYWNEHFKKEKDHDYQHAYSTTKVAPGDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNITSSTELHNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPGSR*
Ga0070706_1015981751 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTRFLLIMALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVVRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNKSGMEMRFYFLRLEDRIYAAGKRGQGSAATSFGAPVALPGSR*
Ga0066695_101589912 | 3300005553 | Soil | MDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAISHDRSGTEIRFYFLRLQEAIFAAAKRGKASSATSPGVPFVVPVSR*
Ga0066661_100209942 | 3300005554 | Soil | MDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0066903_1008459761 | 3300005764 | Tropical Forest Soil | MRFLSALALVVAFTFLVSAEPPVNLDCTIQYWNEHSKKEKDHDYQHGYSATKVAPKDIAWLRPGVSPVGPLGQVRPTRGFVVLGGCKKTNITSGKELQNIMIKLSKLVSDHGGNAISYDKSGEEIRFWFLRLEDRIYAAGKRGNGGTTNVPGAPVVLPVSR*
Ga0066903_1078480741 | 3300005764 | Tropical Forest Soil | MRFLSVLPSLVAFPFLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0081455_1001178710 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MRLLSVLASIVAFPFFVLAEPPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKEIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSTELRNITIRLSKLVSDHGGNAISYNKSGTEIRFYFLRLEDKIYAAGKRGNGGTATASGAPVVLPVSR*
Ga0066651_106272581 | 3300006031 | Soil | ALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0066656_102392031 | 3300006034 | Soil | MDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEHFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0070765_1000254223 | 3300006176 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYNKAGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0066660_101204943 | 3300006800 | Soil | VSFVAFPFLALAEPPVPLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNITSRTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAARKRGKGSTATAPGAPVVLPGSR*
Ga0066710_1021486801 | 3300009012 | Grasslands Soil | MLTRYLSVIALSVVSALSALAELPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSR
Ga0066709_1001377063 | 3300009137 | Grasslands Soil | MRFLSVLASLVAFPFLVLAEGPVKLDCTIQSWDEHSKKEKDHDYQQGYSTTKVAPQDIAWLRPGVSPVGPLGQVRPTHRFVVLGGCKKTNITSGKELQNIMIKLSKLVSDHRGNANGYDRSGVEIRFWFLRHEDSIYAAGKHRNAGTPTAPGTPAVVPVSR*
Ga0066709_1007842902 | 3300009137 | Grasslands Soil | LIAGRRVKGMMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGKHRYSSRRSLCCPDIALECEIDCLIIDRRPICHRRI*
Ga0134088_101390882 | 3300010304 | Grasslands Soil | LIAGRRVKGMMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGDITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRLCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0134067_100180262 | 3300010321 | Grasslands Soil | LIAGRRVKGMMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0134086_101142331 | 3300010323 | Grasslands Soil | YWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAISYDKSGTEIRFYFLRLQEAIFAAAKRGKASSATSPGVPFVVPVSR*
Ga0134064_101680831 | 3300010325 | Grasslands Soil | LIAGRRVKGMMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVMLGWCEKGNITSSTELRNLMIKLSKLASDHGANAISYDKSGTEIRFYFLRLQEAIFAAAKRGKASSATSPGVPFVVPVSR*
Ga0126370_103146871 | 3300010358 | Tropical Forest Soil | LTDGKLIAQDVGMRFLSVLPSLVAFPFLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0126379_100208985 | 3300010366 | Tropical Forest Soil | LTDGKLIAQDVGMGFLSVLPSLVAFPFLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0134128_108877741 | 3300010373 | Terrestrial Soil | LAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAVSYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLPGSR*
Ga0126381_1000433286 | 3300010376 | Tropical Forest Soil | VLELLLIDGRQIDRAGCRYAFSLGSAVARRIPVLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0126383_102400493 | 3300010398 | Tropical Forest Soil | LTDGKLIAQDVGMRFLSVLPSLVAFPFLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAIDYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0137364_101745232 | 3300012198 | Vadose Zone Soil | LVVALSAGSALAEPPVKLDCTIRYWNEHFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELGNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0137364_103761302 | 3300012198 | Vadose Zone Soil | MACIVAFALSALAEPPVKLDCTIRYWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGVSPVGRLGQVGPTYGFVVLGYCRKANITSATELRNFLLKLSKLASDHGANAMSYDKSGAEIRFYFLRLEDKIFAAAKHDKGGTSSTPGAPVVLPASR*
Ga0137363_102828152 | 3300012202 | Vadose Zone Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVMLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0137399_107954573 | 3300012203 | Vadose Zone Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLDWCKKSNITGSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0137376_106361511 | 3300012208 | Vadose Zone Soil | MRFLSVLASLVAFPFLVLAEGPVKLDCTIQSWDEHSKKEKDHDYQQGYSTTKVAPQDIAWLRPGVSPVGPLGQVRPTHRFVVLGGCKKTNITSGKELQNIMIKLSKLVSDHGGNAISYDKSGVEIRFWFLRLEDRIYAAGKLRNAGTPTAPGTPAVVPVSR*
Ga0137370_100559332 | 3300012285 | Vadose Zone Soil | MQTRFLLIMALSVASALSALAEPAVKLACTIRYWNEHFKKEKEHDSQHGYSTRKVAPKDIAWLRPGASPVNRLGQVGPTYGFVVLGWCKKGNVTSSTELGNLMIKSSKLASDHGANAISYDKCGTEIRFYFLRLQEAIFAAAKRGKASSATSPGVPFVVPVSR*
Ga0137367_110615261 | 3300012353 | Vadose Zone Soil | MRFLSVLASLVVFPFLVLSEPPAKLDCTIQSWNEHSKKEKEKDHSYNQEYSTTKVAPKDVAWLRPGNSVFRLVENPYVASSRGFVLLGYCRKANVTNTTELRSILIKLSKLVSDHGGNTISYDKSGTELRFYFLRLDDRYYSAGKRGN*
Ga0137360_108844011 | 3300012361 | Vadose Zone Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITGSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGNVVKEAPLPLPALPLCCQYRARTLYFLRLR
Ga0137358_102790151 | 3300012582 | Vadose Zone Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPIGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0137398_106001341 | 3300012683 | Vadose Zone Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0137395_106425901 | 3300012917 | Vadose Zone Soil | MRFVSVLLLLAAFPSLVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGVSPVGPLGQARPTHGFVVLGGCKKTNITSSKELHNILIKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGNEGTATAPGAPVAVPVSR*
Ga0137396_102200763 | 3300012918 | Vadose Zone Soil | MQTRFLLIMALSVASAPSALAEPPVKLDRTIRYWNEHFKKEKEHDSQHGYSTTKVAPKDIAWRRPGASPVNRLGQVGPSYGFVVLGWCKKGNMTSSTELRNLMIKLSNLASDHGANAISYDKSGTEVRFYFLRLQDAIFAAAKRGKGSTATPPGVPFVVPVSR*
Ga0137359_102656002 | 3300012923 | Vadose Zone Soil | MAEPPVKLDCSIRYWNEHFKKEKEHDYETGYSTTKVAPKDIAWLEPGASPVNRLGQVLPTHGFVVLGWCKKGNITNTTELRNLMTKLSKVASDHGANAISYDKSGTEMRFYFLRQKDAIYAAGKRGKGSTASPLGVPSPGSR*
Ga0137359_113975492 | 3300012923 | Vadose Zone Soil | LALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDMSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0137416_118175181 | 3300012927 | Vadose Zone Soil | MRFLSVLASLAFPFLVSAEPPVSFDCTIQYWNEHSKREKDHDYQHGYSTTKVAPKDIAWLRPGASPVGPLGQVRPTHGFVVLGGCKKTNITSGKELQNVMIKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGEGGNGGTAAATR*
Ga0137404_119151021 | 3300012929 | Vadose Zone Soil | VRFLSVLVSFVALPFLALAEPAVQLDCTIQYWNEHLKKEKDHDYQHGYSTTNVAPEDIAWLRPGASPVNRLGQVGPTFGFVVLGLCKKSNITSSSELRNIMIKLSKLVSEHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGK
Ga0126369_102027843 | 3300012971 | Tropical Forest Soil | LTDGKLIAQDVGMRFLSVLPSLLAFPFLVLAEGPVKLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIEWLRPGASAVGRMGQMGPTYGFVVLGYCTKAKIANTAELRNILIKLSKLVSDHGGNAISYDKSGTEIRFYFLRLEDKIYAAGKRGNGGNPNASNAPVVLPVSH*
Ga0164305_116568071 | 3300012989 | Soil | AFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGKEGTATAPGAPVALPVSR*
Ga0157374_101274885 | 3300013296 | Miscanthus Rhizosphere | MRTRFLLIIALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGYCRKANITNSAELRNIMIKLSKLVSDHGGNAISYNNSGTEMRFYFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR*
Ga0157375_102920892 | 3300013308 | Miscanthus Rhizosphere | MRFLSVLASLAAFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR*
Ga0134081_100035026 | 3300014150 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGDITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR*
Ga0173483_100016115 | 3300015077 | Soil | MALSVVSALSALAEPPAQLDCTIHSWNEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGVSPVGALGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLPGSR*
Ga0137418_111255971 | 3300015241 | Vadose Zone Soil | QRMICQRMQTRFLLIMALSVAFAPSALAEPPVNLDCALRYWNEHFKKEKKHDSQHGYSTTKVAPKDIAWLRPGASPVNRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRIEDKIYAAGKRGKGSTATAPGAPVVLPVSR*
Ga0134073_100092895 | 3300015356 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVLFVVPISR*
Ga0132258_121671421 | 3300015371 | Arabidopsis Rhizosphere | MMFRMRFLSVLASLVAFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGNERTATAPGVPVVLPVSR*
Ga0132255_1032890881 | 3300015374 | Arabidopsis Rhizosphere | AFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGNERTATAPGVPVVLPVSR*
Ga0132255_1037716771 | 3300015374 | Arabidopsis Rhizosphere | MALFVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGYCRKANITNSAELRNIMIKLSKLVSDHGGNAISDNNSGTEMRFYFLRLEDRIYAAGKRGQGSAATAS
Ga0134083_100868882 | 3300017659 | Grasslands Soil | MRNDRSQFMKTKTIIVMTLLAFTCACLGQEKAPPIRLDCTIRYWNDAWKKQMGHDQTEYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0184604_100034562 | 3300018000 | Groundwater Sediment | MRSRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSH
Ga0184605_100099311 | 3300018027 | Groundwater Sediment | MRSRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYDKSGTETRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSH
Ga0184620_101016772 | 3300018051 | Groundwater Sediment | MRTRFLLIIALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKIAPKDIAWLGPGASPVGRLGQVGPTYGFVVLGYCTKANITNSAELRNIMVKLSKLVSDHGGNAISYTKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSDASVALPGSR
Ga0066667_101149232 | 3300018433 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGKHRYSSRRSLCCPDIALECEIDCLIIDRRPIRHRRI
Ga0066662_100637231 | 3300018468 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEHFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0173481_100599841 | 3300019356 | Soil | MALSVVSALSALAEPPAQLDCTIHSWNEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGALGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLPGSR
Ga0193720_10055853 | 3300019868 | Soil | MRTRFLLIIALSVVCALSALAEPPAQLDCTIHSWNEHLKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNKSGTEIRFYFLRLEDRIYAAGKRGQGSAATSFGASVALPGSH
Ga0193722_10103662 | 3300019877 | Soil | MRARFLLIIVLSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKIAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKTNITDSAELRNIMTKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSDASVALPGSR
Ga0193722_10110222 | 3300019877 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHSKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193715_10640031 | 3300019878 | Soil | MRTRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQGGPTYGFVVLGYCTKTNITNSAELRNIMTKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSDASVALPGSH
Ga0193723_10088601 | 3300019879 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTAKVAPEDIAWLRPGTSPVVRLGRVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193707_10073175 | 3300019881 | Soil | MRTRFLLIIALSVVSALSALAEPPAQLDCAIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGGPAVLPGSR
Ga0193707_10416562 | 3300019881 | Soil | VSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193729_11140881 | 3300019887 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGNTATAPGAPVVLPVSR
Ga0193692_10992141 | 3300020000 | Soil | MMSRMRFLSVLASLAAFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSMTKIAPKDIAWLRPGVSPVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGKEGTATAPGAPVALPVSR
Ga0193730_10018857 | 3300020002 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193735_100050214 | 3300020006 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTAKVAPEDIAWLRPGTSPVVRLGRVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKVVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193749_10326471 | 3300020010 | Soil | LVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193734_10079292 | 3300020015 | Soil | MRTRFLLIIVLSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKIAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKTNITDSAELRNIMTKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSDASVALPGSR
Ga0193734_10169952 | 3300020015 | Soil | MSRMRFLSVLASLAAFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSPVGPLGQVRSTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGKEGTATAPGAPVALPVSR
Ga0193745_10544541 | 3300020059 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTASAPGAPVVLPVSR
Ga0210399_106479731 | 3300020581 | Soil | MRTRFLLIIALSVVSALSALAEPPAQLDCTIHSWNEHLKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQAGPTYGFVVLGYCTKANVTNSAELRNIMIKLSKLVSDHGGNAISYNKSGMEMRFYFLRLEDRIYAAGKRGQGSAATSSGASVALPGSR
Ga0193719_100773171 | 3300021344 | Soil | VRFFSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0193714_10019893 | 3300023058 | Soil | MRSRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSR
Ga0207700_109195312 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | MMSRMRFLSVLASLAAFPFVVSAEPPVSLDCTIQYWNEHSKKEKDHDYQHGYSTTKIAPKDIAWLRPGVSQVGPLGQVRPTHGFVVLGGCKKTNITSSKELQNILTKLSKLVSDHGGNAISYDKSGTEVRFYFLRLEDKIYAAGKRGKGNAATASGAPVVLPASR
Ga0207701_1000230918 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | MRTRFLLIMALSVVSALSALAEPPAQLDCTIQSWTEHSKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGPLGQARPTSGFVVLGGCKKTNITSSKELQNILIKLSKLVSNHGGNAISYDKSGAEIRFWFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR
Ga0207677_119075981 | 3300026023 | Miscanthus Rhizosphere | MRTRFLLIMALSVVSALSALAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGGVGQARPTSGFVVLGYCRKANITNSAELRNIMIKLSKLVSDHGGNAISYNNSGTEMRFYFLRLEDRIYAAGKRGQGSAATASGAPVVLSGSR
Ga0209350_100088512 | 3300026277 | Grasslands Soil | MMDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0209265_10149432 | 3300026308 | Soil | MDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0209152_100104456 | 3300026325 | Soil | MDQRMQMRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHAANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0209473_11402151 | 3300026330 | Soil | MRTRFLLVIALSVVSALSALAELPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYNKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSSGASVALPGSR
Ga0209158_11907821 | 3300026333 | Soil | MRCLLVVALSAGSALAESPVKLDCTIRYWNEPFKKEKEHDYQHGYSTTKVAPKDIAWLRPGASPVGRLGEVGPTYGFVVLGWCKKGNITSSTELRNLMIKLSKLASDHGANAMSYDKSGTEIRFCFLRLEDAILAAAKRGKGSTATPPGVPFVVPISR
Ga0209577_107768041 | 3300026552 | Soil | VAFPFLALAEPPVPLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPGSR
Ga0307282_102772982 | 3300028784 | Soil | CTIQYWNEHFKKEKDHDYQHGYSTAKVAPEDIAWLRPGTSPVVRLGRVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0307323_102285151 | 3300028787 | Soil | MRSRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDRIYAAGKRGQGSAATSSDASVALPGSH
Ga0307312_100042148 | 3300028828 | Soil | MSVCDFFRFWCRSSHSPFWRWQSLRYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVVRLGRVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAMSYDKSGTEIRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0307308_103985111 | 3300028884 | Soil | MRSRFLLIIALSVVSALSASAEPPAQLDCTIHSWNEHFKKEKDHDYQHGYSTTKVAPKDIAWLRPGASPVSRLGQVGPTYGFVVLGYCTKANITNSAELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDRIYAGGKRGQGSAATSS
Ga0308309_105837461 | 3300028906 | Soil | VRFLSVLVSFVAFPFLALAEPPVQLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGWCKKSNITSSTELRNIMIKLSKLVSDHGGNAISYNKAGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVSR
Ga0307471_1014338091 | 3300032180 | Hardwood Forest Soil | MRFLLVLALFVAFLFSVMAEPPVTLDCTIQYWNEHFKKEKDHDYQHGYSTTKVAPEDIAWLRPGTSPVGRLGQVGPTYGFVVLGLCKKSNVTSSTELRNIMIKLSKLVSDHGGNAISYDKSGTEMRFYFLRLEDKIYAAGKRGKGSTATAPGAPVVLPVS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice