NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F080278




Overview

Basic Information
Family ID F080278
Family Type Metagenome / Metatranscriptome
Number of Sequences 115
Average Sequence Length 135 residues
Representative Sequence MTKTIVKSHGYANRTPWSEASREVPVTWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKHPVWRLLEGLLRMALDRVETAQIPPR
Number of Associated Samples 99
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 71.30 %
% of genes near scaffold ends (potentially truncated) 39.13 %
% of genes from short scaffolds (< 2000 bps) 79.13 %
Associated GOLD sequencing projects 94
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.48

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
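
The percentages in the quality assessment can be turned back into approximate gene counts, since the family has 115 sequences. The following is a minimal sketch; the helper name `approx_count` and the dictionary labels are illustrative and not part of NMPFamsDB, and the counts are rounded estimates because the page reports percentages to two decimals only.

```python
def approx_count(percent: float, total: int) -> int:
    """Convert a reported percentage of family members into a rounded count."""
    return round(percent / 100 * total)


FAMILY_SIZE = 115  # "Number of Sequences" from the Basic Information block

# Percentages copied from the Quality Assessment block; labels are illustrative.
quality_percentages = {
    "valid RBS motif": 71.30,
    "near scaffold end": 39.13,
    "short scaffold (< 2000 bp)": 79.13,
}

counts = {
    label: approx_count(pct, FAMILY_SIZE)
    for label, pct in quality_percentages.items()
}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Read this way, roughly 82 genes have a valid RBS motif, about 45 sit near a scaffold end, and about 91 come from scaffolds shorter than 2000 bp.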
Hidden Markov Model

Most Common Taxonomy
Group Bacteria (50.435 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.391 % of family members)
Environment Ontology (ENVO) Unclassified
(26.957 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(41.739 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.68%, β-sheet: 1.27%, Coil/Unstructured: 49.04%

Predicted 3D Structure

Per-residue confidence (pLDDT) ranges: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.48

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
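
The cutoff stated above (pTM < 0.7) and the four pLDDT ranges from the structure legend can be written as two small checks. This is only a sketch: the function names are hypothetical, and the handling of fractional pLDDT values at range boundaries (for example 50.4) is an assumption, since the page lists integer ranges only.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the low-quality-model note


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < cutoff


def plddt_bin(plddt: float) -> str:
    """Map a per-residue pLDDT value to one of the four legend ranges.

    Assumption: a value is placed in the first range whose upper bound
    it does not exceed, so 50.4 is reported as "51-70".
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The family's reported pTM-score is 0.48, which is below the 0.7 cutoff.
print(is_low_confidence(0.48))
print(plddt_bin(65.2))
```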



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 115 Family Scaffolds
PF07963 | N_methyl | 7.83
PF00072 | Response_reg | 4.35
PF00873 | ACR_tran | 4.35
PF01569 | PAP2 | 4.35
PF00383 | dCMP_cyt_deam_1 | 2.61
PF04226 | Transgly_assoc | 1.74
PF11794 | HpaB_N | 1.74
PF01810 | LysE | 0.87
PF01841 | Transglut_core | 0.87
PF12019 | GspH | 0.87
PF13557 | Phenol_MetA_deg | 0.87
PF13544 | Obsolete Pfam Family | 0.87
PF13231 | PMT_2 | 0.87
PF12704 | MacB_PCD | 0.87
PF03704 | BTAD | 0.87
PF02321 | OEP | 0.87
PF06052 | 3-HAO | 0.87
PF14559 | TPR_19 | 0.87
PF00392 | GntR | 0.87
PF00027 | cNMP_binding | 0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 115 Family Scaffolds
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.74
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 1.74
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 0.87
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 0.87



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 50.43 %
Unclassified | root | N/A | 49.57 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000550|F24TB_12276868 | Not Available | 855
3300000559|F14TC_100970895 | Not Available | 824
3300000891|JGI10214J12806_11959477 | All Organisms → cellular organisms → Bacteria | 699
3300002245|JGIcombinedJ26739_101588347 | Not Available | 551
3300003347|JGI26128J50194_1000813 | All Organisms → cellular organisms → Bacteria | 1856
3300003349|JGI26129J50193_1006091 | Not Available | 864
3300004267|Ga0066396_10032354 | Not Available | 768
3300005295|Ga0065707_10786688 | Not Available | 604
3300005328|Ga0070676_11225426 | Not Available | 571
3300005332|Ga0066388_100332916 | All Organisms → cellular organisms → Bacteria | 2164
3300005445|Ga0070708_100048100 | All Organisms → cellular organisms → Bacteria | 3769
3300005458|Ga0070681_11813767 | Not Available | 537
3300005467|Ga0070706_100996902 | All Organisms → cellular organisms → Bacteria | 772
3300005536|Ga0070697_100010922 | All Organisms → cellular organisms → Bacteria | 7093
3300005545|Ga0070695_100004463 | All Organisms → cellular organisms → Bacteria | 8221
3300005713|Ga0066905_100579677 | Not Available | 946
3300005713|Ga0066905_101427793 | Not Available | 627
3300006176|Ga0070765_100129997 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2221
3300006176|Ga0070765_100834054 | All Organisms → cellular organisms → Bacteria | 871
3300006354|Ga0075021_10306290 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 985
3300007255|Ga0099791_10322160 | Not Available | 738
3300007265|Ga0099794_10478482 | Not Available | 654
3300007788|Ga0099795_10585401 | Not Available | 529
3300009038|Ga0099829_10333766 | All Organisms → cellular organisms → Bacteria | 1246
3300009088|Ga0099830_10974433 | Not Available | 702
3300009089|Ga0099828_11199150 | Not Available | 673
3300009147|Ga0114129_10027757 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 8015
3300010048|Ga0126373_11496227 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 740
3300011269|Ga0137392_11029802 | Not Available | 676
3300011270|Ga0137391_10899312 | Not Available | 725
3300011271|Ga0137393_10524762 | Not Available | 1016
3300011414|Ga0137442_1150823 | All Organisms → cellular organisms → Bacteria | 518
3300012205|Ga0137362_10753444 | All Organisms → cellular organisms → Bacteria | 835
3300012205|Ga0137362_11739504 | Not Available | 510
3300012360|Ga0137375_10036896 | All Organisms → cellular organisms → Bacteria | 5473
3300012362|Ga0137361_11258533 | Not Available | 664
3300012582|Ga0137358_10490197 | Not Available | 828
3300012922|Ga0137394_10350132 | Not Available | 1261
3300014318|Ga0075351_1043669 | All Organisms → cellular organisms → Bacteria | 807
3300014324|Ga0075352_1009501 | Not Available | 1841
3300015371|Ga0132258_10016045 | All Organisms → cellular organisms → Bacteria | 16062
3300015372|Ga0132256_100165219 | All Organisms → cellular organisms → Bacteria | 2235
3300017944|Ga0187786_10475021 | Not Available | 561
3300017959|Ga0187779_10805358 | Not Available | 641
3300017959|Ga0187779_11124291 | Not Available | 550
3300017961|Ga0187778_10005081 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 8573
3300017966|Ga0187776_10010555 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 4943
3300017974|Ga0187777_10138672 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1618
3300018000|Ga0184604_10011084 | All Organisms → cellular organisms → Bacteria | 1911
3300018028|Ga0184608_10086684 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1290
3300018029|Ga0187787_10022067 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1714
3300018071|Ga0184618_10107351 | All Organisms → cellular organisms → Bacteria | 1102
3300019881|Ga0193707_1032161 | Not Available | 1713
3300020579|Ga0210407_10171213 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1680
3300020580|Ga0210403_11491685 | Not Available | 509
3300020581|Ga0210399_10030651 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 4294
3300020583|Ga0210401_10744037 | Not Available | 841
3300021078|Ga0210381_10237443 | Not Available | 645
3300021080|Ga0210382_10060704 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1510
3300021086|Ga0179596_10206032 | All Organisms → cellular organisms → Bacteria | 959
3300021178|Ga0210408_10159219 | Not Available | 1789
3300021178|Ga0210408_10235616 | Not Available | 1457
3300021344|Ga0193719_10006793 | All Organisms → cellular organisms → Bacteria | 4728
3300021432|Ga0210384_10013280 | All Organisms → cellular organisms → Bacteria | 8218
3300021432|Ga0210384_11176919 | Not Available | 671
3300021559|Ga0210409_10248686 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1611
3300022525|Ga0242656_1089551 | Not Available | 589
3300022756|Ga0222622_10628738 | All Organisms → cellular organisms → Bacteria | 777
3300023072|Ga0247799_1014453 | Not Available | 1168
3300025907|Ga0207645_10648184 | Not Available | 718
3300025910|Ga0207684_10016633 | All Organisms → cellular organisms → Bacteria | 6314
3300026285|Ga0209438_1037030 | Not Available | 1617
3300026481|Ga0257155_1003594 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1848
3300026499|Ga0257181_1009049 | All Organisms → cellular organisms → Bacteria | 1291
3300026508|Ga0257161_1012595 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1550
3300026514|Ga0257168_1001144 | All Organisms → cellular organisms → Bacteria | 3348
3300026514|Ga0257168_1140214 | Not Available | 538
3300026551|Ga0209648_10206449 | Not Available | 1494
3300026772|Ga0207596_106524 | Not Available | 501
3300027364|Ga0209967_1080377 | Not Available | 511
3300027424|Ga0209984_1046137 | Not Available | 633
3300027526|Ga0209968_1100936 | Not Available | 526
3300027645|Ga0209117_1018817 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 2248
3300027645|Ga0209117_1111472 | All Organisms → cellular organisms → Bacteria | 738
3300027671|Ga0209588_1020170 | All Organisms → cellular organisms → Bacteria | 2087
3300027846|Ga0209180_10567666 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 630
3300027873|Ga0209814_10116175 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1140
3300027903|Ga0209488_10919529 | Not Available | 612
3300028047|Ga0209526_10100080 | All Organisms → cellular organisms → Bacteria | 2040
3300028047|Ga0209526_10837619 | Not Available | 566
3300028536|Ga0137415_10054836 | All Organisms → cellular organisms → Bacteria | 3854
3300028673|Ga0257175_1006807 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1611
3300028711|Ga0307293_10036596 | All Organisms → cellular organisms → Bacteria | 1498
3300028793|Ga0307299_10354901 | Not Available | 550
3300028884|Ga0307308_10496136 | Not Available | 586
3300028906|Ga0308309_10956672 | All Organisms → cellular organisms → Bacteria | 743
3300029636|Ga0222749_10171996 | Not Available | 1068
3300029636|Ga0222749_10269026 | Not Available | 872
3300030336|Ga0247826_11558966 | Not Available | 537
(restricted) 3300031150|Ga0255311_1075861 | Not Available | 717
(restricted) 3300031197|Ga0255310_10011843 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2237
(restricted) 3300031197|Ga0255310_10178266 | Not Available | 590
(restricted) 3300031197|Ga0255310_10230583 | Not Available | 522
3300031720|Ga0307469_10018450 | All Organisms → cellular organisms → Bacteria | 3714
3300031720|Ga0307469_10073308 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2281
3300031720|Ga0307469_10104383 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2001
3300031720|Ga0307469_10574677 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1003
3300031740|Ga0307468_101776849 | Not Available | 583
3300031754|Ga0307475_11542546 | Not Available | 509
3300031820|Ga0307473_10570071 | Not Available | 776
3300031962|Ga0307479_10743836 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 958
3300032180|Ga0307471_100926900 | All Organisms → cellular organisms → Bacteria | 1038
3300032180|Ga0307471_101237538 | Not Available | 910
3300032205|Ga0307472_100168031 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1626
3300034257|Ga0370495_0089221 | All Organisms → cellular organisms → Bacteria | 950

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
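
Each scaffold identifier above has the form `<number>|<name>`, and the number before the `|` appears to match a Taxon OID in the Associated Samples table. Assuming that reading is correct, a scaffold row can be split and filtered as sketched below. The `ScaffoldRecord` type and `parse_scaffold` helper are hypothetical names introduced for illustration, and the three rows are copied from the table.

```python
from typing import NamedTuple


class ScaffoldRecord(NamedTuple):
    taxon_oid: str       # assumed to be the sample's Taxon OID
    scaffold_name: str
    length_bp: int
    restricted: bool


def parse_scaffold(identifier: str, length_bp: int) -> ScaffoldRecord:
    """Split '<taxon_oid>|<scaffold_name>' and note a '(restricted)' prefix."""
    text = identifier.strip()
    restricted = text.startswith("(restricted)")
    if restricted:
        text = text[len("(restricted)"):].strip()
    taxon_oid, scaffold_name = text.split("|", 1)
    return ScaffoldRecord(taxon_oid, scaffold_name, length_bp, restricted)


# Three rows taken from the Associated Scaffolds table.
rows = [
    ("3300000550|F24TB_12276868", 855),
    ("3300015371|Ga0132258_10016045", 16062),
    ("(restricted) 3300031150|Ga0255311_1075861", 717),
]

records = [parse_scaffold(identifier, length) for identifier, length in rows]

# Scaffolds under 2000 bp are the "short scaffolds" of the quality assessment.
short = [r for r in records if r.length_bp < 2000]
print(len(short), "of", len(records), "example scaffolds are shorter than 2000 bp")
```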




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 17.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 15.65%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 9.57%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 6.09%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.35%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.35%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.35%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 4.35%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.48%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 3.48%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.61%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 2.61%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.74%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.74%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 1.74%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.74%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.74%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 1.74%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.87%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.87%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.87%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.87%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.87%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.87%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.87%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.87%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.87%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003347 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Rhizosphere soil Co-N PM | Host-Associated
3300003349 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM | Host-Associated
3300004267 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005328 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011414 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT266_2 | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300014318 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rd | Environmental
3300014324 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1 | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300017944 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MG | Environmental
3300017959 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MG | Environmental
3300017961 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG | Environmental
3300017966 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MG | Environmental
3300017974 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_10_MG | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental
3300018029 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MG | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redo | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022525 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022756 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1 | Environmental
3300023072 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S151-409C-6 | Environmental
3300025907 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes) | Host-Associated
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026481 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-A | Environmental
3300026499 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-B | Environmental
3300026508 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026772 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A2-12 (SPAdes) | Environmental
3300027364 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co AM (SPAdes) | Host-Associated
3300027424 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S PM (SPAdes) | Host-Associated
3300027526 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 AM (SPAdes) | Host-Associated
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028673 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B | Environmental
3300028711 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150 | Environmental
3300028793 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159 | Environmental
3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental
3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300030336 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1 | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]

Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI); they are marked "(restricted)" below. Using any features of the restricted sequences requires a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
F24TB_1227686823300000550SoilAHHRYANRTPWPEASREVPASWLEDEARCSTSPYSLLASWKGNRGGVPSHVMLKMLRRRLREYEPDAPRTITSPARLQRAQDMLAEIWVRSGPVGEKHPSWRLVEGLLRMALDRVAPLQAAPASDHIEPVHAERAT*
F14TC_10097089523300000559SoilAHHRYANRTPWPEASREVPASWLEDEARCSTSPYSLLASWKGNRGGVPSHVMLKMLRRRLREYEPDAPRTIAPPARLQRAQDMLAEIWVRSGPVGEKHPSWRLVEGLLRMALDRVAPLQAAPASDHIEPVHAERAT*
JGI10214J12806_1195947713300000891SoilMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDR
JGIcombinedJ26739_10158834713300002245Forest SoilMIKSHGHANRTPWSEAVREVPAAWLEDEARNSPSPYSLLASWKGIRGGVPSHVMLKLLRRRLREFQPELMNTTMPPARLQRAQETLKEIWVQVAGAGGTHPVWRLVEGVLRLALDRLGTVRPESDHTNQMLKDRPP*
JGI26128J50194_100081333300003347Arabidopsis Thaliana RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHAERAP*
JGI26129J50193_100609113300003349Arabidopsis Thaliana RhizosphereAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHAERAP*
Ga0066396_1003235433300004267Tropical Forest SoilMTEGRSGTNSRGYANRTPWDEATREVPSAWLEDEARSSSSPYSLLASWKGNRGGVPAHVMIKLLRRRLQEYPSDGTAVTLQPASLQRAQDLLKEIWARTGVAREKHPVWRLVEGMLRMARDRMETARAEAGPPATIPSDRVS*
Ga0065707_1078668813300005295Switchgrass RhizosphereMTSSRVKAHGYANRTPWSEASREVPAVWLEDEARNSPSPYSLLASWKGARGGVPSHVMVKLLRRRLRECQPNLKKTAGTPPRLQRAQDMLAQIWARVDGVGEKHPVWRLVEGLLRISLDRVTTGRTTTPDHADPEPVRTEHAP*
Ga0070676_1122542613300005328Miscanthus RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMAL
Ga0066388_10033291623300005332Tropical Forest SoilMTEGRSGTNSRGYANRTPWDEATREVPSAWLEDEARSSSSPYSLLASWKGNRGGVPSHVMIKLLRRRLQEYPVDGTAVTLQPARLQRAQDLLKEIWARTGVAREKHPVWRLVEGMLRMARDRMETARAEAGPPATIPSDRVS*
Ga0070708_10004810033300005445Corn, Switchgrass And Miscanthus RhizosphereMMKPSVKSHGYANRTPWSEATREVPAAWLEDEAKNSPSPYSLLASWKGNRGGVPSHVMIKLLRRRLSEYQPDLLKTPMPAARLQRAQDMLKEIWARAGGAGEKHTVWRLVEGVLRMALDRVGSVPGAPERPDQIRADRAP*
Ga0070681_1181376713300005458Corn RhizosphereEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHAERAP*
Ga0070706_10099690213300005467Corn, Switchgrass And Miscanthus RhizosphereEAYPEMEKMSERTVKAHGYKSRTPWAEAIREVPSAWLEDEAKNSPSPYSLLASWKGQRCGVPAHVMIRLIRRRLREHSPELVQTALPPASLQRAQDILGEIWARTAGPGDRPRIWRLVEGLLRIVSGLLGGRPG*
Ga0070697_10001092283300005536Corn, Switchgrass And Miscanthus RhizosphereMMKPSVKSHGYANRTPWSEATREVPAAWLEDEAKNSPSPYSLLASWKGNRGGVPSHVMIKLLRRRLSEYQPDLLKTPMPAARLQRAQDMLKEIWARAGGAGEKHTVWRLVEGVLRMALDRVGSVPGAPEHPDQIRADRAP*
Ga0070695_10000446363300005545Corn, Switchgrass And Miscanthus RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPLPAAPDHVQPAHAERAP*
Ga0066905_10057967713300005713Tropical Forest SoilCSTSPYSLLASWKGNRGGVPSHVMLKMLRRRLREYEPDAPRTIAPPARLQRAQDMLGEIWARSGPVGEKHPSWRLVEGLLRMALDRVAPLQAAPTSDHAEPVHAERAT*
Ga0066905_10142779313300005713Tropical Forest SoilCSTSPYSLLASWKGNRGGVPSHVMLKMLRRRLREYEPDAPTTIAPPARLQRAQDMLAEIWARSGPVGEKHPSWRLVEGLLRMALDRVAPLQAAHVLDHVEPTHAERAP*
Ga0070765_10012999723300006176SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWVRAGGASEKDRGRVWRVVEGVLRLALDCMETVPVEPDHQTPGRAP*
Ga0070765_10083405413300006176SoilGDGPAEMPMSKRPASAHGYASRTPWEEAAREVPSAWLEDEARNSPSPYSLLASWKGPRGVPAHVMIKLLRRRLREHYPELMDIPLPAARLQRALDTLAEIWARAGDSAGKHPTWRLVEGLLRLARDHTVQAKPGGDQSRHDQGH*
Ga0075021_1030629023300006354WatershedsMSKRTVTSHGYASRTPWQEASREVPSAWLEDEARSSPSPYSLLASWKGARGVPSHVMIKLLRRRLREHNPELMKTTRPPARLQRAQDMLERIWARADAVGPRHPTWRLVEDLLRLARDDRGSAPAKADGDPPRHDQGQ*
Ga0099791_1032216023300007255Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRVEPDSPVQTRTDRTS*
Ga0099794_1047848223300007265Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEEIWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRAP*
Ga0099795_1058540113300007788Vadose Zone SoilMTKPNVKSQGFADRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAQVMIKLLRRRLRENHLALMNAAPPPARLQRAQDMLEEIWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRTP*
Ga0099829_1033376623300009038Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGMPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRVEPDSPVQTRTDRTS*
Ga0099830_1097443313300009088Vadose Zone SoilTCLREIYMTKPSVKSRGYANRTPWSEAIREVPAAWLEDEAKNSPSPYSLLASWKGGRGGVPSHVMIKMLRRRLSEYQPDPMKTVIPPARLQRAQDMLKEIWARAGGAGEKHPAWRLVEGLLRMALDRAPDHPDQIRPDRAP*
Ga0099828_1119915013300009089Vadose Zone SoilNVPPGTVRRRSGPTCLREIYMTKPSVKSHGYANRTPWSEAIREVSAAWLEDEAKNSPSPYSLLASWKGGRGGVPSHVMIKMLRRRLREYQPDPMKTVSPPARLQRAQDMLKEIWARAGGAGEKHPVWRLVEGVLRMALDRAPDHPDQIRPDRAP*
Ga0114129_1002775793300009147Populus RhizosphereMTKTIVKSHGYANRTPWSEASREVPVTWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKHPVWRLLEGLLRMALDRVETAQIPPRTSRRAATP*
Ga0126373_1149622713300010048Tropical Forest SoilMTEGRSGTKSRGYANRTPWDEATREVPSAWLEDEARSSSSPYSLLASWKGNRGGVPAHVMIKLLRRRLQEYPSDGTAVTLQPASLQRAQDLLKEIWARTGVAREKHPVWRLVEGMLRMARDRMETARAEAGPPATIPSDRVS*
Ga0137392_1102980223300011269Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGMPAHVMIKLLRRRLRENHPALMNAAPPPARLLRAQDMLEELWARAGEANETNRGRVWRLVEGVLRL
Ga0137391_1089931213300011270Vadose Zone SoilMASRDMPVSKRRGTEPGYARRTPWLEAIREVPSAWLEDEAKHSPSPHSLLAAWKGDRGVPAHVMLTLLRRRLREHNPELMQTTFPAARLQRAQDMLKAIWARAGAAGGKHPTWRLVDGLLRMAREYMDADRSKPDRSQPRHDQGH*
Ga0137393_1052476223300011271Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPTHVMITLLRRRLRENPPDLMYAARPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRVEPDSPVQTRTDRTS*
Ga0137442_115082313300011414SoilTVKTHRYANRTPWSEASREVPPAWLEDEARSSPSPYSLLASWKGSRGGVPAHVMIKLLRRRLRELQPNPMATVMPPARLQRAQDMLAEIWARTDGASEKHPVWRLVEGLLRMALDRVTPRARAQAPAALWPARRPKTAPDIRPVPPG*
Ga0137362_1075344423300012205Vadose Zone SoilMTKPSAKSPEFANRTPWLEAIRDVPSTWLEDEAKHSPSPHSLLAGWKGDRGVPAHVMLKLLRRRLREHNQELMQTTFPAARLQRAQAMLKEIWARAGEANEKSRGRVWRLVEGVLRLALD
Ga0137362_1173950413300012205Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEEIWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDLTP*
Ga0137375_1003689643300012360Vadose Zone SoilMTKTMVKAHRYANRTPWSEASREVPATWLEDEARNSPSPYSLLASWKGRRGGVPSHVMIKLLRRRLRECQPDLVDAVNPPARLQRAQDMLTEIWARTGGAGEKHPAWRLIEGLLRMALDRVGRALPGSRGDRQEGHGRRFTTGT*
Ga0137361_1125853313300012362Vadose Zone SoilLSMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPDLMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRVEPDSPVQTRTDRTS*
Ga0137358_1049019713300012582Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAQVMIKLLRRRLRENHLALMNAAPPPARLQRAQDMLEELWARVGETNEKNRERVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRTP*
Ga0137394_1035013233300012922Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAQVMIKLLRRRLRENHLALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRTP*
Ga0075351_104366913300014318Natural And Restored WetlandsMRNSTVKSHGYANRTPWSEASREVPAAWLEDEAKNSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLNECQPDLMRTVISPPRLHRAQDMLSEIWARTGGAGEKHPVWRLVEGLLRMALDRVGTVPVPAASDHPEQISHDRAP*
Ga0075352_100950133300014324Natural And Restored WetlandsMRNSTVKSHGYANRTPWSEASREVPAAWLEDEAKNSPSPYSLLASWKGSRGGVPSHVMSKRLRRRRNECQPDLMRTVISPPRLHRAQDMLSEIWARTGGAGEKHPVWRLVEGLLRMALDRVGTVPVPAASDHPEQISHDRAP*
Ga0132258_1001604573300015371Arabidopsis RhizosphereMEASREVPASWLEDEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREFEPDAARSTAPPARLQRAQDMLAEIWAGSGSAGEKHPSWRLVEGLLRVMLDRVEPVHAAPNHAEPVHADRAP*
Ga0132256_10016521913300015372Arabidopsis RhizosphereLEDEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREFEPDAARSTAPPARLQRAQDMLAEIWAGSGSAGEKHPSWRLVEGLLRVMLDRVEPVHAAPNHAEPVHADRAP*
Ga0187786_1047502113300017944Tropical PeatlandGHSWLIRIGATAGDPPLERCMGNRTVRHHGYANRTPWGEATREVPSAWLENEASSSSSPYSLLASWKGSRGGVPAHVMIKLLRRRLRELHPDLMNRPLPAPRLQRAQDVLQEIWARTGTSGEKHPVWRLVEGMLRMALDRVGSAGADHDGRGGHSSDRPTRDSL
Ga0187779_1080535813300017959Tropical PeatlandWVEATREVPSAWLEDEARNSPSPYSLLASWKGSRGGVPSHVMIKLLRRRLNELHPDVARRPLPAARLQRAQDMVREIWVRTGDAAEKHPVWRLVEGMLRMALERVGSAPPDPDAVGPR
Ga0187779_1112429113300017959Tropical PeatlandMVNRTGKLHGYANRTPWAEATREVPSAWLEDEARNSPSPYSLLASWKGSRGGVPSHVMIKLLRRRINELHPDAMRRPLPAARLQRAQDMVREIWAQTGDAAEKHPVWRLLEGMLRMALERVGPL
Ga0187778_1000508113300017961Tropical PeatlandMANRTGRLHGYANRTPWVEATREVPSAWLEDEARNSPSPYSLLASWKGSRGGVPSHVMIKLLRRRLNELHPDVARRPLPAARLQRAQDMVREIWVRTGDAAEKHPVWRLVEGMLRMALERVGSAPPDPDAVGPR
Ga0187776_1001055563300017966Tropical PeatlandMGNRTVRHHGYANRTPWGEATREVPSAWLENEASSSSSPYALLASWKGSRGGVPAHVMIKLLRRRLRELHPDLMNRPLPAPRLQRAQDVLQEIWARTGTSGEKHPVWRLVEGMLRMALDRVGSAGADHDGRGGHSSDRPTRDSL
Ga0187777_1013867233300017974Tropical PeatlandMANRTGRLHGYANRTPWVEATREVPSAWLEDEARNSPSPYSLLASWKGSRGGVPSHVMIKLLRRRLNELHPDVARRPLPAARLQRAQDMVREIWARTGDAAEKHPVWRLVEGMLRMALERVGSAPPDPDAVGPR
Ga0184604_1001108413300018000Groundwater SedimentMTKTIVKSHGYANRTPWSEASREVPVTWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKRPVWRLIEGLLRMALEPVETVQVPRDRVP
Ga0184608_1008668413300018028Groundwater SedimentMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDQMDTVIPPARLQRAQDLLTEIWARTGGAGEKHPVWRLIEGLLRMALDRVETVQVPRDRVP
Ga0187787_1002206723300018029Tropical PeatlandMGNRTVRHHGYANRTPWGEATREVPSAWLENEASSSSSPYSLLASWKGSRGGVPAHVMIKLLRRRLRELHPDLMNRPLPAPRLQRAQDVLQEIWARTGTSGEKHPVWRLVEGMLRMALDRVGSAGADHDGRGGHSSDRPTRDSL
Ga0184618_1010735133300018071Groundwater SedimentMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKRPVWRLIEGLLRMALDRVETVRIPPRTSRRAATGWSR
Ga0193707_103216133300019881SoilMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDQMGTVIPPARLQRAQDLLTEIWARTGGAGEKHPVWRLIEGLLRMALDRVETVRIPPRTSRRAATGWSR
Ga0210407_1017121313300020579SoilMTKPSVKSQGFANRTPWSEAIREVPAAWLEDETKKSPSPYSLLASWKGGRGGVPAYVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLEEIWARAEEASEKDGGRVWRVVESVLRLALDRMGTARVEPDHPIQTSTDRAP
Ga0210403_1149168513300020580SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWVRAGGASEKDRGRVWRVVEGVLRLALDCMETV
Ga0210399_1003065143300020581SoilMSKRPVNAHGYASRTPWEEAAREVPSAWLEDEARSSPSPYSLLASWKGPRGVPAHVMIKLLRRRLREHYPELMDIPLPPARLQRALDMLAEIWARAGDSAGKHPIWRLVEGLLRLARDHTVQAKPDEDQSLHDQGH
Ga0210401_1074403713300020583SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWVRAGGASEKDRGRVWRVVEGVLRLALDCMETVPVEPDHPIQTPGRAP
Ga0210381_1023744313300021078Groundwater SedimentMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDQMDTVIPPARLQRAQDLLTEIWARTGGAGEKRPVWRLIEGLLRMALDPVETVRIPPRT
Ga0210382_1006070433300021080Groundwater SedimentMTKTIVKPHRYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKRPVWRLIEGLLRMALDPVETVRIPPRTSRRAATGWSR
Ga0179596_1020603213300021086Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGMPAHVMIKLLRRRLRENHPALMNAAPPPARLLRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRV
Ga0210408_1015921913300021178SoilMIKSHGHANRTPWSEAVREVPAAWLEDEARNSPSPYSLLASWKGIRGGVPSHVMLKLLRRRLREFQPELMSTTMPPARLQRAQDTLKEIWVQVAGAGGTHPVWRLVEGVLRLALDRLGTIRPEPDHTNQMRKDRPP
Ga0210408_1023561623300021178SoilMLGGIFWGASMSKQGVKSHGYANRTPWEEATREVPSAWLEDEARKSSSPYSLLASWKGPRGGVPPHVMIKLLRRRLRESYPTLKKTTFPPASLQRAQDTLKQIWARVGKTGERHSSWRLVEGLLRLISKSLGDARVEPIRAQLRRDQRR
Ga0193719_1000679323300021344SoilMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDQMDTVIPPARLQRAQDLLTEIWARTGGAGEKHPVWRLIEGLLRMALDRVETVQIPRDRVP
Ga0210384_1001328053300021432SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWVRAGGASEKDRGRVWRVVEGVLRLALDCMETVPVEPDHQTPGRAP
Ga0210384_1117691923300021432SoilLTRMNKPMTSRDMPVSNRTGTAHGYARPIPWQEAIREVPSTWLEDEARRSPSPHSLLATWKGDRGVPSHVMLKLLRRRLREHHSEMMKAPLPPARLQRTQDMLGEIWARVSGTGAKHPTWRLVEALLRMARDHLEPGR
Ga0210409_1024868623300021559SoilMLGGIFWGASMSKQGVKSHGYANRTPWEEATREVPSAWLEDEARKSSSPYSLLASWKGPRGGVPPHVMIKLLRRRLRESYPELKKTTFPPASLQRAQDTLKQIWARVGKTGERHSSWRLVEGLLRLISKSLGGARVEPVQRAQLRRDQRR
Ga0242656_108955113300022525SoilVKSHGYANRTPWEEATREVPSAWLEDEARKSSSPYSLLASWKGPRGGVPPHVMIKLLRRRLRESYPTLKKTTFPPASLQRAQDTLKQIWARVGKTGERHSSWRLVEGLLRLISKSLGDARVEPIRAQLRRDQRR
Ga0222622_1062873813300022756Groundwater SedimentMTKAIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKHPVWRLVEGLLRMTVDEPRPGRPVRYRNVN
Ga0247799_101445313300023072SoilMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHAERAP
Ga0207645_1064818413300025907Miscanthus RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPLPAAPDHVQPAHAERAP
Ga0207684_1001663373300025910Corn, Switchgrass And Miscanthus RhizosphereMMKPSVKSHGYANRTPWSEATREVPAAWLEDEAKNSPSPYSLLASWKGNRGGVPSHVMIKLLRRRLSEYQPDLLKTPMPAARLQRAQDMLKEIWARAGGAGEKHTVWRLVEGVLRMALDRVGSVPGAPERPDQIRADRAP
Ga0209438_103703023300026285Grasslands SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAQVMIKLLRRRLRENHLALMNAAPPPARLQRAQDMLEEIWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRAP
Ga0257155_100359413300026481SoilVSKRRGTEPGYARRTPWLEAIREVPSAWLEDEAQRSPSPHSRLAAWKGDRGVPAHVMLTLLRRRLREHNPELMQATFPAARLQRAQDMLKAIWARAGAAGGKHPTWRLADGLLRMARVYMDADRSKPDRTHLRHDQRAMTVGCEHDAAF
Ga0257181_100904933300026499SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVR
Ga0257161_101259513300026508SoilVSKRRGTEPGYARRTPWLEAIREVPSAWLEDEAQRSPSPHSRLAAWKGDRGVPAHVMLTLLRRRLREHNPELMQATFPAARLQRAQDMLKAIWARAGAAGGKHPTWRLADGLLRMARVYMDADRSKPDRTHLRHDQ
Ga0257168_100114473300026514SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLTLDRLGTVRVEPDSPVQTRTDRTS
Ga0257168_114021413300026514SoilMSTRRVKSHGYANRTPWHEATREVPSAWLENEARSSPSPYSLLASWKGDRGVPSHVMIKLLRRRLREHYPELMKTALPPARLQRAHDMLEEIWARAGGVGGEHPTWRLVEGLLRLARDHMEAVRAKRD
Ga0209648_1020644923300026551Grasslands SoilMPVSKKGHRARYASRTSWLEAIREVPSTWLEDEAKHSPSPHSLLAAWKGERGVPAHVMLTLLRRRLREHNPELMQTTFPATRLQRTQDMLKAIWARAGAAGGKHPTWRLVDGLLRMAREYMDADRSTPDRSQPRHDQGQ
Ga0207596_10652413300026772SoilMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHAER
Ga0209967_108037713300027364Arabidopsis Thaliana RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMALDRATPVPAAPDHVQPAHA
Ga0209984_104613713300027424Arabidopsis Thaliana RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWWLVEGLLRMALDRATPVPAAPDHVQPAHAERAP
Ga0209968_110093613300027526Arabidopsis Thaliana RhizosphereMTMPKMTPPHHRYANRTPWPEASREVPAAWLEEEARCSTSPYSLLASWKGSRGGVPSHVMLKMLRRRLREHAPDAPRTAAPPARLQRAQDLLAEIWARSGPAGEKHPSWRLVEGLLRMA
Ga0209117_101881723300027645Forest SoilMASRDMPVSKRRGTKPGYARRTPWLEAIREVPATWLEDEAKHSPSPHSLLAAWKGDRGVSAHVMLTLLRRRLREHNPELMQTTLPAARLQRAQHMLKEIWGRAGIAGAKHPTWRLVEALLRLVRDHLEPARARAEREQPRHDRVP
Ga0209117_111147223300027645Forest SoilMPMSKRPVNAHGYASRTLWEEATREVPSAWLEDEARSSSSPYSLLASWKGPRGVPSHVMIKLLRRRLREHYPESMEIPLPAARLQRARDMLEEIWARTGKGAGKHPTWRLVEGLLRWARDQYGAGPGGP
Ga0209588_102017023300027671Vadose Zone SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGVPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEEIWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRTS
Ga0209180_1056766613300027846Vadose Zone SoilMSTRRVKSHGYANRTPWHEATREVPSAWLENEARSSPSPYSLRASWKGDRGVPSHVMIKLLRRRLREHYPELMKTALPPARLQRAHDMLEEIWARAGGVGGEHPTWRLVEGLLRLARDHMEAVRAKRD
Ga0209814_1011617533300027873Populus RhizosphereMTKTIVKSHGYANRTPWSEASREVPVTWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKHPVWRLLEGLLRMALDRVETAQIPPR
Ga0209488_1091952913300027903Vadose Zone SoilSTRRVKSHGYANRTPWHEATREVPSAWLENEARSSPSPYSLLASWKGDRGVPSHVMIKLLRRRLREHYPELMKTALPPARLQRAHDMLEEIWARAGGVGGEHPTWRLVEGLLRLARDHMEAVRAKRD
Ga0209526_1010008023300028047Forest SoilMLGGIFWGASMSKQGVKSHGYANRTPWEEATREVPSAWLEDEARKSSSPYSLLASWKGPRGGVPPHVMIKLLRRRLRESYPELKKTSFPPASLQRAQDTLKQIWARVGKTGERHSSWRLVEGLLRLISKSLGGARVEPVQRAHLRRDQRR
Ga0209526_1083761913300028047Forest SoilMIKSHGHANRTPWSEAVREVPAAWLEDEARNSPSPYSLLASWKGIRGGVPSHVMLKLLRRRLREFQPELMNTTMPPARLQRAQETLKEIWVQVAGAGGTHPVWRLVEGVLRLALDRLGTVRPESDHTNQMLKDRPP
Ga0137415_1005483653300028536Vadose Zone SoilMTNQRVKSHGYANRTPWSEASREVPAVWLEDEARNSPSPYSLLASWKGARGGVPSHVMIKMLRRRLRECQPDLMKTASPPPRLQRAQDMLTEIWALMDGAGEKHPVWRLVEGLLRIALDRVTSVRTAPDHSDHLEQVRPDRVP
Ga0257175_100680723300028673SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETRKSPSPYSLLSSWKGGRGGMPAHVMIKLLRRRLRENHPALMNAAPPPARLQRAQDMLEELWARAGEANETNRGRVWRLVEGVLRLALDRLGTVRVEPDSPVQTRTDRTP
Ga0307293_1003659633300028711SoilMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKRPVWRLIEGLLRMALDPVETVRIPPRTSRRAATGWSR
Ga0307299_1035490113300028793SoilMTKTIVKSHGYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRESQPDQMDTVIPPARLQRAQDLLTEIWARTGGAGEKHPVWRLIEGLLRMALDRVETV
Ga0307308_1049613613300028884SoilMTKTIVKPHRYANRTPWSEASREVPATWLEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLRECQPDLMDTVIPPARLQRAQAMLTEIWARTGGAGEKHPVWRLVEGLLRMALD
Ga0308309_1095667213300028906SoilKRPASAHGYASRTPWEEAAREVPSAWLEDEARNSPSPYSLLASWKGPRGVPAHVMIKLLRRRLREHYPELMDIPLPAARLQRALDTLAEIWARAGDSAGKHPTWRLVEGLLRLARDHTVQAKPGGDQSRHDQGH
Ga0222749_1017199613300029636SoilPWEEATREVPSAWLEDEARKSSSPYSLLASWKGPRGGVPPHVMIKLLRRRLRESHPELKKTTFPPASLQRAQDTLKQIWARVGKTGERHSSWRLVEGLLRLISKSLGDARVEPIRAQLRRDQRR
Ga0222749_1026902623300029636SoilMSKRPASAHGYASRTPWEEAAREVPSAWLEDEARNSPSPYSLLASWKGPRGVPAHVMIKLLRRRLREHYPELMDIPLPAARLQRALDTLAEIWARAGDSAGKHPTWRLVEGLLRLARDHTVQAKPGGDQSRHDQGH
Ga0247826_1155896613300030336SoilMTSSRVKAHGYANRTPWSEASREVPAVWLEDEARNSPSPYSLLASWKGARGGVPSHVMIKLLRRRLRECQPNLKKTAGTPPRLQRAQDMHAQIWARVDGVGEKHPVWRLVEGLLRISLDRVTTGRTTTPDHADPEPVRTEHAP
(restricted) Ga0255311_107586113300031150Sandy SoilMTNSRVKVHGYANRTPWSEASREVPAVWLEDEARSSPSPYSLLASWKGARGGVPSHVMIKLLRRRLRECQPGVPKTASPPPRLQRSQDMLAEIWARVGGAGEKHPVWRLVEGLLRISLDRVTTDQTAPDPADPEPSRAEPAP
(restricted) Ga0255310_1001184333300031197Sandy SoilMERRVKSHGCANRTPWAEASREVPAVWLEDEARSSPSPYSLLASWKGARGGVPSHVMIKLLRRRLRECQPGVPKTASPPPRLQRAQDMLAEIWARVGGAGEKHPVWRLVEGLLRISLDRVTTDQTAPDPADPEPSRAEPAP
(restricted) Ga0255310_1017826613300031197Sandy SoilGRHMSQRNVRSHGYANRTLWQDATREVPAAWLVEEARSSPSPYSLLASWKGERGGVPSHVMIKLLRRRLVEQYPELMKTPLPPAELQSAQDMLKQIWARAGGVGERHPMWRLVESVLRVASEQMTTPRIGPDEPAPPAPDGAG
(restricted) Ga0255310_1023058313300031197Sandy SoilMTKQRVKSHGYANRTPWSEAAREVPFAWLEDEARKSPSPYSLLASWKGPRGGVPSHVMIELLRRRLRESYPDLKKTTFPPARLQRSQDMLQQIWARVGRAGERHPSWRLVEGLLR
Ga0307469_1001845093300031720Hardwood Forest SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPRARLQRAQDMLTEIWVRAGGAGEKDRGRVWRVVEGVLRLALDCMETVPVEPDHPIQTPGRAP
Ga0307469_1007330823300031720Hardwood Forest SoilMKPSVKSHGYANRTPWSEATREVPSAWLEDEAKNSPSPYSLLASWKGNRGGVPSHVMIKLLRRRLSEYQPDLLKTPMPAARLQRAQDMLKEIWARAGGAGEKHTVWRLVEGVLRMALDRVGSVPGAPERPDQIRADRAP
Ga0307469_1010438323300031720Hardwood Forest SoilMEKMSERTVKAHGYKSRTPWAEAIREVPSAWLEDEAKNSPSPYSLLASWKGQRGGVPAHVMIRLIRRRLREHSPELVQTALPPASLQRAQDMLGEIWARTAGPGDRPRIWRLVEGLLRIVSGLLGGRPG
Ga0307469_1057467723300031720Hardwood Forest SoilMKPRVKSPGYANRTPWSEASREVPAAWLEDEAKNSPSPYSLLSSWKGSRGGVPSHVMIKMLRRRLSEYEPDPMKTMMSPPRLHRAQEMLTEIWERTGGAGEKHPVWRLVEGLLRMALDRVGTVRAAPDHLERIHPDRMP
Ga0307468_10177684913300031740Hardwood Forest SoilEDEARSSPSPYSLLASWKGSRGGVPSHVMIKMLRRRLSEYEPDPMKTMMSPPRLHRAQEMLTEIWERTGGAGEKHPVWRLVEGLLRMALDRVGTVRAAPDHLERIHPDRMP
Ga0307475_1154254613300031754Hardwood Forest SoilRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWARAGGASEKDRGRVWRVVEGVLRLALDCMETVPVEPDHPIQTPGRAP
Ga0307473_1057007113300031820Hardwood Forest SoilKSHGYANRTPWSEATREVPAAWLEDEAKNSPSPYSLLASWKGNRGGVPSHVMIKLLRRRLSEYQPDLLKTPMPAARLQRAQDMLKEIWARAGGAGEKHTVWRLVEGVLRMALDRVGSVPGAPERPDQIRADRAP
Ga0307479_1074383613300031962Hardwood Forest SoilMTKPSVKSQGFANRTPWAEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETQPDLMNAALPPARLQRAQDMLTEIWVRAGGAGEKDRGRVWRVVEGVLRLALDCMETVPVEPDHPIQTPGRAP
Ga0307471_10092690023300032180Hardwood Forest SoilMSKRPVKAQGYASRTPWEEAAREVPSAWLEDEARNSASPYSLLASWKGPRGVPAHVMIKLLRRRLREHYPELMDIPLPAARLQRALDMLAEIWTRAGESAGKHPTWRLVEGLLRLARDHTVQAKPGGDQSPHAPLDH
Ga0307471_10123753823300032180Hardwood Forest SoilMTKPSVKSQRFANRTPWSEAIREVPSAWLEDETKKSPSPYSLLASWKGGRGGVPAHVMIKLLRRRLRETRPDLMNASLPAARLQRAQDMLSEISTRAGGASEKDRERVWRVVEGVLRLALDRMGTVRVEPDRPGQTRTDQAP
Ga0307472_10016803133300032205Hardwood Forest SoilMEKMSERTVKAHGYKSRTPWAEAIREVPSAWLEDEAKNSPSPYSLRASWKGQRCGVPAHVMISPASLQRAQDILGERWARTAGPGDRPRIWRLVEGLLRMVSGLLGGRPG
Ga0370495_0089221_342_7523300034257Untreated Peat SoilMKTTVKTHRYANRTPWSEASHEVPAAWLEDEARSSPSPYSLLASWKGRRGGVPSHVMIKLLRRRLRELQPNPMATVMPPARLQRAQDMLAEIWARTDGAGEKHPVWRLVEGLLRMALDRVTSRARTIRARTSSAWR


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice