NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F058420


Overview

Basic Information
Family ID F058420
Family Type Metagenome / Metatranscriptome
Number of Sequences 135
Average Sequence Length 116 residues
Representative Sequence MSEPMWKRDDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP
Number of Associated Samples 105
Number of Associated Scaffolds 135

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 77.04 %
% of genes near scaffold ends (potentially truncated) 31.11 %
% of genes from short scaffolds (< 2000 bps) 83.70 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.43
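
The percentages in the Quality Assessment block are consistent with whole-number counts over the 135 family sequences. The sketch below recovers those counts; it assumes every percentage uses the full family size as its denominator, which the page does not state explicitly. `pct_to_count` is a hypothetical helper and not part of NMPFamsDB.

```python
FAMILY_SIZE = 135  # "Number of Sequences" reported for family F058420


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole-number count."""
    return round(pct * total / 100)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 77.04,
    "near scaffold end": 31.11,
    "short scaffold (< 2000 bp)": 83.70,
}

# Assumption: each percentage is taken over all 135 family members.
counts = {label: pct_to_count(pct) for label, pct in reported.items()}

for label, n in counts.items():
    # Re-derive the percentage from the count as a round-trip check.
    print(f"{label}: ~{n} of {FAMILY_SIZE} ({100 * n / FAMILY_SIZE:.2f} %)")
```

The same conversion can be applied to the neighborhood frequencies further down the page, where the smallest reported value (0.74) would correspond to a single scaffold under this assumption.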


Most Common Taxonomy
Group Unclassified (58.519 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (22.222 % of family members)
Environment Ontology (ENVO) Unclassified (37.778 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (44.444 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.66%, β-sheet: 0.00%, Coil/Unstructured: 50.34%

Predicted 3D Structure

pTM-score: 0.43

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 135 Family Scaffolds
PF04972	BON	2.96
PF03544	TonB_C	2.96
PF08238	Sel1	1.48
PF01740	STAS	1.48
PF05974	DUF892	1.48
PF02738	MoCoBD_1	1.48
PF00561	Abhydrolase_1	1.48
PF13701	DDE_Tnp_1_4	0.74
PF00924	MS_channel	0.74
PF03992	ABM	0.74
PF01594	AI-2E_transport	0.74
PF02475	Met_10	0.74
PF00593	TonB_dep_Rec	0.74
PF11535	Calci_bind_CcbP	0.74
PF12704	MacB_PCD	0.74
PF01545	Cation_efflux	0.74
PF10771	DUF2582	0.74
PF05170	AsmA	0.74
PF00174	Oxidored_molyb	0.74
PF01636	APH	0.74
PF03781	FGE-sulfatase	0.74
PF00155	Aminotran_1_2	0.74
PF13493	DUF4118	0.74
PF00491	Arginase	0.74
PF14542	Acetyltransf_CG	0.74
PF02065	Melibiase	0.74
PF13561	adh_short_C2	0.74
PF00072	Response_reg	0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 135 Family Scaffolds
COG0810	Periplasmic protein TonB, links inner and outer membranes	Cell wall/membrane/envelope biogenesis [M]	2.96
COG3685	Ferritin-like metal-binding protein YciE	Inorganic ion transport and metabolism [P]	1.48
COG2264	Ribosomal protein L11 methylase PrmA	Translation, ribosomal structure and biogenesis [J]	0.74
COG3965	Predicted Co/Zn/Cd cation transporter, cation efflux family	Inorganic ion transport and metabolism [P]	0.74
COG3915	Uncharacterized conserved protein	Function unknown [S]	0.74
COG3345	Alpha-galactosidase	Carbohydrate transport and metabolism [G]	0.74
COG3264	Small-conductance mechanosensitive channel MscK	Cell wall/membrane/envelope biogenesis [M]	0.74
COG2982	Uncharacterized conserved protein AsmA involved in outer membrane biogenesis	Cell wall/membrane/envelope biogenesis [M]	0.74
COG2520	tRNA G37 N-methylase Trm5	Translation, ribosomal structure and biogenesis [J]	0.74
COG2265	tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD family	Translation, ribosomal structure and biogenesis [J]	0.74
COG0010	Arginase/agmatinase family enzyme	Amino acid transport and metabolism [E]	0.74
COG2041	Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases	Energy production and conversion [C]	0.74
COG1262	Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domain	Posttranslational modification, protein turnover, chaperones [O]	0.74
COG1230	Co/Zn/Cd efflux system component	Inorganic ion transport and metabolism [P]	0.74
COG1092	23S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmI	Translation, ribosomal structure and biogenesis [J]	0.74
COG0668	Small-conductance mechanosensitive channel	Cell wall/membrane/envelope biogenesis [M]	0.74
COG0628	Predicted PurR-regulated permease PerM	General function prediction only [R]	0.74
COG0053	Divalent metal cation (Fe/Co/Zn/Cd) efflux pump	Inorganic ion transport and metabolism [P]	0.74



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	58.52 %
All Organisms	root	All Organisms	41.48 %


Associated Scaffolds


Scaffold	Taxonomy	Length
2170459021|G14TP7Y02H7UGL	Not Available	680
3300002245|JGIcombinedJ26739_100209902	All Organisms → cellular organisms → Bacteria	1839
3300004479|Ga0062595_100024217	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	2369
3300004479|Ga0062595_100138003	All Organisms → cellular organisms → Bacteria → Acidobacteria	1390
3300005334|Ga0068869_100205639	All Organisms → cellular organisms → Bacteria	1554
3300005367|Ga0070667_100464669	Not Available	1157
3300005434|Ga0070709_10307079	Not Available	1160
3300005437|Ga0070710_10282935	Not Available	1077
3300005445|Ga0070708_100007101	All Organisms → cellular organisms → Bacteria	8939
3300005445|Ga0070708_101512672	Not Available	625
3300005445|Ga0070708_102052781	All Organisms → cellular organisms → Bacteria → Acidobacteria	529
3300005518|Ga0070699_100453441	All Organisms → cellular organisms → Bacteria	1163
3300005536|Ga0070697_101210756	Not Available	673
3300005546|Ga0070696_100333623	All Organisms → cellular organisms → Bacteria	1170
3300005546|Ga0070696_101725445	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus flavefaciens	540
3300005598|Ga0066706_10198534	Not Available	1539
3300005615|Ga0070702_101638558	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. TOR3209	533
3300005921|Ga0070766_10773571	Not Available	653
3300006050|Ga0075028_100019279	Not Available	2983
3300006050|Ga0075028_100081307	Not Available	1621
3300006057|Ga0075026_100811613	Not Available	568
3300006059|Ga0075017_100628724	Not Available	822
3300006059|Ga0075017_101005647	Not Available	649
3300006102|Ga0075015_100967916	Not Available	519
3300006172|Ga0075018_10419247	Not Available	684
3300006175|Ga0070712_101301612	Not Available	633
3300006176|Ga0070765_100506096	All Organisms → cellular organisms → Bacteria → Acidobacteria	1135
3300006354|Ga0075021_10564717	Not Available	724
3300006904|Ga0075424_101154722	All Organisms → cellular organisms → Bacteria → Acidobacteria	825
3300007258|Ga0099793_10072342	All Organisms → cellular organisms → Bacteria → Acidobacteria	1561
3300009038|Ga0099829_10058373	All Organisms → cellular organisms → Bacteria	2885
3300009038|Ga0099829_10142541	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1906
3300009038|Ga0099829_10706702	Not Available	837
3300009088|Ga0099830_11388519	Not Available	584
3300009093|Ga0105240_10591198	Not Available	1223
3300009101|Ga0105247_10063762	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	2288
3300009137|Ga0066709_102637720	All Organisms → cellular organisms → Bacteria	672
3300009143|Ga0099792_10342602	Not Available	900
3300009147|Ga0114129_12533715	All Organisms → cellular organisms → Bacteria → Acidobacteria	613
3300009174|Ga0105241_11453271	All Organisms → cellular organisms → Bacteria	658
3300009545|Ga0105237_12675583	Not Available	510
3300010371|Ga0134125_11753652	Not Available	675
3300010375|Ga0105239_10360244	All Organisms → cellular organisms → Bacteria	1642
3300011120|Ga0150983_13389143	Not Available	518
3300011269|Ga0137392_10664432	Not Available	863
3300011271|Ga0137393_10625984	Not Available	923
3300012189|Ga0137388_11050705	Not Available	750
3300012205|Ga0137362_11074159	Not Available	684
3300012361|Ga0137360_10307018	Not Available	1318
3300012361|Ga0137360_11335102	Not Available	619
3300012363|Ga0137390_10089903	All Organisms → cellular organisms → Bacteria → Acidobacteria	3028
3300012363|Ga0137390_11184585	Not Available	711
3300012685|Ga0137397_10969651	Not Available	627
3300012918|Ga0137396_10053767	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	2769
3300012922|Ga0137394_10667946	Not Available	876
3300012923|Ga0137359_10205165	Not Available	1760
3300012925|Ga0137419_10842602	Not Available	752
3300012927|Ga0137416_11649809	Not Available	584
3300012929|Ga0137404_11562275	All Organisms → cellular organisms → Bacteria	611
3300012930|Ga0137407_10022311	All Organisms → cellular organisms → Bacteria	4780
3300012930|Ga0137407_10523992	All Organisms → cellular organisms → Bacteria	1108
3300012957|Ga0164303_10273009	Not Available	981
3300012961|Ga0164302_11700427	Not Available	529
3300012988|Ga0164306_11879734	Not Available	521
3300012989|Ga0164305_11442715	Not Available	608
3300013297|Ga0157378_10023478	All Organisms → cellular organisms → Bacteria → Acidobacteria	5427
3300014325|Ga0163163_11811787	Not Available	671
3300015241|Ga0137418_10146703	All Organisms → cellular organisms → Bacteria → Acidobacteria	2082
3300015264|Ga0137403_10001287	All Organisms → cellular organisms → Bacteria	31637
3300015264|Ga0137403_10590771	Not Available	976
3300015371|Ga0132258_10012079	All Organisms → cellular organisms → Bacteria → Acidobacteria	18127
3300015371|Ga0132258_10070711	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	8091
3300017927|Ga0187824_10076124	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1059
3300019877|Ga0193722_1129045	Not Available	574
3300019878|Ga0193715_1086095	Not Available	647
3300019881|Ga0193707_1100203	Not Available	866
3300019885|Ga0193747_1025219	Not Available	1470
3300019885|Ga0193747_1046269	Not Available	1082
3300020001|Ga0193731_1013766	Not Available	2091
3300020004|Ga0193755_1176403	Not Available	629
3300020012|Ga0193732_1000163	All Organisms → cellular organisms → Bacteria	12940
3300020022|Ga0193733_1000442	All Organisms → cellular organisms → Bacteria	13919
3300020579|Ga0210407_10014350	All Organisms → cellular organisms → Bacteria	5889
3300020579|Ga0210407_10285878	All Organisms → cellular organisms → Bacteria	1287
3300020583|Ga0210401_10196830	Not Available	1873
3300020583|Ga0210401_10284980	Not Available	1512
3300021086|Ga0179596_10470855	Not Available	636
3300021088|Ga0210404_10432111	All Organisms → cellular organisms → Bacteria	739
3300021168|Ga0210406_11288713	Not Available	527
3300021170|Ga0210400_10409543	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1119
3300021170|Ga0210400_11394970	Not Available	559
3300021178|Ga0210408_10566552	Not Available	901
3300021344|Ga0193719_10085424	All Organisms → cellular organisms → Bacteria	1372
3300021432|Ga0210384_10266421	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	1542
3300021432|Ga0210384_10363238	Not Available	1304
3300021432|Ga0210384_10884537	Not Available	793
3300021478|Ga0210402_10346857	All Organisms → cellular organisms → Bacteria → Acidobacteria	1377
3300021478|Ga0210402_11219293	Not Available	680
3300021479|Ga0210410_10277711	Not Available	1503
3300025900|Ga0207710_10076926	All Organisms → cellular organisms → Bacteria	1541
3300025910|Ga0207684_10004634	All Organisms → cellular organisms → Bacteria → Acidobacteria	12910
3300025911|Ga0207654_10905106	Not Available	640
3300025915|Ga0207693_11216939	Not Available	567
3300025916|Ga0207663_11309617	Not Available	583
3300025939|Ga0207665_10183825	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1515
3300026480|Ga0257177_1077265	Not Available	537
3300027645|Ga0209117_1044491	All Organisms → cellular organisms → Bacteria → Acidobacteria	1336
3300027846|Ga0209180_10174417	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1240
3300027875|Ga0209283_10144131	All Organisms → cellular organisms → Bacteria → Acidobacteria	1578
3300027889|Ga0209380_10600451	Not Available	637
3300027894|Ga0209068_10023943	All Organisms → cellular organisms → Bacteria → Acidobacteria	2993
3300028047|Ga0209526_10002880	All Organisms → cellular organisms → Bacteria	11670
3300028047|Ga0209526_10005612	All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae	8561
3300028047|Ga0209526_10127243	All Organisms → cellular organisms → Bacteria	1789
3300028047|Ga0209526_10148068	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	1644
3300028047|Ga0209526_10575662	All Organisms → cellular organisms → Bacteria → Acidobacteria	724
3300028536|Ga0137415_10576848	Not Available	935
3300028673|Ga0257175_1086566	Not Available	605
3300028906|Ga0308309_11198095	Not Available	655
3300029636|Ga0222749_10688979	Not Available	559
3300031231|Ga0170824_128496579	Not Available	534
(restricted) 3300031248|Ga0255312_1119735	Not Available	648
3300031720|Ga0307469_12399063	Not Available	515
3300031740|Ga0307468_100675985	Not Available	858
3300031820|Ga0307473_10768051	Not Available	684
3300031962|Ga0307479_11716055	All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium	581
3300032180|Ga0307471_100002964	All Organisms → cellular organisms → Bacteria	9960
3300032180|Ga0307471_101204560	Not Available	921
3300032180|Ga0307471_103006812	Not Available	598
3300032180|Ga0307471_104067345	Not Available	517
3300032205|Ga0307472_100334027	Not Available	1236
3300032205|Ga0307472_100637470	Not Available	947
3300032205|Ga0307472_100970672	All Organisms → cellular organisms → Bacteria → Acidobacteria	793
3300032205|Ga0307472_102534737	All Organisms → cellular organisms → Bacteria	522
3300033433|Ga0326726_10641241	Not Available	1023
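
Each scaffold identifier above appears to consist of a numeric prefix and a scaffold name joined by `|`, and the prefix matches a Taxon OID listed under Associated Samples (for example 2170459021). A minimal sketch of splitting the identifiers on that basis follows; the layout is inferred from this page rather than from a documented format, and `split_scaffold_id` is a hypothetical helper.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split 'taxonOID|scaffoldName' into (taxon OID, scaffold name)."""
    # Assumption: a leading "(restricted)" tag is a display marker only.
    cleaned = scaffold_id.replace("(restricted)", "").strip()
    taxon_oid, _, name = cleaned.partition("|")
    return taxon_oid, name


# A few identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "2170459021|G14TP7Y02H7UGL",
    "3300005445|Ga0070708_100007101",
    "3300005445|Ga0070708_101512672",
    "(restricted) 3300031248|Ga0255312_1119735",
]

# Count how many of the listed scaffolds come from each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)
print(per_sample.most_common())
```

Grouping by the prefix in this way is one possible route to linking a scaffold row to its sample name and habitat type.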

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	22.22%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	13.33%
Corn, Switchgrass And Miscanthus Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere	11.11%
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	10.37%
Hardwood Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil	8.89%
Watersheds	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds	6.67%
Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil	5.93%
Corn Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere	3.70%
Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil	2.96%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil	1.48%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	1.48%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	1.48%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere	1.48%
Freshwater Sediment	Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment	0.74%
Terrestrial Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil	0.74%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	0.74%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	0.74%
Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil	0.74%
Sandy Soil	Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil	0.74%
Peat Soil	Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil	0.74%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere	0.74%
Switchgrass Rhizosphere	Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere	0.74%
Miscanthus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere	0.74%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere	0.74%
Switchgrass, Maize And Mischanthus Litter	Engineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter	0.74%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
2170459021	Litter degradation NP4	Engineered
3300002245	Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)	Environmental
3300004479	Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs	Environmental
3300005334	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2	Host-Associated
3300005367	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG	Host-Associated
3300005434	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG	Environmental
3300005437	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG	Environmental
3300005445	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG	Environmental
3300005518	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG	Environmental
3300005536	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG	Environmental
3300005546	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG	Environmental
3300005598	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155	Environmental
3300005615	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG	Environmental
3300005921	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6	Environmental
3300006050	Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014	Environmental
3300006057	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012	Environmental
3300006059	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012	Environmental
3300006102	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013	Environmental
3300006172	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014	Environmental
3300006175	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG	Environmental
3300006176	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5	Environmental
3300006354	Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012	Environmental
3300006904	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3	Host-Associated
3300007258	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3	Environmental
3300009038	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG	Environmental
3300009088	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG	Environmental
3300009093	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG	Host-Associated
3300009101	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG	Host-Associated
3300009137	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158	Environmental
3300009143	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2	Environmental
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009174	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG	Host-Associated
3300009545	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG	Host-Associated
3300010371	Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1	Environmental
3300010375	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG	Host-Associated
3300011120	Combined assembly of Microbial Forest Soil metaT	Environmental
3300011269	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG	Environmental
3300011271	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG	Environmental
3300012189	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG	Environmental
3300012205	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG	Environmental
3300012361	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG	Environmental
3300012363	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG	Environmental
3300012685	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG	Environmental
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012922	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG	Environmental
3300012923	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG	Environmental
3300012925	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)	Environmental
3300012927	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300012930	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)	Environmental
3300012957	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG	Environmental
3300012961	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG	Environmental
3300012988	Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG	Environmental
3300012989	Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG	Environmental
3300013297	Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG	Host-Associated
3300014325	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG	Host-Associated
3300015241	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)	Environmental
3300015264	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300017927	Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4	Environmental
3300019877	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m1	Environmental
3300019878	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m2	Environmental
3300019881	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2	Environmental
3300019885	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2	Environmental
3300020001	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2	Environmental
3300020004	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2	Environmental
3300020012	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s1	Environmental
3300020022	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2	Environmental
3300020579	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M	Environmental
3300020583	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M	Environmental
3300021086	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)	Environmental
3300021088	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M	Environmental
3300021168	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M	Environmental
3300021170	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M	Environmental
3300021178	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M	Environmental
3300021344	Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2	Environmental
3300021432	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M	Environmental
3300021478	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M	Environmental
3300021479	Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M	Environmental
3300025900	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)	Host-Associated
3300025910	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)	Environmental
3300025911	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes)	Host-Associated
3300025915	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)	Environmental
3300025916	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)	Environmental
3300025939	Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)	Environmental
3300026480	Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B	Environmental
3300027645	Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)	Environmental
3300027846	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)	Environmental
3300027875	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)	Environmental
3300027889	Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)	Environmental
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
4NP_020529102170459021Switchgrass, Maize And Miscanthus LitterMWKREDGTRPMPSWAMYAEAMNEFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP
JGIcombinedJ26739_10020990243300002245Forest SoilMSEAMWKKDDGMGTQPTPTWAMYADAMNKFSRSATAFMEHVHLLTEARTAYQEAMAIGTALRIRLDAGDQTLRSLMTQLEQVVNNHLGEPVLDRKKPEPVEVESTRAKKEGTGMMFL*
Ga0062595_10002421723300004479SoilMSETPWRKDESMGLQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRGRLDAGDQTLKSIMTQLEQVVTDHLAEPVLDRKKPELVRAEPARGRNEVTGTGRTFP*
Ga0062595_10013800323300004479SoilMWKKEEGTDTLPTPTFATYAEAANKFRNSATAFMEHVHLLTEARGAYQEAMSASTELRNRLDAGDQTLRSVMTQLEQVVSDHLSEPMLDRKKPELVKVEPTRAKNEGTGTDRAFP*
Ga0068869_10020563913300005334Miscanthus RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNT
Ga0070667_10046466923300005367Switchgrass RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENART
Ga0070709_1030707933300005434Corn, Switchgrass And Miscanthus RhizosphereMETIWKKQEDTGPQPTPSWALYAEAANRFRSSAEAFMEHVYLLTEARNAYQQALSASTELRNTLDAGDQALRSIMTQLEQVIGDHLSEPVLDRKKPELVRSEPSGTWNESTGTDRIFP*
Ga0070710_1028293513300005437Corn, Switchgrass And Miscanthus RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNEFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0070708_10000710133300005445Corn, Switchgrass And Miscanthus RhizosphereMSEAMWKKEEGTGSQPTPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDQALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPNRGKNEGTGTGRTFP*
Ga0070708_10151267223300005445Corn, Switchgrass And Miscanthus RhizosphereMWKKEEGMGTQAVPTWATYAEAMNKFTKSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDQTLRSLMNQLEQVVNAHLSEPVLDRKKPELVTAESTRAKN
Ga0070708_10205278113300005445Corn, Switchgrass And Miscanthus RhizosphereGGCQMSEAMWKKEEGMGTQPMPTWATYAETMNKFTRSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDQTLRSLMAQLEQVVNNHLSGPFADRKKPELVRAEPTREKNEGTGASRMFP*
Ga0070699_10045344133300005518Corn, Switchgrass And Miscanthus RhizosphereLYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDQALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPNRGKNEGTGTGRTFP*
Ga0070697_10121075613300005536Corn, Switchgrass And Miscanthus RhizosphereMSEPMWKRDDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEETRTSDQATRPWKAFP*
Ga0070696_10033362313300005546Corn, Switchgrass And Miscanthus RhizosphereFMEHVHLLTEARGAYQEAMSASTELRNRLDAGDQTLRSVMTQLEQVVSDHLSEPMLERKKPELVKVEPTRAKNEGTGTDRAFP*
Ga0070696_10172544513300005546Corn, Switchgrass And Miscanthus RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0066706_1019853423300005598SoilMEAIWKKQEDTGPQPTPSWALYAEAANRFRSSAEAFMEHVYLLTEARNAYQQALSASTELRNTLDAGDQALRSIMTQLVQVIGDHLSEPVLDRKKPELVRSEPSGTWNESTGTDRIFP*
Ga0070702_10163855813300005615Corn, Switchgrass And Miscanthus RhizosphereMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0070766_1077357113300005921SoilMSEPMWKREDSKGTQPVPSWMTYAEAMNQFNRSARAFMEQVHLLTEVRTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGDPPLDGKKPEPVRGENTRPNSQATGTWRLP*
Ga0075028_10001927923300006050WatershedsMSEAPWRRDEGTGQEPTPSWAPYAEAANRFRSCAEAFMEHVHLLTEARSAYQQAMSASTELRSRLDAGDQTLKSIMTQLEQVVTDHLSEPVVERKKPELVKVEPTIIAGTGRIFP*
Ga0075028_10008130733300006050WatershedsMMEAMWRKENGTGKQPPSWALYAEAADRFRGSAEAFMEHVHLLTEARDAYQQAMSASSELRNRLDAGDQTLRSVMTQLEQVVTDHLSESVPDKKRPELVKVEPTRGKNEGTGAGMMFP*
Ga0075026_10081161313300006057WatershedsMAEPMWKKEEDIDTQSTKTWAMYAEAMNQFSGSARAFMEHVHLLTEARTAYQEAMAVSTELRMKLDAGDETLRSLMTKLEQVVKSHMSEPVLDRKRPELVKDDSGRARNE
Ga0075017_10062872423300006059WatershedsMSEPTWKREDGTQPIPSWATYAEAMNEFTKSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMLQLEQVVSAHMSETPDGKKPELVKNENARTSGQATGTWKV
Ga0075017_10100564713300006059WatershedsMSEVIWKKEDGIGTQPVPTWATYAEAMNKFSKSATAFMEHVHLLTEARTAYHEAVAAGTALRNRLDAGDQTLRSLMTQLEQVVNEHFGEPALDKKKLELVKDESTKAKNEGTG
Ga0075015_10096791613300006102WatershedsKLDQIIRGYQMMEAMWRKENGTGKQPPSWALYAEAADRFRGSAEAFMEHVHLLTEARDAYQQAMSASSELRNRLDAGDQTLRSVMTQLEQVVTDHLSESVPDKKRPELVKVEPTRGKNEGTGAGMMFP*
Ga0075018_1041924713300006172WatershedsIIRGYQMMEAMWRKENGTGKQPPSWALYAEAADRFRGSAEAFMEHVHLLTEARDAYQQAMSASSELRNRLDAGDQTLRSVMTQLEQVVTDHLSESVPDKKRPELVKVEPTRGKNEGTGAGMMFP*
Ga0070712_10130161213300006175Corn, Switchgrass And Miscanthus RhizosphereRSSAEAFMEHVYLLTEARNAYQQALSASTELRNTLDAGDQALRSIMTQLEQVIGDHLSEPVLDRKKPELVRSEPSGTWNESTGTDRIFP*
Ga0070765_10050609623300006176SoilMWKREDGVGTQPMPSWAMYAEAMNKFTRSATAFIEQVHLLTEARTAYQEAMAVGTELRNRLDAGDRTLRGLMAQLEQVVSAHLGEAPLNGMKPELVKGENARTNSQATGTWQAFP*
Ga0075021_1056471713300006354WatershedsMSDSMWKKEDITSPQPAPSWILYAEAANKFRNSATAFMEHIHLLTEARTAYQEAMVASAELRHRLDAGDQALKSVMSQLEQVVTDHLSEPGLDRKKPELVKVEMTKGKTEGTGTRGMFP*
Ga0075424_10115472223300006904Populus RhizosphereMWKKEEGTDTLPTPTFATYAEAANKFRNSATAFMEHVHLLTEARGAYQEAMSASTELRNRLDAGDQTLRSVMTQLEQVVSDHLSEPMLERKKPELVKVEPTRAKNEGTGTDRAFP*
Ga0099793_1007234213300007258Vadose Zone SoilMSEAMWKKEEGTGSQPTPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDEALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPTRGKNEGTGTGRTFP*
Ga0099829_1005837333300009038Vadose Zone SoilMSEAMWKKDEGTGPQATPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRSRLNAGDQALRSIMTQLEQVVTDHLSEPVLDRKRPELVKVEPPREKNEGTGTRRSFP*
Ga0099829_1014254143300009038Vadose Zone SoilMWKKEEGMGTQAVPTWATYAEAMNKFTKSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDQTLRSLMNQLEQVVNAHLSEPVLDRKKPELVTAESTRAKNEGTGTGAMFP*
Ga0099829_1070670213300009038Vadose Zone SoilCCWPNRGKSGIEGQMSQPIWKRDDGMGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP*
Ga0099830_1138851913300009088Vadose Zone SoilLGYAFRALVICTLLLAKQSKSGIEGQMSEPMWKRENGKGTQPMPSWTTYAEAMNQFNRSATAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGEPPLDGKKPEPVKGENIRPNSQATGTWRLP*
Ga0105240_1059119813300009093Corn RhizosphereMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0105247_1006376223300009101Switchgrass RhizosphereMWKREDGTRPMPSWAMYAEAMNEFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0066709_10263772013300009137Grasslands SoilNRFSRSARAFMEHVHYLTEARSAYQEALSVGAELRNRLDAGDQTLKSLMDQLEQVLHVHMSEPGLDRKKPELVKGEQIRQEGTGTYRNLP*
Ga0099792_1034260213300009143Vadose Zone SoilMSEPMWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLDGEKPELVKGENTKTKRQTTGTWKAFP*
Ga0114129_1253371523300009147Populus RhizosphereFATYAEAANKFRNSATAFMEHVHLLTEARGAYQEAMSASTELRNRLDAGDQTLRSVMTQLEQVVSDHLSEPMLERKKPELVKVEPTRAKNEGTGTDRAFP*
Ga0105241_1145327113300009174Corn RhizosphereMWKKEDGTAPQPTPSWALYTDAANKFRSSAEAFMEHVPLLTEARSAYQEAMLVSAELRNRLDASDEALKSVMTQLEQVVNDHLDEPALDRKKPKAESSRAKNENTGTGGMFP*
Ga0105237_1267558313300009545Corn RhizosphereMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNETARTSGQNTGTWKSFP*
Ga0134125_1175365213300010371Terrestrial SoilMEATWKKDESTGPQPTPSWAMYAEAANKFRSFAEAFMEHVHLLTEARSAYQAAMSASTELRRRLDAGDQTLRSLMSQLEQVVNDHLSEPLVDRKKPELVRDENTRTNSQAPDTWKAFP*
Ga0105239_1036024433300010375Corn RhizosphereRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP*
Ga0150983_1338914313300011120Forest SoilMEGSQMNEPMWKKENGMSTQPVPTWATYAEAMNKFSKSATAFMEHVHLLTEARSAYHEAIAVGTTLRNRLDAGDQTLKSLMIQLEQVVNEHFAGPALDKKKLELVKDESTSAKNE
Ga0137392_1066443213300011269Vadose Zone SoilMSEPMWKRDDGIGTKPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP*
Ga0137393_1062598423300011271Vadose Zone SoilMSEPMWKRENGKGTQPMPSWTTYAEAMNQFNRSATAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGEPPLDGKKPEPVKGENIRPNSQATGTWRLP*
Ga0137388_1105070513300012189Vadose Zone SoilMSEPMWKRDDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP*
Ga0137362_1107415913300012205Vadose Zone SoilMMEPIWKKQEGTGPQPTPAWALYSEAANRFRRSAEAFMEHVHLLTEARSAYQQAMSTSTELRNRLDAGDQALRSVMTQLEQVITDHLSEPVLDRKKPELVRSEPSATNESTGTGRTFP*
Ga0137360_1030701833300012361Vadose Zone SoilMMEPIWKKQEGTGPQPTPAWALYSEAANRFRRSAEAFMEHVHLLTEARSAYQQAMSTSTELRNRLDAGDQALRSVMTQLEQVITDHLSEPVLDRKKPELVRSETSATNESTGTGRTFP*
Ga0137360_1133510223300012361Vadose Zone SoilMWKREISNSTQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSDHLGEPPLDGEKPELVKGENTKTKTQTTGTWKGFP*
Ga0137390_1008990343300012363Vadose Zone SoilMSEAMWKKDEGTGPQATPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRSRWNAGDQALRSIMTQLEQVVTDHLSEPVLDRKRPELVKVEPPREKNEGTGTRRSFP*
Ga0137390_1118458513300012363Vadose Zone SoilMWKKEDSVGTQPAPTMAMYSEAMEKFTKSATAFMEHVHLLTEARTAYQEAMSVGTELRKRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP*
Ga0137397_1096965113300012685Vadose Zone SoilMWKREISNSAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSDHLGEPPLDGEKPELVKGGNTKTKTQTTGTWKGFP*
Ga0137396_1005376713300012918Vadose Zone SoilMSEPNWKREDGMGTQPMPSWAMYTEAMNEFSKSATAFMGHVHLLTEARYAYQEAMAASTALRNSLDAGDETLRSLMAQLEQVVNNHLGDPVLDKRKPELVKAESIREKNEGTGTGGMYP*
Ga0137394_1066794613300012922Vadose Zone SoilRLLTTRNPKEWMGRMSEPMWKREIGNSAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLDGEKPELVKGENTKMKRQTTGTWKGFP*
Ga0137359_1020516533300012923Vadose Zone SoilMWKREISNSAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSDHLGEPPLDGEKPELVKGENTKTKRQTTGTWKAFP*
Ga0137419_1084260213300012925Vadose Zone SoilMWKREISNSTQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMEVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLEGEKPELVKGENTKTKRQTTGTWKAFP*
Ga0137416_1164980913300012927Vadose Zone SoilMSEPMWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLEGEKPELVKGENTKTKRQTTGTWKGFP*
Ga0137404_1156227513300012929Vadose Zone SoilGTQPMPTWATYAEAMNRFSRSARAFMEHVHYLTEARSAYQEALSVGAELRNRLDAGDQTLKSLMDQLEQVLNVHMSEPGLDRKKPELVKGEQTRHEGTGTYRNLP*
Ga0137407_1002231163300012930Vadose Zone SoilMWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSDHLGEPPLDGEKPELVKGGNTKTKTQTTGTWKGFP*
Ga0137407_1052399213300012930Vadose Zone SoilMGEFKMNDAISKQNDSTGTQPMPTWATYAEAMNRFSRSARAFMEHVHYLTEARSAYQEALSVGAELRNRLDAGDQTLKSLMDQLEQVLNVHMSEPGLDRKKPELVKGEQTRHEGTGTYRNLP*
Ga0164303_1027300913300012957SoilMAINDPISKQGDGMGTQSMPTWATYAEAMNKFSSSARAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDQTLKSLMDQLEQVVNSHMSEPVPDRKKPELVRGETIRGRTEVTGTYKNFP*
Ga0164302_1170042713300012961SoilMLSWLLHGCRRKHESANWFLLKPGDGGQTTMAINDPISKQGDGMGTQSMPTWATYAEAMNKFSSSARAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDQTLKSLMDQLEQVVNSHMSEPVPDRKKPELVRGETIRGRTEVTGTYNNLP
Ga0164306_1187973413300012988SoilMAINDPISKQGDGMGTQSMPTWATYAEAMNKFSSSARAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDQTLKSLMDQLEQVVNSHMSEPVPDRKKPELVRGET
Ga0164305_1144271513300012989SoilGMGTQSMPTWATYAEAMNKFSSSARAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDQTLKSLMDQLEQVVNSHMSEPVPDRKKPELVRGETIRGRTEVTGTYKNFP*
Ga0157378_1002347833300013297Miscanthus RhizosphereMTGGKTTMAINDPISKQGEGIGTQSMPTWATYAEAMNRFSNSARAFMEHVHLLTEARTAYQEAISVGTELRNRLDAGDQTLKSLMEQLEQVVNSHMSEPVPDRKKPELVRGETIRGRTEVTGTYKNFP*
Ga0163163_1181178713300014325Switchgrass RhizosphereMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNEN
Ga0137418_1014670333300015241Vadose Zone SoilMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMEVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLEGEKPELVKGENTKTKRQTTGTWKAFP*
Ga0137403_10001287253300015264Vadose Zone SoilMSEPMWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSDHLGEPPLDGEKPELVKGGNTKTKTQTTGTWKGFP*
Ga0137403_1059077123300015264Vadose Zone SoilMSEAMWKKEEGTGSQPTPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELSNRLDAGDEALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPTRGKNEGTGTGRTFP*
Ga0132258_10012079113300015371Arabidopsis RhizosphereMGLQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRGRLDAGDQTLKSIMTQLEQVVTDHLAEPVLDRKKPELVRAEPARGRNEVTGTGRTFP*
Ga0132258_1007071143300015371Arabidopsis RhizosphereMEDRAMWKKEEGTDTLPTPTFATYAEAANKFRNSATAFMEHVHLLTEARGAYQEAMSASTELRNRLDAGDQTLRSVMTQLEQVVSDHLSEPMLDRKKPELVKVEPTRAKNEGTGTDRAFP
Ga0187824_1007612413300017927Freshwater SedimentVSEATWKRDEGAGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQLAVTASTELRSRLDAGDQALRSIMSQLEQVVTDHLSGPVLDRKKPELVRVEPARAKNEATGTSRAFP
Ga0193722_112904513300019877SoilMAINDPISKQGDGMGTQSMPTWATYAEAMNKFSSSARAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDQTLKSLMDQLEQVVNSHMSEPVPDRKKPELVRGETIRGRTEVTGTYKNF
Ga0193715_108609513300019878SoilMSEGTWNREDGMGTQPVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVINDHLREPVLDRKRPELVRVESSRVKNEGTGTGGMFP
Ga0193707_110020313300019881SoilLLSFRYAFQALVILYSAAPCAAEDGGRQMSEGTWNREDGMGMQAVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVVNDHLREPVLDRKRPELVRVESSRVKNEGTGTGGMFP
Ga0193747_102521913300019885SoilMSEGTWNREDGMGTQPVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVINDHLREPVLDRKRPELVRVETSRVKNEGTGTGGMFP
Ga0193747_104626913300019885SoilMEPIWKKQEGTGPQPTPSWALYSEAANRFRSSAEAFMEHVHLLTETRIAYQQAMSTSTELRNSLDAGDQALRSIMIQLEQIISDHLSEAVLDRKKPELVRSEPSAARNENTGTGRTFP
Ga0193731_101376643300020001SoilMSEGTWNREDGMGTQPVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVINDHLREPVLDRKRPELVRVEPSRVKNEGTGTGGMFP
Ga0193755_117640313300020004SoilMSEPMWKKDEGGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRSRLDAGDQALRSIMTQLEQVVTDHLSEPVLDRKKPELVRVEPTREKNQATGTTRAFP
Ga0193732_100016383300020012SoilMSEGMWNREDGMGTQPVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVINDHLREPVLDRKRPELVRVEPSRVKNEGTGTGGMFP
Ga0193733_100044213300020022SoilMSEGMWNREDGMGTQPVPTWAMYAEAMNKFSRSATAFMEHVHLLVDAGAAYQEAIAVSTELRKRLDAGDQTLRSLMAQLEQVINDHLREPVLDRKRPELVRVEPSRVKNEGTGTG
Ga0210407_1001435053300020579SoilMSEPIWKRENGKGTQPMPSWATYAEGMNQFSRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGEPPLDGKKPDLVKGENTRSNSQATGTWRLP
Ga0210407_1028587813300020579SoilMEATWRKEDNTGPQPTPSWALYAEAANRFRSAAEAFMEHVHLLTEAQSAYQQAMSASTELRNRLDAGDQTLRSVMTQLQQVVSDHLSEAVPDKKRPELVRVEPTRGKNEGTGTGMMFP
Ga0210401_1019683033300020583SoilMSEPMWKREDSKGTQPVPSWMTYAEAMNQFNRSARAFMEQVHLLTEVRTAYQEAMAVGTELRNRLDAGERTLRDLMAQLEQVVSAHLGDPPLDGKKPEPVKGENTRSNSQATGTGRLP
Ga0210401_1028498023300020583SoilMSEPIWKRENGKGTQPMPSWATYAEGMNQFSRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGEPPLDGKKPELVKGENTRSNSQATGTWRLP
Ga0179596_1047085523300021086Vadose Zone SoilMSEAMWKKEEGTGSQPTPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDEALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPTRGKNEGTGTGRTFP
Ga0210404_1043211123300021088SoilQPMPTWAMYAEAMNKFNRSATAFMEHVHLLTEARTAYQEAMVVGTDLRNRLEAGDKTLRDLMTQLEQVVSAHLGEPPLDGKKPELVKGENTRSNSQATGTWRLP
Ga0210406_1128871313300021168SoilMEAIWRKEDNTGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEAQTAYQQAMSASTELRNRLDAGDQTLRSVMTQLQQVVSDHLSEAVPDKKRPELVRVEPTRGKNEGTGTGMMFP
Ga0210400_1040954323300021170SoilMSEPMWKSEDGMGTQPMPTWAMYAEAMNKFNRSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDRTLRGLMAQLEQVVSAHLGEAPLNGKKPELVKGENARTNSQATGTWQAFP
Ga0210400_1139497013300021170SoilRLLTTRNPKEWMGRMSEPQWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYEEAMAVGTELRNRLDAGEGTLRDLMSQLEQLVSAYLGEPLLDGKQPELVKGENTKTNSQAIGTWKGFP
Ga0210408_1056655223300021178SoilMSEPMWKRDDGIGTQPRPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVNAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP
Ga0193719_1008542413300021344SoilMNDAISKQNDSTGTQPMPTWATYAEAMNRFSRSARAFMEHVHYLTEARSAYQEALSVGAELRNRLDAGDQTLKSLMDQLEQVLNVHMSEPGLDRKKPELVKGEQIRHEGTGTYRNLP
Ga0210384_1026642133300021432SoilMWKREDGVGTQPMPSWAMYAEAMNKFTRSATAFIEQVHLLTEARTAYQEAMVVGTDLRNRLEAGDKTLRDLMTQLEQVVSAHSGEPPLDGKKPELVKGENTRTSDQATGTWKAFP
Ga0210384_1036323833300021432SoilMSEPMWKREDSKGTQPVPSWMTYVEAMNQFNRSARAFMEQVHLLTEVRTAYQEAMAVGTELRNRLDAGERTLRDLMAQLEQVVSAHLGDPPLDGKKPEPVKGENTRSNSQATGTGRLP
Ga0210384_1088453713300021432SoilRFRSSAEAFMEHVYLLTEARNAYQQAMSASTELRNTLDAGDQALRSIMTQLEQVIGDHLSEPVLDRKKPELVRSEPSATRNESTGTGRIFP
Ga0210402_1034685733300021478SoilMSEPIWKRENGKGTQPMPSWATYAEGMNQFSRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGEPPLDAKKPELVKGENTRSNSQATGTWRLP
Ga0210402_1121929323300021478SoilMSEPMWKRDDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDVGDRTLRGLMTQLEQVVNAHLGETSLGGKKPELVKGEDTRTSDQATRPWKAFP
Ga0210410_1027771113300021479SoilMSEPIWKRENGKGTQPMPSWATYAEGMNQFSRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQGVSAHLGEPPLDGKKPELVKGENTRSNSQATGTWRLP
Ga0207710_1007692623300025900Switchgrass RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP
Ga0207684_1000463473300025910Corn, Switchgrass And Miscanthus RhizosphereMSEAMWKKEEGTGSQPTPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDQALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPNRGKNEGTGTGRTFP
Ga0207654_1090510613300025911Corn RhizosphereMSEPMWKREDGTRPMPSWAMYAEAMNDFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSG
Ga0207693_1121693923300025915Corn, Switchgrass And Miscanthus RhizosphereRFRSSAEAFMEHVYLLTEARNAYQQALSASTELRNTLDAGDQALRSIMTQLEQVIGDHLSEPVLDRKKPELVRSEPSGTWNESTGTDRIFP
Ga0207663_1130961713300025916Corn, Switchgrass And Miscanthus RhizosphereMSEPMSKREDGTRPMPSWAMYAEAMNEFTKSAEAFMEHVHLLTEARTAYQEAMSVGAELRNRLDAGDRTLRGLMTQLEQVVSAHMGETPPDGKKPELVKNENARTSGQNTGTWKSFP
Ga0207665_1018382523300025939Corn, Switchgrass And Miscanthus RhizosphereMGLQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRGRLDAGDQTLKSIMTQLEQVVTDHLAEPVLDRKKPELVRAEPARGRNEVTGTGRTFP
Ga0257177_107726513300026480SoilMSEAMWKKDEGTGPQATPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRSRLNAGDQALRSIMTQLEQVVTDHLSEPVLDRKRPELVKVEPPREKNEGTGTRRSFP
Ga0209117_104449113300027645Forest SoilMWKREDGMGTQPMPSWAMYTEAVNKFRGSATAFNEHVHLLTEARTAYQEAIAVGTDLRNRLDAGDKTLRDLMTKLENIVQDHLSGTVPDRKKPELVRSEPSTPTNDSTGTGRSFHKNQDA
Ga0209180_1017441723300027846Vadose Zone SoilMSGAMWKKEEGMGTQAVPTWATYAEAMNKFTKSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDQTLRSLMNQLEQVVNAHLSEPVLDRKKPELVTAESTRAKNEGTGTGAMFP
Ga0209283_1014413123300027875Vadose Zone SoilPRWALYAEAANRFRSSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDEALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPTRGKNEGTGTGRTFP
Ga0209380_1060045113300027889SoilMSEPMWKREDSKGTQPVPSWMTYAEAMNQFNRSARAFMEQVHLLTEVRTAYQEAMAVGTELRNRLDAGERTLRDLMTQLEQVVSAHLGDPPLDGKKPEPVRGENTRPNSQATGTWRLP
Ga0209068_1002394363300027894WatershedsMMEAMWRKENGTGKQPPSWALYAEAADRFRGSAEAFMEHVHLLTEARDAYQQAMSASSELRNRLDAGDQTLRSVMTQLEQVVTDHLSESVPDKKRPELVKVEPTRGKNEGTGAGMMFP
Ga0209526_1000288073300028047Forest SoilMSEAMWKKDDGMGTQPTPTWAMYADAMNKFSRSATAFMEHVHLLTEARTAYQEAMAIGTALRIRLDAGDQTLRSLMTQLEQVVNNHLGEPVLDRKKPEPVEVESTRAKKEGTGMMFL
Ga0209526_1000561233300028047Forest SoilMSDAMWKKEDGMGTQLTPTWAIYAEAMNRFTKSATAFIEHAHLLTEARDAYQEAMAASTALRKGLDAGDHTLRSLRAQLAQVVYDHLDQPALDRKKPELVRVESTKAKNEGTGTARMFP
Ga0209526_1012724323300028047Forest SoilMSEPMWKRDDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVNAHLGETSLGGKKPELVKGEETRTSDQATRPWKAFP
Ga0209526_1014806813300028047Forest SoilMAEPMWNKREGVDTQRAPTWAMYAEAMNRFTGSARAFMEHVHLLTEARTAYQEAIAVGTELRNRLDAGDKTLRDLMTQLEQVVGAHLGEPLFDGKKPELVRGENARTNSQATGTWKTFP
Ga0209526_1057566213300028047Forest SoilMWKSEDGMGTQPMPSWAMNKFSRSATAFMEHVHLLTEARTAYQEAMVVGTDLRNRLDAGDKNLRDLMTKLEQVVSAHLGEPPLDGKKPELVRGENARTNSQATGTWKVFP
Ga0137415_1057684823300028536Vadose Zone SoilMSEPMWKREISNGAQPMPSWTTYAEAMNQFNRSARAFMEQVHLLTEARTAYQEAMAVGTELRNRLDAGERTLRDLMTQLERVVSAHLGEPPLEGEKPELVKGENTKTKRQTTGT
Ga0257175_108656613300028673SoilGTKPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSASTELRNRLDAGDEALRSVMTQLEQVVTDHLSEHVLDRKKPELVKVEPTRGKNEGTGTGRTFP
Ga0308309_1119809513300028906SoilMSEPMWKREDGKGTQPMPTWAMYAEAMNKFNRSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDRTLRGLMAQLEQVVSAHLGEAPLNGMKPELVKGENARTNSQATGTWQAFP
Ga0222749_1068897913300029636SoilQMSEPMWKRDDGIGTQPRPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVNAHLGETSLGGKKPELVKGEDTRTSDQATRPWKASP
Ga0170824_12849657913300031231Forest SoilKDEGVGTQAVPTWAMYAEAMNNFRRSASSFMEHVHLLTEARMAYQEAMSVGTELRNRLDSGDQVLRSVMAQLEQVINDHLSEPVVDRKRPELVREEQLRGKNAATGTDRNFP
(restricted) Ga0255312_111973533300031248Sandy SoilPTFGTYAEAMNKFSRSARAFMEHVHLLTEARAAYQEAVSIGADLRNRLDAGDQTLQSLMTQLEQVVNVHLSEPILEKKKPELVKAELIRGRAEATGTDRNFP
Ga0307469_1239906313300031720Hardwood Forest SoilEAMWKKEEGMGTQPMPTWATYAETMNKFTRSATAFMEHVHLLTEARTAYQEAMAVGTELRNRLDAGDQTLRSLMAQLEQVVNNHLGEPVLDRKKPELMKAESTRENDEGTGTGGMFP
Ga0307468_10067598513300031740Hardwood Forest SoilMSAAMWKKEDGTAPQPTPSWALYTDAANKFRSSAEAFMEHVPLLTEARSAYQEAMLVSAELRNRLDASDEALKSVMTQLEQVVNDHLDEPALDRKKPKAESSRAKNE
Ga0307473_1076805123300031820Hardwood Forest SoilMEAIWKKQEDTGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEARSAYQQAVSASTELRGRLDAGDQTLKSIMTQLEQVVTDHLAEPVLDRKKPELVRAEPARGRNEVTGTGRTFP
Ga0307479_1171605523300031962Hardwood Forest SoilFMEHVHLLTEAQSAYQQAMSASTELRNRLDAGDQTLRSVMTQLAQVVSDHLSESVPDKKRPELVRVEPTRGKNEGTGTGMMFP
Ga0307471_10000296423300032180Hardwood Forest SoilMSEPMWKRHDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGRKPELVKGEETRTSDQATRPWKAFP
Ga0307471_10120456013300032180Hardwood Forest SoilMEATWRKEDNTGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEAQSAYQQAMSASTELRNRLDAGDQTLRSVMTQLAQVVSDHLSESVPDKKRPELVRVEPTRGKNEGTGTGMMFP
Ga0307471_10300681213300032180Hardwood Forest SoilMSGAMWKKEEGMGTQPVATWATYAEAMNKVTKSATAFMEHVHLLTEARTAYQEAMAVGTELRSRLDAGDQSLRSLMNQLEQVVNAHLSEPDLDRKKPELVKAESTRATTEGTGNGGIYPRNQPFIASSLSNIVKFHK
Ga0307471_10406734513300032180Hardwood Forest SoilMSAAMWKKEDAMGTQPVPTWAMYVEAMNNFSRSATAFMEHVHLLTEARSAYHEAMAVGTALRNSLDAGDQTLRSLMTQLEQVVNDHLGEPVLDRKKPELVKAESTRAKNEGTGAGRMFP
Ga0307472_10033402713300032205Hardwood Forest SoilMSEPMWKRHDGIGTQPMPSWAMYAEAMNKFSRSATAFMEHVHLLTEARTAYQEAMSVGTELRNRLDAGDRTLRGLMTQLEQVVSAHLGETSLGGKKPELVKGEETRTSDQATRPWKAFP
Ga0307472_10063747013300032205Hardwood Forest SoilMEAIWRKEDNTGPQPTPSWALYAEAANRFRSSAEAFMEHVHLLTEAQSAYQQAMSASTELRNRLDAGDQTLRSVMTQLAQVVSDHLSESVPDKKRPELVRVEPTRGKNEGTGTGMMFP
Ga0307472_10097067213300032205Hardwood Forest SoilLYQQENRRMSMDDAILKREGKVETQLMPTFGTYAEAMNKFSRSARAFMEHVHLLTEARAAYQEAVSIGADLRNRLDAGDQTLQSLMIQLEQIVNVHLAEPVLERKKPELVNSELIRGRAEATGTDRNFLP
Ga0307472_10253473713300032205Hardwood Forest SoilMWKKEDGTAPQPSWALYTDAANKFRSSAEAFMEHVPLLTEARSAYQEAMLVSAELRNRLDASDEALKSVMTQLEQVVNDHLDEPALDRKKPKAESSRAKNENTGTG
Ga0326726_1064124123300033433Peat SoilMSEAMWKKDQGTGPQPTPSWALYAEAADRFRSSAEAFMEHVHLLTEARDAYQQALSASSELRNRLDAGDQTLRSVMSQLEQVVTDHLSESVPNKKRPELVKVEPTRGNNEGTGTGMMFP
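
The rows above run the four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) together without separators. A minimal Python sketch for splitting one row back into fields is given below. It rests on assumptions that the page itself does not state: that every Sample Taxon ID is exactly 10 digits, that the habitat label is written in mixed case and ends in a lowercase letter, that the sequence is all uppercase, and that a trailing "*" is the conventional stop marker of translated protein output. The names ROW_RE, FamilyRow and parse_row are hypothetical and are not part of NMPFamsDB or IMG/M.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout of one flattened row (inferred from the listing, not documented):
#   [(restricted) ]<protein id><10-digit taxon id><Habitat Label><SEQUENCE>[*]
ROW_RE = re.compile(
    r"^(?P<restricted>\(restricted\)\s*)?"   # optional JGI restriction flag
    r"(?P<protein_id>\S+?)"                  # shortest prefix before the taxon ID
    r"(?P<taxon_id>\d{10})"                  # assumption: taxon IDs have 10 digits
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"    # mixed-case label ending in lowercase
    r"(?P<sequence>[A-Z]+)"                  # residues, uppercase one-letter codes
    r"(?P<star>\*)?$"                        # optional trailing '*'
)


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    has_terminal_star: bool
    restricted: bool


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one flattened table row; return None if it does not fit the assumed layout."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return FamilyRow(
        protein_id=m.group("protein_id"),
        taxon_id=m.group("taxon_id"),
        habitat=m.group("habitat"),
        sequence=m.group("sequence"),
        has_terminal_star=m.group("star") is not None,
        restricted=m.group("restricted") is not None,
    )


# One row copied from the table above.
example = (
    "Ga0307479_1171605523300031962Hardwood Forest Soil"
    "FMEHVHLLTEAQSAYQQAMSASTELRNRLDAGDQTLRSVMTQLAQVVSDHLSESVPDKKRPELVRVEPTRGKNEGTGTGMMFP"
)

if __name__ == "__main__":
    row = parse_row(example)
    if row is not None:
        print(row.protein_id, row.taxon_id, row.habitat, len(row.sequence))
```

The split works because the taxon ID has to be followed directly by the capital letter that starts the habitat label, and the habitat ends at the last lowercase letter in the row. Any row that breaks these assumptions yields None rather than a guess, so such rows can be checked by hand.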




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice