NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F041388

Metagenome / Metatranscriptome Family F041388

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F041388
Family Type Metagenome / Metatranscriptome
Number of Sequences 160
Average Sequence Length 78 residues
Representative Sequence MNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHHSIAPRDIVDTAQQLGNRAFDQSDVHKCREFLKKYEIRKGF
Number of Associated Samples 123
Number of Associated Scaffolds 160

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 44.38 %
% of genes near scaffold ends (potentially truncated) 28.12 %
% of genes from short scaffolds (< 2000 bps) 83.75 %
Associated GOLD sequencing projects 113
AlphaFold2 3D model prediction Yes
3D model pTM-score0.64

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (70.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil
(25.000 % of family members)
Environment Ontology (ENVO) Unclassified
(30.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(66.875 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 54.72%    β-sheet: 0.00%    Coil/Unstructured: 45.28%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.64
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 160 Family Scaffolds
PF00231ATP-synt 2.50
PF13742tRNA_anti_2 1.88
PF00654Voltage_CLC 1.88
PF03972MmgE_PrpD 1.88
PF00106adh_short 1.25
PF00923TAL_FSA 1.25
PF04434SWIM 1.25
PF00326Peptidase_S9 1.25
PF01381HTH_3 0.62
PF01263Aldose_epim 0.62
PF13416SBP_bac_8 0.62
PF03358FMN_red 0.62
PF06751EutB 0.62
PF00296Bac_luciferase 0.62
PF07676PD40 0.62
PF01087GalP_UDP_transf 0.62
PF05985EutC 0.62
PF12681Glyoxalase_2 0.62
PF07508Recombinase 0.62
PF01488Shikimate_DH 0.62
PF07399Na_H_antiport_3 0.62
PF01797Y1_Tnp 0.62
PF01051Rep_3 0.62
PF13185GAF_2 0.62
PF01135PCMT 0.62
PF13671AAA_33 0.62
PF03372Exo_endo_phos 0.62
PF02405MlaE 0.62
PF01565FAD_binding_4 0.62
PF01595CNNM 0.62
PF02806Alpha-amylase_C 0.62
PF09335SNARE_assoc 0.62
PF06078DUF937 0.62
PF02371Transposase_20 0.62
PF01494FAD_binding_3 0.62
PF03459TOBE 0.62
PF08281Sigma70_r4_2 0.62
PF06841Phage_T4_gp19 0.62
PF11008DUF2846 0.62
PF02604PhdYeFM_antitox 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 160 Family Scaffolds
COG0224FoF1-type ATP synthase, gamma subunitEnergy production and conversion [C] 2.50
COG20792-methylcitrate dehydratase PrpDCarbohydrate transport and metabolism [G] 1.88
COG0038H+/Cl- antiporter ClcAInorganic ion transport and metabolism [P] 1.88
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 1.25
COG4279Uncharacterized protein, contains SWIM-type Zn finger domainFunction unknown [S] 1.25
COG4715Uncharacterized protein, contains SWIM-type Zn finger domainFunction unknown [S] 1.25
COG5431Predicted nucleic acid-binding protein, contains SWIM-type Zn-finger domainGeneral function prediction only [R] 1.25
COG0176Transaldolase/fructose-6-phosphate aldolaseCarbohydrate transport and metabolism [G] 1.25
COG4303Ethanolamine ammonia-lyase, large subunitAmino acid transport and metabolism [E] 0.62
COG4302Ethanolamine ammonia-lyase, small subunitAmino acid transport and metabolism [E] 0.62
COG5527Protein involved in initiation of plasmid replicationMobilome: prophages, transposons [X] 0.62
COG4122tRNA 5-hydroxyU34 O-methylase TrmR/YrrMTranslation, ribosomal structure and biogenesis [J] 0.62
COG4118Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressorDefense mechanisms [V] 0.62
COG3753Uncharacterized conserved protein YidB, DUF937 familyFunction unknown [S] 0.62
COG3547TransposaseMobilome: prophages, transposons [X] 0.62
COG2519tRNA A58 N-methylase Trm61Translation, ribosomal structure and biogenesis [J] 0.62
COG2518Protein-L-isoaspartate O-methyltransferasePosttranslational modification, protein turnover, chaperones [O] 0.62
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 0.62
COG2161Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM familyDefense mechanisms [V] 0.62
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.62
COG2017Galactose mutarotase or related enzymeCarbohydrate transport and metabolism [G] 0.62
COG1961Site-specific DNA recombinase SpoIVCA/DNA invertase PinEReplication, recombination and repair [L] 0.62
COG1943REP element-mobilizing transposase RayTMobilome: prophages, transposons [X] 0.62
COG1523Pullulanase/glycogen debranching enzymeCarbohydrate transport and metabolism [G] 0.62
COG1238Uncharacterized membrane protein YqaA, VTT domainFunction unknown [S] 0.62
COG0767Permease subunit MlaE of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 0.62
COG0676D-hexose-6-phosphate mutarotaseCarbohydrate transport and metabolism [G] 0.62
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 0.62
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 0.62
COG0586Membrane integrity protein DedA, putative transporter, DedA/Tvp38 familyCell wall/membrane/envelope biogenesis [M] 0.62
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 0.62
COG0398Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 familyFunction unknown [S] 0.62
COG0366Glycosidase/amylase (phosphorylase)Carbohydrate transport and metabolism [G] 0.62
COG02961,4-alpha-glucan branching enzymeCarbohydrate transport and metabolism [G] 0.62


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A70.00 %
All OrganismsrootAll Organisms30.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2170459003|FZ032L002J3CHFNot Available516Open in IMG/M
2170459009|GA8DASG01BTGQ7Not Available512Open in IMG/M
3300000789|JGI1027J11758_12813558Not Available1144Open in IMG/M
3300001593|JGI12635J15846_10429934Not Available791Open in IMG/M
3300002245|JGIcombinedJ26739_100077665All Organisms → cellular organisms → Bacteria3076Open in IMG/M
3300002245|JGIcombinedJ26739_101383395Not Available596Open in IMG/M
3300002245|JGIcombinedJ26739_101628631Not Available543Open in IMG/M
3300002245|JGIcombinedJ26739_101769457Not Available518Open in IMG/M
3300003219|JGI26341J46601_10000650All Organisms → cellular organisms → Bacteria10790Open in IMG/M
3300003219|JGI26341J46601_10053180All Organisms → cellular organisms → Bacteria1262Open in IMG/M
3300003368|JGI26340J50214_10002619All Organisms → cellular organisms → Bacteria5849Open in IMG/M
3300003505|JGIcombinedJ51221_10410524Not Available549Open in IMG/M
3300004080|Ga0062385_10662444Not Available668Open in IMG/M
3300004092|Ga0062389_100592527All Organisms → cellular organisms → Bacteria → Proteobacteria1271Open in IMG/M
3300004092|Ga0062389_101381265Not Available889Open in IMG/M
3300004092|Ga0062389_102544047Not Available680Open in IMG/M
3300004092|Ga0062389_103659138All Organisms → cellular organisms → Bacteria577Open in IMG/M
3300004092|Ga0062389_104273935Not Available537Open in IMG/M
3300004152|Ga0062386_100380051All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1135Open in IMG/M
3300004633|Ga0066395_10734402Not Available588Open in IMG/M
3300004635|Ga0062388_101471950Not Available687Open in IMG/M
3300005332|Ga0066388_100621440Not Available1700Open in IMG/M
3300005434|Ga0070709_11613001All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Enterocloster → Enterocloster clostridioformis528Open in IMG/M
3300005437|Ga0070710_11477695All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Carnobacteriaceae → Dolosigranulum → Dolosigranulum pigrum510Open in IMG/M
3300005439|Ga0070711_101343699Not Available621Open in IMG/M
3300005456|Ga0070678_101380699All Organisms → cellular organisms → Bacteria657Open in IMG/M
3300005468|Ga0070707_102045391Not Available540Open in IMG/M
3300005548|Ga0070665_100493171Not Available1236Open in IMG/M
3300005602|Ga0070762_11193763Not Available526Open in IMG/M
3300005764|Ga0066903_100260979Not Available2688Open in IMG/M
3300006041|Ga0075023_100210167Not Available756Open in IMG/M
3300006173|Ga0070716_100396500Not Available991Open in IMG/M
3300006174|Ga0075014_100529970Not Available663Open in IMG/M
3300006175|Ga0070712_100015177All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium4955Open in IMG/M
3300006176|Ga0070765_100074988All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales2851Open in IMG/M
3300006176|Ga0070765_100544051Not Available1093Open in IMG/M
3300006176|Ga0070765_101509655Not Available632Open in IMG/M
3300006893|Ga0073928_10055948All Organisms → cellular organisms → Bacteria3518Open in IMG/M
3300009545|Ga0105237_10286391All Organisms → cellular organisms → Bacteria1650Open in IMG/M
3300010154|Ga0127503_10600927Not Available540Open in IMG/M
3300010154|Ga0127503_10919783Not Available813Open in IMG/M
3300010154|Ga0127503_11071014Not Available672Open in IMG/M
3300010154|Ga0127503_11230593All Organisms → cellular organisms → Bacteria1107Open in IMG/M
3300010154|Ga0127503_11288280Not Available501Open in IMG/M
3300010339|Ga0074046_10561140Not Available678Open in IMG/M
3300010343|Ga0074044_10192257Not Available1357Open in IMG/M
3300010359|Ga0126376_12510546Not Available563Open in IMG/M
3300010376|Ga0126381_101624527Not Available934Open in IMG/M
3300010379|Ga0136449_101017921All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae1331Open in IMG/M
3300010379|Ga0136449_101095483Not Available1268Open in IMG/M
3300010379|Ga0136449_101512931Not Available1026Open in IMG/M
3300010865|Ga0126346_1089265Not Available524Open in IMG/M
3300011120|Ga0150983_11251215Not Available526Open in IMG/M
3300011120|Ga0150983_13580349Not Available820Open in IMG/M
3300012181|Ga0153922_1005988Not Available3156Open in IMG/M
3300012202|Ga0137363_10247935Not Available1447Open in IMG/M
3300012361|Ga0137360_10116408Not Available2071Open in IMG/M
3300012361|Ga0137360_11695559Not Available537Open in IMG/M
3300012362|Ga0137361_10802037Not Available857Open in IMG/M
3300012960|Ga0164301_11218813Not Available605Open in IMG/M
3300012984|Ga0164309_11870042Not Available515Open in IMG/M
3300012986|Ga0164304_10632392Not Available803Open in IMG/M
3300012989|Ga0164305_10407153Not Available1043Open in IMG/M
3300013308|Ga0157375_11619169Not Available766Open in IMG/M
3300014165|Ga0181523_10138140All Organisms → cellular organisms → Bacteria1442Open in IMG/M
3300014200|Ga0181526_10183597Not Available1339Open in IMG/M
3300015242|Ga0137412_10839704Not Available671Open in IMG/M
3300015371|Ga0132258_12213535All Organisms → cellular organisms → Bacteria1380Open in IMG/M
3300016445|Ga0182038_10808552Not Available822Open in IMG/M
3300019890|Ga0193728_1317302Not Available580Open in IMG/M
3300020579|Ga0210407_10405725All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1066Open in IMG/M
3300020579|Ga0210407_10495023Not Available955Open in IMG/M
3300020581|Ga0210399_10017138All Organisms → cellular organisms → Bacteria5724Open in IMG/M
3300020582|Ga0210395_10388822All Organisms → cellular organisms → Bacteria1049Open in IMG/M
3300020583|Ga0210401_10392992Not Available1249Open in IMG/M
3300020583|Ga0210401_11377145Not Available563Open in IMG/M
3300020583|Ga0210401_11642401Not Available500Open in IMG/M
3300021168|Ga0210406_11388780Not Available502Open in IMG/M
3300021171|Ga0210405_10286675All Organisms → cellular organisms → Bacteria1302Open in IMG/M
3300021178|Ga0210408_10697796Not Available799Open in IMG/M
3300021180|Ga0210396_10449306Not Available1131Open in IMG/M
3300021180|Ga0210396_10466641All Organisms → cellular organisms → Bacteria1107Open in IMG/M
3300021181|Ga0210388_11298535Not Available614Open in IMG/M
3300021358|Ga0213873_10041924Not Available1182Open in IMG/M
3300021361|Ga0213872_10022497Not Available2903Open in IMG/M
3300021361|Ga0213872_10030929Not Available2453Open in IMG/M
3300021361|Ga0213872_10349769Not Available606Open in IMG/M
3300021372|Ga0213877_10000233All Organisms → cellular organisms → Bacteria8741Open in IMG/M
3300021372|Ga0213877_10018484All Organisms → cellular organisms → Bacteria1853Open in IMG/M
3300021372|Ga0213877_10175102All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium687Open in IMG/M
3300021401|Ga0210393_10572100Not Available923Open in IMG/M
3300021402|Ga0210385_10008554All Organisms → cellular organisms → Bacteria6199Open in IMG/M
3300021402|Ga0210385_10387162Not Available1049Open in IMG/M
3300021403|Ga0210397_11243822All Organisms → cellular organisms → Bacteria579Open in IMG/M
3300021404|Ga0210389_11310306Not Available555Open in IMG/M
3300021405|Ga0210387_10563281All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae1013Open in IMG/M
3300021405|Ga0210387_11098324All Organisms → cellular organisms → Bacteria694Open in IMG/M
3300021405|Ga0210387_11189498Not Available662Open in IMG/M
3300021420|Ga0210394_10841772Not Available800Open in IMG/M
3300021432|Ga0210384_11686346Not Available539Open in IMG/M
3300021444|Ga0213878_10035737Not Available1899Open in IMG/M
3300021474|Ga0210390_10640621Not Available889Open in IMG/M
3300021475|Ga0210392_10398921Not Available1002Open in IMG/M
3300021475|Ga0210392_11217186Not Available564Open in IMG/M
3300021478|Ga0210402_10989648Not Available768Open in IMG/M
3300021559|Ga0210409_10578282Not Available991Open in IMG/M
3300021560|Ga0126371_10341107Not Available1631Open in IMG/M
3300022507|Ga0222729_1070628Not Available515Open in IMG/M
3300022531|Ga0242660_1035374All Organisms → cellular organisms → Bacteria → PVC group1028Open in IMG/M
3300022533|Ga0242662_10197463Not Available632Open in IMG/M
3300022557|Ga0212123_10020472All Organisms → cellular organisms → Bacteria → Proteobacteria7575Open in IMG/M
3300022726|Ga0242654_10303475Not Available588Open in IMG/M
3300024225|Ga0224572_1028560Not Available1069Open in IMG/M
3300024288|Ga0179589_10409369Not Available622Open in IMG/M
3300024347|Ga0179591_1044384All Organisms → cellular organisms → Bacteria2779Open in IMG/M
3300025914|Ga0207671_11469804Not Available527Open in IMG/M
3300026551|Ga0209648_10034267All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4422Open in IMG/M
3300026895|Ga0207758_1017326Not Available701Open in IMG/M
3300027371|Ga0209418_1072590Not Available616Open in IMG/M
3300027439|Ga0209332_1106067Not Available501Open in IMG/M
3300027667|Ga0209009_1004319All Organisms → cellular organisms → Bacteria3441Open in IMG/M
3300027667|Ga0209009_1049538All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Aurantimonadaceae → Aurantimonas → unclassified Aurantimonas → Aurantimonas sp. 22II-16-19i1048Open in IMG/M
3300027680|Ga0207826_1015307Not Available2091Open in IMG/M
3300027684|Ga0209626_1150229Not Available615Open in IMG/M
3300027698|Ga0209446_1100126Not Available744Open in IMG/M
3300027812|Ga0209656_10000114All Organisms → cellular organisms → Bacteria51378Open in IMG/M
3300027812|Ga0209656_10017436All Organisms → cellular organisms → Bacteria4431Open in IMG/M
3300027812|Ga0209656_10034073All Organisms → cellular organisms → Bacteria2991Open in IMG/M
3300027825|Ga0209039_10094607All Organisms → cellular organisms → Bacteria1288Open in IMG/M
3300027884|Ga0209275_10701017Not Available583Open in IMG/M
3300028379|Ga0268266_10542085Not Available1114Open in IMG/M
3300028798|Ga0302222_10305140Not Available620Open in IMG/M
3300028863|Ga0302218_10228053Not Available597Open in IMG/M
3300028906|Ga0308309_10384948All Organisms → cellular organisms → Bacteria1201Open in IMG/M
3300029943|Ga0311340_11146633Not Available629Open in IMG/M
3300029999|Ga0311339_10613845Not Available1081Open in IMG/M
3300030007|Ga0311338_10576538All Organisms → cellular organisms → Bacteria → PVC group1162Open in IMG/M
3300030058|Ga0302179_10012252All Organisms → cellular organisms → Bacteria4386Open in IMG/M
3300030399|Ga0311353_11045569Not Available681Open in IMG/M
3300030531|Ga0210274_1479613Not Available540Open in IMG/M
3300030532|Ga0210290_1655691Not Available524Open in IMG/M
3300030597|Ga0210286_1258937Not Available555Open in IMG/M
3300030741|Ga0265459_12161091Not Available673Open in IMG/M
3300030743|Ga0265461_10454646Not Available1020Open in IMG/M
3300031057|Ga0170834_110906249Not Available537Open in IMG/M
3300031231|Ga0170824_103451325All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium873Open in IMG/M
3300031234|Ga0302325_10165507All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3909Open in IMG/M
3300031573|Ga0310915_10266397Not Available1207Open in IMG/M
3300031708|Ga0310686_105937005All Organisms → cellular organisms → Bacteria → Acidobacteria2328Open in IMG/M
3300031708|Ga0310686_109550086Not Available586Open in IMG/M
3300031962|Ga0307479_11298370Not Available689Open in IMG/M
3300032059|Ga0318533_10356233Not Available1066Open in IMG/M
3300032063|Ga0318504_10369981Not Available682Open in IMG/M
3300032160|Ga0311301_10292953Not Available2618Open in IMG/M
3300032160|Ga0311301_10963452All Organisms → cellular organisms → Bacteria1136Open in IMG/M
3300032160|Ga0311301_11060780All Organisms → cellular organisms → Bacteria1061Open in IMG/M
3300032160|Ga0311301_11366838Not Available887Open in IMG/M
3300032205|Ga0307472_100234296Not Available1426Open in IMG/M
3300032261|Ga0306920_103579951Not Available573Open in IMG/M
3300032515|Ga0348332_13062323Not Available509Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil25.00%
Bog Forest SoilEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil10.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil8.12%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa5.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.38%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil4.38%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil4.38%
SoilEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil3.12%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil3.12%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere3.12%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil3.75%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.75%
Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil2.50%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.88%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog1.25%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring1.25%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.25%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil1.25%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.25%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.25%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.25%
Bog Forest SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil1.25%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.25%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.25%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil0.62%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.62%
Plant LitterEnvironmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter0.62%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.62%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere0.62%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.62%
Attine Ant Fungus GardensHost-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens0.62%
Boreal Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil0.62%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2170459003Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cmEnvironmentalOpen in IMG/M
2170459009Grass soil microbial communities from Rothamsted Park, UK - July 2009 indirect DNA Tissue lysis 0-10cmEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001593Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2EnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300003219Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM3EnvironmentalOpen in IMG/M
3300003368Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2EnvironmentalOpen in IMG/M
3300003505Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924)EnvironmentalOpen in IMG/M
3300004080Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300004092Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3EnvironmentalOpen in IMG/M
3300004152Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3EnvironmentalOpen in IMG/M
3300004633Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBioEnvironmentalOpen in IMG/M
3300004635Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005548Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaGHost-AssociatedOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006174Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300009545Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaGHost-AssociatedOpen in IMG/M
3300010154Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010339Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3EnvironmentalOpen in IMG/M
3300010343Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010865Boreal forest soil eukaryotic communities from Alaska, USA - C3-3 Metatranscriptome (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300011120Combined assembly of Microbial Forest Soil metaTEnvironmentalOpen in IMG/M
3300012181Attine ant fungus gardens microbial communities from New Jersey, USA - TSNJ006 MetaGHost-AssociatedOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014165Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_30_metaGEnvironmentalOpen in IMG/M
3300014200Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin06_30_metaGEnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c1EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020582Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-OEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021358Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R3Host-AssociatedOpen in IMG/M
3300021361Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R2Host-AssociatedOpen in IMG/M
3300021372Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R01EnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021402Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-OEnvironmentalOpen in IMG/M
3300021403Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-OEnvironmentalOpen in IMG/M
3300021404Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-OEnvironmentalOpen in IMG/M
3300021405Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021444Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R02EnvironmentalOpen in IMG/M
3300021474Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-OEnvironmentalOpen in IMG/M
3300021475Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-OEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022507Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022533Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022557Paint Pots_combined assemblyEnvironmentalOpen in IMG/M
3300022726Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024225Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic ? CZU5Host-AssociatedOpen in IMG/M
3300024288Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungalEnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025914Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026895Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 12 (SPAdes)EnvironmentalOpen in IMG/M
3300027371Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027439Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027680Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 80 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027698Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027812Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes)EnvironmentalOpen in IMG/M
3300027825Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM1 (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300028379Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300028798Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_E2_2EnvironmentalOpen in IMG/M
3300028863Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_E1_1EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300029943I_Palsa_N3 coassemblyEnvironmentalOpen in IMG/M
3300029999I_Palsa_E3 coassemblyEnvironmentalOpen in IMG/M
3300030007I_Palsa_E1 coassemblyEnvironmentalOpen in IMG/M
3300030058Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Palsa_E3_1EnvironmentalOpen in IMG/M
3300030399II_Palsa_E2 coassemblyEnvironmentalOpen in IMG/M
3300030531Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO143-VCO038SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030532Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO410-VDE108SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030597Metatranscriptome of forest soil microbial communities from Boreal Montmorency Forest, Quebec, Canada - FO747-VDE046SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030741Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada ANR Co-assemblyEnvironmentalOpen in IMG/M
3300030743Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assemblyEnvironmentalOpen in IMG/M
3300031057Oak Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032515FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
E4A_116141402170459003Grass SoilMNEEEIVNRFEEIRNELKMYKEQLDSFKEDLKILLKHRSIAHRDIVDTAQQLGNSAFDQSDVRKCREFLKKIRY
F47_122762602170459009Grass SoilFQFLRYIIFLSMNDEEMENRFGELRDELRAFKNELDSVREDLKLLLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLEKXNIRL
JGI1027J11758_1281355823300000789SoilMNAEEIEHGFQEIRNELETXKQQIASLKEDLRILLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLQRYNIRV*
JGI12635J15846_1042993423300001593Forest SoilMNEEEIQSHFLQIRNELKICNEQIASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLERYNIRV*
JGIcombinedJ26739_10007766523300002245Forest SoilMNEEEIEARFEEIRNQLKAQKEQLDAFKEDLMILFKHRSIASRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
JGIcombinedJ26739_10138339513300002245Forest SoilMNEEEIESDLREIRNELKTCKEQIAALKEDLKILLKHRSIAHRDIIDTAHQLGNQAFDQSDVQKCREFLLRYNIRV*
JGIcombinedJ26739_10162863123300002245Forest SoilMNEAEIETRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF*
JGIcombinedJ26739_10176945713300002245Forest SoilMNEEEIQSHFLQLQNELKMCNEQIASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLERYNIRV*GKG*
JGI26341J46601_1000065073300003219Bog Forest SoilMNEQEIETRFEELKNELKAQKEQLDALKEDLKILLKHRSIAPRDIVDTAQQIGNAAFDQSDVHRCREFLKKYDVRKGL*
JGI26341J46601_1005318033300003219Bog Forest SoilMNEQEIETRFEELKNELKSQKEQLDALKEDIKILLKHRSIAPRDIVDTAQQIGNAAFDQSDVHRCREFLKKYDVRKGL*
JGI26340J50214_1000261963300003368Bog Forest SoilMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
JGIcombinedJ51221_1041052413300003505Forest SoilRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF*
Ga0062385_1066244423300004080Bog Forest SoilMNEAEIETRFEEIQNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
Ga0062389_10059252723300004092Bog Forest SoilMHDFIGMNEEEIENAFREFRNELKTCKEQIASLKEDLKILLKHRSIPHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL*
Ga0062389_10138126523300004092Bog Forest SoilMNEEEIVNRFEEIRNELKAYKEQLDSFKEDLKILLKHRSIAHRDIIDTAQQLGNAAFDQSDVHRCREFLKKYDIRRGF*
Ga0062389_10254404713300004092Bog Forest SoilIQARFEDIQNELEAQRAQLVAFKEDLKILLRHRSIACRDIVETAQQLGNSGFDQSDVQECKEFLKKYGVRKGF*
Ga0062389_10365913823300004092Bog Forest SoilMNEKEVEARFEELRSELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVRRCQEFMKKYNLRKGY*
Ga0062389_10427393513300004092Bog Forest SoilMNEEEFELRFEEIRNELNAYKEQLDSFKEDLKILFKHRSIAPRDIVDTAQQVGNAAFDQSDVHRCRKFVEKYEIRRGF*
Ga0062386_10038005133300004152Bog Forest SoilMNEEEILNRFEEIRNELKVYKEQLDSFKEDLKILLKHRSVAHRDIIDTAQQLGNAAFDQSDVHRCREFLKKYGIRRGF*
Ga0066395_1073440223300004633Tropical Forest SoilMDEKEIETRFEEIRKELQGYQEQLDSFKEDLKILLKHHSLAPRDILDTAKQVGNAAFDQSDVHKCR
Ga0062388_10147195023300004635Bog Forest SoilLPTLTTVDHLKSMNEAEIETRFEEIQNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
Ga0066388_10062144033300005332Tropical Forest SoilMDEKEIETRFEEIRKELKGYQEQLDSFKEDLKILLKHHSLAPRDILDTAKQVGNAAFDQSDVHKCREFLKKYDLRKGF*
Ga0070709_1161300113300005434Corn, Switchgrass And Miscanthus RhizosphereMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKRI*
Ga0070710_1147769513300005437Corn, Switchgrass And Miscanthus RhizosphereKVKINLEFICESESLLDVMKAEEIEHGFQEIRNELETCKQQIASLKEDLKILLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLQRYNIRV*
Ga0070711_10134369913300005439Corn, Switchgrass And Miscanthus RhizosphereMHDFVGMTEEEIENTFQEFRNELKTCKEQVASLKEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL*
Ga0070678_10138069913300005456Miscanthus RhizosphereMNAEEIEHGFQEIRNELETCKQQIASLKEDLRILLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLQRYNIRV*
Ga0070707_10204539123300005468Corn, Switchgrass And Miscanthus RhizosphereMNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILETAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
Ga0070665_10049317133300005548Switchgrass RhizosphereMNDDEMENHFGELRDELRAFKKELDSVREDLKLLLKHRSIAHRDIIDTAQQLGNRAFDQSDVQKCREFLQKYNLRP*
Ga0070762_1119376323300005602SoilMHDFIGMNEEEIENAFREFRNELKTCKEQVASLKEDLKILLKHRSIPHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL*
Ga0066903_10026097933300005764Tropical Forest SoilMDEKEIETRFEEIRKELQGYQEQLDSFKEDLKILLKHHSLAPRDILDTAKQVGNAAFDQSDVHKCREFLKKYDLRKGF*
Ga0075023_10021016723300006041WatershedsMNHLRSMNEEEIEARFEEIRNELKAQREQLDAFKEDLMILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYAIRRGF*
Ga0070716_10039650023300006173Corn, Switchgrass And Miscanthus RhizosphereMNEQEVEARFEEFRNELKTFQDQLDSLKEYLKILLKHHSIAPRDILETAKQLGNSAFDQSDVHRCREFLKKYDLRKGI*
Ga0075014_10052997023300006174WatershedsMNEAEIETRFEEIQNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF*
Ga0070712_10001517713300006175Corn, Switchgrass And Miscanthus RhizosphereMNEEEIQSHFLQLQNELKMCNEQIASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLERYNIRV*
Ga0070765_10007498823300006176SoilLPTLAKVDHFKGMNEAEIETRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF*
Ga0070765_10054405123300006176SoilMENRFRDFRNELKACNEQIASLREDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLERYNIRV*
Ga0070765_10150965523300006176SoilMHDFIGMNEEEIENAFREFRNELKTCKEQVASLKEDLKILLKHRSIPHRDIIDTAHQLGNRAFDQSDVLK
Ga0073928_1005594823300006893Iron-Sulfur Acid SpringMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF*
Ga0105237_1028639123300009545Corn RhizosphereGELRDELRAFKKELDSVREDLKLLLKHRSIAHRDIIDTAQQLGNRAFDQSDVQKCREFLQKYNLRP*
Ga0127503_1060092713300010154SoilRFEELRSELKGYQEQLDSFREDLKILFEHRIIACRDILDTAKQVGNAAFDQSDAHRCQEFLKKYNLRKGY*
Ga0127503_1091978323300010154SoilRFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTVKQVGNAAFDQSDVHRCQEFMKKYNLRKGY*
Ga0127503_1107101413300010154SoilENRFQEIRNELKAYKEQLDSFKEDLKILLKHRSIASRDIVDTAKQLGNRAFDQSDVRKCQEFLKKYEIRKGF*
Ga0127503_1123059313300010154SoilEIENRFQEIRNELKTYKEQLDSFKEDLKILLKHRSIAPRDIVDTAKQLGNRAFDQSDVRKCQEFLKKYEIRKGF*
Ga0127503_1128828013300010154SoilREIRNELKSCKDQVASLREDLKILLKHRSIPHRDIIDTAHQLGNRAFDQSDVEGCREFLRRYQIRH*
Ga0074046_1056114023300010339Bog Forest SoilMNEAEIETRFEEIRNELKAQKEQLEAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYE
Ga0074044_1019225733300010343Bog Forest SoilMNEQEIETRFEELRNELKAQKEQLDALKEDIKILLKHRSIAPRDIMDTAQQIGNAAFDQSDVHRCREFLKKYDVRKGL*
Ga0126376_1251054623300010359Tropical Forest SoilMKEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAKQVGNAAFDQSDVHKCREFLKKYDLRKGF*
Ga0126381_10162452723300010376Tropical Forest SoilEEIRNELKAQREQLDAFKEDLKILFKHHSIAPRDILDTAQRLGNAAFDQSDVRRCREFLKKYDLRKHI*
Ga0136449_10101792113300010379Peatlands SoilMKVKHLRGMNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF*
Ga0136449_10109548323300010379Peatlands SoilMNEEQIENRFEEIRSELKAQKEQLDSLKEDLRILLKHRSIAPRDILDTAQQLGNAAFDQIDVHKCQEFLKKYEIRRGF*
Ga0136449_10151293123300010379Peatlands SoilMNEEQIENRFEEIRSELKAQKEQLDSLKEDLRILLKHRSIAPRDILDTAQQLGNAAFDQTDVHKCQEFLKKYDIRRGF*
Ga0126346_108926513300010865Boreal Forest SoilMNEKEVEARFEELRSELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFLKKYNLRKGY*
Ga0150983_1125121523300011120Forest SoilENDLREIRNELKTCKEQIAALKEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLLRYNIRV*
Ga0150983_1358034923300011120Forest SoilRNELKGYQEQLDSFREDLKILFEHRIIACRDILDTAKQDGNAAFDQSDVHRCQEFLKKYNLRKGY*
Ga0153922_100598843300012181Attine Ant Fungus GardensMDEEEIKNRFEEIRNELKAQKEQLDSFKEDLRILLKHRSIASRDILETAQQLGNAAFDQSDVHKCQEFLKKYGIKRGF*
Ga0137363_1024793523300012202Vadose Zone SoilMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKGI*
Ga0137360_1011640833300012361Vadose Zone SoilDDLRCMNEEEIEARFEEIRNELKAYQEQLDSFKEDLKILFKHRSIAPRDILETAKQDGNAAFDQSDVRRCREFLKKYDLRKRI*
Ga0137360_1169555923300012361Vadose Zone SoilMTNEEIENNFRQIRNELETCREQFASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLQKYDIRH*
Ga0137361_1080203723300012362Vadose Zone SoilMNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHHSIAPRDIVDTAQQLGNRAFDQSDVHKCQEFLKKYDIRKGF*
Ga0164301_1121881313300012960SoilMNEKEIVNRFEEIRNELKVYKGLLDSFKEELKILLKHRSIAPRDIVDTAKQLGNAAFDQSDVHKCREFVKKYEIRKGF*
Ga0164309_1187004213300012984SoilMNEEEIENRFQEIRNELKTYKEQLDSFKEDLKILLKHRSIAPRDIVDTAKQLGNRAFDQSDVRKCQEFLKKYEIRKGF*
Ga0164304_1063239213300012986SoilMNDDEMENHFGELRDELRAFKKELDSVREDLKLLLKHRSIAHRDIIDTAQQLGNRAFDQSDVQKCREFLQKYNIRL*
Ga0164305_1040715333300012989SoilMNEQEVEARFEEFRNELKTFQDQLDSLKEELKILLKHHSIAPRDILETAKQLGNSAFDQSDVHRCREFLKKYDLRKGI*
Ga0157375_1161916913300013308Miscanthus RhizosphereMNAEEIEHGFQEIRNELETCKPQIASLKEDLSIPLKPRSIAHRDIIDTAYQVGNRAFDQSDVQKCREFLQRYNIRV*
Ga0181523_1013814023300014165BogMNEAEIEHRFEEIRNELKAYREQLESFKEDLQILLKHRSIAHRDIVETAQQIGNAAFDQSDVHRCRKFLEK
Ga0181526_1018359723300014200BogLRAMNEAEIEHRFEEIRNELKAYREQLESFKEDLQILLKHRSIAHRDIVETAQQIGNAAFDQSDVHRCRKFLEKYEIRKGF*
Ga0137412_1083970413300015242Vadose Zone SoilMKAEEIEHGFQEIRNELETCKQQIASLKEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLQRYNIRV*
Ga0132258_1221353513300015371Arabidopsis RhizosphereMKEEEIEDNFRQIRNELETCRQQFASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLQKYDIRH*
Ga0182038_1080855213300016445SoilMNEEEIEARFEEIRRELKGFQEQLDSFKEDLKILLKHHSVAPRDIVDTAKQLGNAAFDQSDVHRCREFLKKYDLRKGL
Ga0193728_131730213300019890SoilMTNEEIEDNFRQIRNELETCREQFASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLQKYDIRH
Ga0210407_1040572523300020579SoilMNEKEVEARFEELRSELKGYQEQLDSFREDLKILFEHRIIACRDILDTAKQDGNAAFDQSDVHRCQEFLKKYNLRKGY
Ga0210407_1049502313300020579SoilMNEKEVEARFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFMKKYNLRKGY
Ga0210399_1001713813300020581SoilLPTLAKVDHFKGMNEAEIETRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0210395_1038882213300020582SoilMNEAEIETRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0210401_1039299223300020583SoilMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKRI
Ga0210401_1137714523300020583SoilMNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHHSIAPRDIVDTAQQLGNRAFDQSDVHKCREFLKKYEIRKGF
Ga0210401_1164240123300020583SoilMNEEEVEARFEEIRNELKEYQEQLDSFREDLKILFKHRLIACRDILDTAKQVGNAAFDQSDVNRCREFLKKYDLRKGY
Ga0210406_1138878013300021168SoilMHDLVVMTEEEIESTFQEFRNELKTCKEQVASLNEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRP
Ga0210405_1028667523300021171SoilMNEDEIENRFQEIRNELNAYKEQLDSYKEDLKILLKHRSIAPRDIVDTAKQFGNRAFDQSDVRKCQEFLKKYEIRKGF
Ga0210408_1069779613300021178SoilNPTAKVTHLRSMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKRI
Ga0210396_1044930623300021180SoilMHDPVGMSEEEMEIAFREVQNELKTCKEQVASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCRAFLEKYHI
Ga0210396_1046664113300021180SoilEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0210388_1129853523300021181SoilMNEKEIQARFEKIQNELEAQRAQLAAFKEDLKILLRHRSIACRDIVETAQQLGNSAFDQSDVQKCEEFLKKYGVRKGF
Ga0213873_1004192423300021358RhizosphereMDEREIEARFEEIRKELKGYQEQLDSFKEDLKILLKHHSLAPRDIVDTAKQLGNAAFDQSDVHKCREFLKKYDLRKGL
Ga0213872_1002249733300021361RhizosphereMNQTIVRGMNEKEIEARFAEIRNELKTCQEQLDSFKEDLKILFKHHSLAPRDILETAKQLGNAAFDQSDVHRCREFLEKYGLRRHV
Ga0213872_1003092923300021361RhizosphereMDEREIEACFEEIRKELKGYQEQLDSFKEDLKILFKHHSLAPRDIVDTAKQLGNAAFDQSDVHKCREFLKKYDLRKGL
Ga0213872_1034976913300021361RhizosphereMNETIVRGMNEKEIEARFAEIRNELKTCQEQLDSFKEDLKILFKHHSLAPRDILETAKQLGNAAFDQSDVHRCCEFLKKYGLNHKSR
Ga0213877_10000233103300021372Bulk SoilMNEKEIEARFAEIRNELKTCQEQLDSFKEDLKILFKHHSLAPRDILETAKQLGNAAFDQSDVHRCREFLEKYGLRRHV
Ga0213877_1001848423300021372Bulk SoilMDAEEIEARFEEIRKQLKAYQEQLDSFKEDLKILIKHHSIAPRDILDTAKQVGNSAFDQSDVHKCREFLKKYDLRKGL
Ga0213877_1017510223300021372Bulk SoilMNEKEIEARFAEIRNELKTCQEQLDSFKEDLKILFKHHSLAPRDILETAKQLGNAAFDQSDVHRCCEFLKKYGLNHKSR
Ga0210393_1057210013300021401SoilMNEKEIQARFEEIQKELETQRAQLAAFKEDLKILLRHRSIACRDIVETAQQLGNSAFGQSDVQKCSLEILHKYGVRKGF
Ga0210385_1000855443300021402SoilMHDPVGMSEEEMEIAFREVQNELKTCKEQVASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCRAFLEKYHIRL
Ga0210385_1038716223300021402SoilMNEKEIQARFEEIQKELETQRAQLAAFKEDLKILLRHRSIACRDIVETAQQLGNSAFDQSDVQKCEEFLKKYGVRKGF
Ga0210397_1124382213300021403SoilLCRMNEKEIQARFEEIQKELETQRAQLAAFKEDLKILLRHRSIACRDIVETAQQLGNSAFDQSDVQKCEEFLKKYGVRKGF
Ga0210389_1131030613300021404SoilIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0210387_1056328123300021405SoilMHDPVGMSEEEMEIAFREVQNELKTCKEQVASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL
Ga0210387_1109832413300021405SoilMNEEEIENHFREIRNELKACKEQIGTLKDDLKILLKHRSIACRDIVDTSRQLGNRAFDQSDVQKCREFLEKYNVRQ
Ga0210387_1118949813300021405SoilMNEKEIQARFEEIQNELEEQKAQLSAFKEDLKMLLRHRSIACRDIVETAQQLGNSAFDQSDVQKCEEFLKKYGVRKGF
Ga0210394_1084177223300021420SoilLSDVMNEEEIENGFREIRNELKTCKEQIASLKEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLQRYNIRV
Ga0210384_1168634613300021432SoilMNEEEIESDLREIRNELKTCKEQIAALKEDLKILLKHRSIAHRDIIDTAHQLGNQAFDQSDVQKCREFLLRYNIRV
Ga0213878_1003573723300021444Bulk SoilMNEKGIEARFAEIRNELKTCQEQLDSFKEDLKILFKHHSLAPRDILETAKQVGNAAFDQSDVHRCREFSKKYGLRATRER
Ga0210390_1064062113300021474SoilLPKVDHFKGMNEAEIETRFEQIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0210392_1039892123300021475SoilMNEKEIQARFGEIQNELETQRAQLAAFQEDLKILLRHRSIACRDIVETAQQLGNSAFDQSDVQKCEEFLRKYGVRKGF
Ga0210392_1121718613300021475SoilMHDLVVMTEEEIESTFQEFRNELKTCKEQIASLNEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQS
Ga0210402_1098964813300021478SoilMHDLVVMTEEEIESTFQEFRNELKTCKEQVASLNEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL
Ga0210409_1057828213300021559SoilMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRYREFLKKYDLRKRI
Ga0126371_1034110723300021560Tropical Forest SoilMDEKEIETRFEEIRKELKGYQEQLDSFKEDLKILLKHHSLAPRDILDTAKQVGNAAFDQSDVHKCREFLKKYDLRKGF
Ga0222729_107062813300022507SoilARFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFMKKYNLRKGY
Ga0242660_103537413300022531SoilRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKRI
Ga0242662_1019746313300022533SoilRFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFMKKYNLRKGY
Ga0212123_1002047233300022557Iron-Sulfur Acid SpringMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0242654_1030347513300022726SoilRFEELRSELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFMKKYNLRKGY
Ga0224572_102856013300024225RhizosphereMNEEEFEHRFDEIRNELKAYKEQLDSFKEDLKILLKHRSIAHRDIVDTAQQLGNAAFDQSDVHRCRKFLEKYEIRKGF
Ga0179589_1040936913300024288Vadose Zone SoilMKAEEIEHGFQEIRNELETCKQQIASLKEDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLQRYNIRV
Ga0179591_104438433300024347Vadose Zone SoilMNDEEMENRFGELRDELRAFKNELDSVREDLKLLLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLEKYNIRL
Ga0207671_1146980413300025914Corn RhizosphereMNAEEIEHGFQEIRNELETCKQQIASLKEDLRILLKHRSIAHRDIIDTAHQVGNRAFDQSDVQKCREFLQRYNIRV
Ga0209648_1003426743300026551Grasslands SoilMDEEEIEARFDEIRYELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0207758_101732623300026895Tropical Forest SoilMKVNHLRSMKEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILETAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0209418_107259023300027371Forest SoilNDEEIEARFEEIRNELKAQKEQLDAFREELKILLKHHSIAPRDIVDTAQELGNAAFDQSDVRKCREFMKKYDVRRGF
Ga0209332_110606713300027439Forest SoilLESLLQRXIILGCMTEDEIENRFQEIRNELNAYKEQLDSYKEDLKILLKHRSIAPRDIVDTAKQFGNRAFDQSDVRKCQEFLKKYEIRKGF
Ga0209009_100431933300027667Forest SoilMNEDEIENRFQEIRNELNAYKEQLDSFKEDLKILLKHRSIAPRDIVDTAKQFGNRAFDQSDVRKCQEFLKKYEIRKGF
Ga0209009_104953813300027667Forest SoilMNEEEIQSHFLQIRNELKICNEQIASLKEDLKILLKHRSIAHRDIVDTAHQLGNRAFDQSDVQKCREFLERYNIRV
Ga0207826_101530713300027680Tropical Forest SoilMKEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILETAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0209626_115022923300027684Forest SoilMTEDEIENRFQEIRNELNAYKEQLDSYKEDLKILLKHRSIAPRDIVDTAKQFGNRAFDQSDVRKCQEFLKKYEIRKGF
Ga0209446_110012613300027698Bog Forest SoilMNEQEIETRFEEIQNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0209656_10000114263300027812Bog Forest SoilMNEQEIETRFEELKNELKSQKEQLDALKEDIKILLKHRSIAPRDIVDTAQQIGNAAFDQSDVHRCREFLKKYDVRKGL
Ga0209656_1001743623300027812Bog Forest SoilMNEQEIETRFEELKNELKAQKEQLDALKEDLKILLKHRSIAPRDIVDTAQQIGNAAFDQSDVHRCREFLKKYDVRKGL
Ga0209656_1003407313300027812Bog Forest SoilMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0209039_1009460723300027825Bog Forest SoilLPTLTTVDHLKSMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0209275_1070101713300027884SoilMHDFIGMNEEEIENAFREFRNELKTCKEQVASLKEDLKILLKHRSIPHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL
Ga0268266_1054208513300028379Switchgrass RhizosphereMNDDEMENHFGELRDELRAFKKELDSVREDLKLLLKHRSIAHRDIIDTAQQLGNRAFDQSDVQKCREFLQKYNLRP
Ga0302222_1030514013300028798PalsaENRFREIRNELNACKEEVASLKEDLKILLKHRSIAHGDIVDTAHQIGNRAFDQSDVQKCREFLEKYNIRV
Ga0302218_1022805313300028863PalsaNELNACKEEVASLKEDLKILLKHRSIAHGDIVDTAHQIGNRAFDQSDVQKCREFLEKYNIRV
Ga0308309_1038494823300028906SoilMENRFRDFRNELKACNEQIASLREDLKILLKHRSIAHRDIIDTAHQLGNRAFDQSDVQKCREFLERYNIRV
Ga0311340_1114663323300029943PalsaMNEEEIESHFREIRNELKACNEQVATLKEDLKILLKHRSIACRDIVDTSRQLGNRAFDQSDVQKCREFLEKYNIRL
Ga0311339_1061384513300029999PalsaKARFEEIQNELEAQRAQLAGLKEDLEILLRHRSIACRDIVETAQQHGNSAFDQSDVHKCQEFLKKYGVRKGF
Ga0311338_1057653813300030007PalsaMNENEIKARFEEIQNELEAQRAQLAGLKEDLEILLRHRSIACRDIVETAQQHGNSAFDQSDVHKCQEFLKKYGVRKGF
Ga0302179_1001225223300030058PalsaMNEEEIENRFREIRNELNACKEEVASLKEDLKILLKHRSIAHGDIVDTAHQIGNRAFDQSDVQKCREFLEKYNIRV
Ga0311353_1104556923300030399PalsaMNEEEIENHFREIRNELKACNEQIGTLKEDLKILLKHRSIACRDIVDTSRQLGNRAFDQSDVQKCREFLQKYDVRQ
Ga0210274_147961313300030531SoilELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFLKKYNLRKGY
Ga0210290_165569113300030532SoilNEKEVEARFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFLKKYNLRKGY
Ga0210286_125893723300030597SoilFAVSRMSRQAIRRCSNRILRSSLNEKEVEARFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFLKKYNLRKGY
Ga0265459_1216109123300030741SoilFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFMKKYNLRKGY
Ga0265461_1045464623300030743SoilMSRQAIRRCSNRILRSSLNEKEVEARFEELRNELKGYQEQLDSFREDLKILFEHRLIACRDILDTAKQVGNAAFDQSDVHRCQEFLKKYNLRKGY
Ga0170834_11090624913300031057Forest SoilEEEIEGTFQEFRNELKTCKEQVASLKEDLNILLKHRSIAHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL
Ga0170824_10345132513300031231Forest SoilMHDFVGMTEEEIEGTFQEFRNELKTCKEQVASLKEDLNILLKHRSIAHRDIIDTAHQLGNRAFDQSDVLKCRAFLEKYHIRL
Ga0302325_1016550733300031234PalsaLHSLTNPNHLLGMNENEIKARFEEIQNELEAQRAQLAGLKEDLEILLRHRSIACRDIVETAQQHGNSAFDQSDVHKCQEFLKKYGVRKGF
Ga0310915_1026639713300031573SoilMNEEEIKSRFEEIRNELKAQKEQLEAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRRGF
Ga0310686_10593700533300031708SoilMNEEEIEARFEEIRNELKAQQEQLDAFKEDLIILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLK
Ga0310686_10955008623300031708SoilMNEEEFEHRFDEIRNELKAYKEQLDSFKEDLKILLKHRSIAHRDIVDTAQQLGNAAFDQSDVH
Ga0307479_1129837013300031962Hardwood Forest SoilVKVNHLGNMNEEEIEARFEEIRNQLKAQKEQLDAFKEDLMILFKHRSIASRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0318533_1035623313300032059SoilLLFKDSMNEEEIKSRFEEIRNELKAQKEQLEAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRRGF
Ga0318504_1036998113300032063SoilNEEEIKSRFEEIRNELKAQKEQLEAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRRGF
Ga0311301_1029295333300032160Peatlands SoilMNEEQIENRFEEIRSELKAQKEQLDSLKEDLRILLKHRSIAPRDILDTAQQLGNAAFDQIDVHKCQEFLKKYEIRRGF
Ga0311301_1096345223300032160Peatlands SoilLPTLAKVNHLKSMNEAEIETRFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKRYEIRRGF
Ga0311301_1106078023300032160Peatlands SoilMNEEQIENRFEEIRSELKAQKEQLDSLKEDLRILLKHRSIAPRDILDTAQQLGNAAFDQTDVHKCQEFLKKYDIRRGF
Ga0311301_1136683823300032160Peatlands SoilMKVKHLRGMNEEEIEARFEEIRNELKAQKEQLDAFKEDLKILLKHRSIAPRDILDTAQQLGNSAFDQSDVHKCQEFLKKYEIRRGF
Ga0307472_10023429633300032205Hardwood Forest SoilMNEEEIETRFEEIRNELKAQKEQLDAFKEDLKILFKHHSIAPRDILDTAQQLGNAAFDQSDVRRCREFLKKYDLRKGI
Ga0306920_10357995123300032261SoilLESPVRVSHRKRMNEEEIETRFEEIRNELKAQKERLDAFKEDLKILFKHHSIAPRDILDTAKQLGNAALDQSDVHRCREFLKKYDLRKGI
Ga0348332_1306232313300032515Plant LitterMNEEEFEHRFDEIRNELKAYREQLDSFKEDLKILLKHRSIAHRDIVDTAQQLGNAAFDQSDVHRCRKFLEKYEIRKGF


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.