NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F039023

Overview

Basic Information
Family ID F039023
Family Type Metagenome / Metatranscriptome
Number of Sequences 164
Average Sequence Length 120 residues
Representative Sequence MINVIRLVITVVFGFLWWWVYNRIGAGLEYLMILGSVLAVCACCCKGGEPQREWNWNVYVACIRRCWIATLVLMVALFIIGVLVVLFTTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Number of Associated Samples 139
Number of Associated Scaffolds 164

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 56.79 %
% of genes near scaffold ends (potentially truncated) 16.46 %
% of genes from short scaffolds (< 2000 bps) 39.02 %
Associated GOLD sequencing projects 130
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.77

Most Common Taxonomy
Group Unclassified (75.610 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.220 % of family members)
Environment Ontology (ENVO) Unclassified (41.463 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (40.244 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification Transmembrane (alpha-helical)
Signal Peptide No
Secondary Structure distribution α-helix: 68.92%, β-sheet: 0.00%, Coil/Unstructured: 31.08%

Predicted 3D Structure

pTM-score: 0.77

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
f.37.1.0: automated matches | d4q4hb1 | 4q4h | 0.66623
f.37.1.1: ABC transporter transmembrane region | d6bpla1 | 6bpl | 0.6396
f.37.1.0: automated matches | d6raha1 | 6rah | 0.62216
f.37.1.1: ABC transporter transmembrane region | d4a82a1 | 4a82 | 0.61679
f.41.1.0: automated matches | d6z3ta_ | 6z3t | 0.61346
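
As a reading aid, a TM-score above roughly 0.5 is commonly taken to suggest that two structures share the same overall fold. The sketch below applies that rule of thumb to the five SCOPe matches listed above. The 0.5 cutoff and the names `SCOPE_MATCHES` and `same_fold_candidates` are illustrative assumptions, not part of NMPFamsDB.

```python
# (SCOP family, SCOP domain, TM-score) copied from the table above.
SCOPE_MATCHES = [
    ("f.37.1.0", "d4q4hb1", 0.66623),
    ("f.37.1.1", "d6bpla1", 0.6396),
    ("f.37.1.0", "d6raha1", 0.62216),
    ("f.37.1.1", "d4a82a1", 0.61679),
    ("f.41.1.0", "d6z3ta_", 0.61346),
]


def same_fold_candidates(matches, cutoff=0.5):
    """Return the matches whose TM-score exceeds the cutoff.

    The default cutoff of 0.5 is a rule-of-thumb assumption.
    """
    return [m for m in matches if m[2] > cutoff]


if __name__ == "__main__":
    for family, domain, score in same_fold_candidates(SCOPE_MATCHES):
        print(f"{family}\t{domain}\t{score:.3f}")
```

Under this assumed cutoff all five matches qualify, which fits the transmembrane (alpha-helical) classification reported for the family; it does not by itself establish function.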


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 164 Family Scaffolds
PF04199 | Cyclase | 8.54
PF04226 | Transgly_assoc | 4.27
PF04909 | Amidohydro_2 | 3.66
PF01926 | MMR_HSR1 | 1.83
PF00496 | SBP_bac_5 | 1.22
PF02615 | Ldh_2 | 1.22
PF13279 | 4HBT_2 | 1.22
PF03150 | CCP_MauG | 1.22
PF00355 | Rieske | 0.61
PF07885 | Ion_trans_2 | 0.61
PF13517 | FG-GAP_3 | 0.61
PF13560 | HTH_31 | 0.61
PF12002 | MgsA_C | 0.61
PF01839 | FG-GAP | 0.61
PF13442 | Cytochrome_CBB3 | 0.61
PF13833 | EF-hand_8 | 0.61
PF05292 | MCD | 0.61
PF01068 | DNA_ligase_A_M | 0.61
PF10576 | EndIII_4Fe-2S | 0.61
PF00211 | Guanylate_cyc | 0.61
PF13714 | PEP_mutase | 0.61
PF02371 | Transposase_20 | 0.61
PF01402 | RHH_1 | 0.61
PF13493 | DUF4118 | 0.61
PF02423 | OCD_Mu_crystall | 0.61
PF11512 | Atu4866 | 0.61
PF04392 | ABC_sub_bind | 0.61
PF08238 | Sel1 | 0.61
PF03401 | TctC | 0.61
PF01964 | ThiC_Rad_SAM | 0.61
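
The frequencies above can be read back as whole numbers of scaffolds. The sketch below assumes each percentage is the share of the 164 family scaffolds carrying that neighboring domain, rounded to two decimals; `FAMILY_SCAFFOLDS` and `scaffold_count` are hypothetical names used only for illustration.

```python
FAMILY_SCAFFOLDS = 164  # taken from the table header above


def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Convert a percentage frequency into the nearest whole scaffold count.

    Assumes the percentage was computed as count / total * 100.
    """
    return round(percent * total / 100)


if __name__ == "__main__":
    # A few (Pfam ID, % frequency) pairs copied from the table above.
    rows = [
        ("PF04199", 8.54),
        ("PF04226", 4.27),
        ("PF04909", 3.66),
        ("PF01926", 1.83),
        ("PF00496", 1.22),
        ("PF00355", 0.61),
    ]
    for pfam_id, percent in rows:
        print(pfam_id, percent, scaffold_count(percent))
```

Under that assumption, 0.61 % corresponds to a single scaffold, so most entries in the table are one-off neighbors rather than recurring genomic context.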

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 164 Family Scaffolds
COG1878 | Kynurenine formamidase | Amino acid transport and metabolism [E] | 8.54
COG2261 | Uncharacterized membrane protein YeaQ/YmgE, transglycosylase-associated protein family | General function prediction only [R] | 4.27
COG1858 | Cytochrome c peroxidase | Posttranslational modification, protein turnover, chaperones [O] | 1.22
COG2055 | Malate/lactate/ureidoglycolate dehydrogenase, LDH2 family | Energy production and conversion [C] | 1.22
COG0422 | 4-amino-2-methyl-5-hydroxymethylpyrimidine (HMP) synthase ThiC | Coenzyme transport and metabolism [H] | 0.61
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 0.61
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 0.61
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.61
COG2423 | Ornithine cyclodeaminase/archaeal alanine dehydrogenase, mu-crystallin family | Amino acid transport and metabolism [E] | 0.61
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.61
COG3181 | Tripartite-type tricarboxylate transporter, extracytoplasmic receptor component TctC | Energy production and conversion [C] | 0.61
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.61


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 75.61 %
All Organisms | root | All Organisms | 24.39 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2228664022|INPgaii200_c0947607 | Not Available | 692
3300000364|INPhiseqgaiiFebDRAFT_101682343 | Not Available | 1147
3300000364|INPhiseqgaiiFebDRAFT_101683076 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Acidiferrobacterales → Acidiferrobacteraceae → unclassified Acidiferrobacteraceae → Acidiferrobacteraceae bacterium | 1699
3300000364|INPhiseqgaiiFebDRAFT_101686022 | Not Available | 641
3300002917|JGI25616J43925_10042454 | All Organisms → cellular organisms → Bacteria | 1988
3300005332|Ga0066388_100087454 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3523
3300005332|Ga0066388_100620168 | All Organisms → cellular organisms → Bacteria | 1701
3300005713|Ga0066905_101994745 | Not Available | 538
3300006047|Ga0075024_100496015 | Not Available | 639
3300007255|Ga0099791_10006623 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4774
3300007258|Ga0099793_10339419 | Not Available | 733
3300007265|Ga0099794_10037192 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 2304
3300007788|Ga0099795_10010778 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2706
3300009088|Ga0099830_10053472 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2859
3300009090|Ga0099827_11091352 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 693
3300009120|Ga0117941_1123986 | Not Available | 708
3300009143|Ga0099792_10012771 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3583
3300009143|Ga0099792_10439180 | Not Available | 806
3300009678|Ga0105252_10077879 | Not Available | 1299
3300010043|Ga0126380_11038004 | Not Available | 693
3300010358|Ga0126370_10168227 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1619
3300010358|Ga0126370_10698592 | All Organisms → cellular organisms → Bacteria | 891
3300010362|Ga0126377_10333084 | All Organisms → cellular organisms → Bacteria | 1508
3300010362|Ga0126377_11044459 | Not Available | 884
3300010362|Ga0126377_13394425 | Not Available | 515
3300010398|Ga0126383_12149475 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 645
3300011270|Ga0137391_10096252 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 2562
3300011270|Ga0137391_10496950 | Not Available | 1033
3300011410|Ga0137440_1020805 | Not Available | 1132
3300011425|Ga0137441_1029620 | Not Available | 1174
3300011429|Ga0137455_1231950 | Not Available | 546
3300011445|Ga0137427_10218894 | Not Available | 794
3300012096|Ga0137389_10427534 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1133
3300012096|Ga0137389_11770950 | Not Available | 513
3300012113|Ga0137328_1026473 | Not Available | 574
3300012161|Ga0137336_1097016 | Not Available | 547
3300012189|Ga0137388_10385269 | Not Available | 1297
3300012202|Ga0137363_10940722 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 734
3300012206|Ga0137380_11082291 | Not Available | 683
3300012231|Ga0137465_1247087 | Not Available | 528
3300012351|Ga0137386_10477608 | Not Available | 898
3300012354|Ga0137366_11153847 | Not Available | 530
3300012361|Ga0137360_10412695 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1140
3300012361|Ga0137360_10668912 | Not Available | 891
3300012362|Ga0137361_10575036 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1033
3300012363|Ga0137390_11599544 | Not Available | 589
3300012685|Ga0137397_10190415 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1525
3300012917|Ga0137395_10037838 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2962
3300012922|Ga0137394_11268127 | Not Available | 600
3300012929|Ga0137404_10384161 | Not Available | 1235
3300012930|Ga0137407_11291007 | Not Available | 693
3300012944|Ga0137410_10018144 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4817
3300012944|Ga0137410_10774455 | Not Available | 804
3300012971|Ga0126369_10665469 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1116
3300014878|Ga0180065_1038975 | Not Available | 1003
3300015053|Ga0137405_1049405 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1616
3300015374|Ga0132255_100014215 | All Organisms → cellular organisms → Bacteria | 9502
3300018053|Ga0184626_10001377 | All Organisms → cellular organisms → Bacteria | 8524
3300018084|Ga0184629_10306348 | Not Available | 836
3300019238|Ga0180112_1333232 | Not Available | 573
3300020199|Ga0179592_10520061 | Not Available | 508
3300021086|Ga0179596_10003497 | All Organisms → cellular organisms → Bacteria | 4468
3300021086|Ga0179596_10324151 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 771
3300026304|Ga0209240_1072730 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1277
3300026358|Ga0257166_1002944 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1779
3300026377|Ga0257171_1033396 | Not Available | 881
3300026551|Ga0209648_10029243 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4815
3300027671|Ga0209588_1045486 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_02_FULL_67_21 | 1420
3300027846|Ga0209180_10085438 | All Organisms → cellular organisms → Bacteria | 1785
3300027874|Ga0209465_10310771 | All Organisms → cellular organisms → Bacteria | 790
3300027882|Ga0209590_10796920 | Not Available | 600
3300027894|Ga0209068_10403661 | Not Available | 779
3300027903|Ga0209488_10330453 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1135
3300027915|Ga0209069_10449956 | Not Available | 715
3300028536|Ga0137415_10003286 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 16065
3300031199|Ga0307495_10019953 | All Organisms → cellular organisms → Bacteria | 1120
3300031561|Ga0318528_10755963 | Not Available | 520
3300032174|Ga0307470_10037539 | All Organisms → cellular organisms → Bacteria | 2369
3300032205|Ga0307472_102175180 | Not Available | 559
3300033433|Ga0326726_10019475 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5900
3300034090|Ga0326723_0014394 | All Organisms → cellular organisms → Bacteria | 3166
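
The scaffold identifiers above appear to follow the pattern `<Taxon OID>|<scaffold name>`: the numeric prefixes (for example 2228664022 and 3300000364) recur as Taxon OIDs in the Associated Samples table. The sketch below splits an identifier on that assumption; `ScaffoldRef` and `parse_scaffold_id` are hypothetical helper names, not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # numeric sample identifier before the "|"
    scaffold_name: str   # scaffold name after the "|"


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<Taxon OID>|<scaffold name>' into its two parts.

    The format is inferred from the listing above, not from a specification.
    """
    taxon_oid, sep, name = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, name)


if __name__ == "__main__":
    ref = parse_scaffold_id("3300000364|INPhiseqgaiiFebDRAFT_101682343")
    print(ref.taxon_oid, ref.scaffold_name)
```

Grouping scaffolds by `taxon_oid` in this way gives one route to joining the scaffold list to the sample list further down the page.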



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.22%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 10.37%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 9.76%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 5.49%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 5.49%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.88%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.66%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.05%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.44%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.44%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.44%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.44%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.83%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.83%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.22%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.22%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.22%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.22%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.22%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.22%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.22%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.22%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.61%
Lake Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Lake Sediment | 0.61%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.61%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 0.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.61%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.61%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.61%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.61%




Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005455Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaGHost-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005841Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2Host-AssociatedOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006871Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009101Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaGHost-AssociatedOpen in IMG/M
3300009120Lake sediment microbial communities from Tanners Lake, St. Paul, MNEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009678Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100EnvironmentalOpen in IMG/M
3300010043Tropical forest soil microbial communities from Panama - MetaG Plot_26EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011119Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaGHost-AssociatedOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011410Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT222_2EnvironmentalOpen in IMG/M
3300011425Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT244_2EnvironmentalOpen in IMG/M
3300011429Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2EnvironmentalOpen in IMG/M
3300011445Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012113Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT100_2EnvironmentalOpen in IMG/M
3300012161Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT300_2EnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012231Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT828_2EnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012354Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012508Arabidopsis rhizosphere microbial communities from North Carolina - M.Col.2.old.270510Host-AssociatedOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300014878Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200A_16_10DEnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015053Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016270Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080EnvironmentalOpen in IMG/M
3300016294Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178EnvironmentalOpen in IMG/M
3300016387Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176EnvironmentalOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018086Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MGEnvironmentalOpen in IMG/M
3300019238Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT466_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022531Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025900Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027671 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) Environmental
3300027846 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) Environmental
3300027874 Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes) Environmental
3300027882 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) Environmental
3300027894 Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) Environmental
3300027903 Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) Environmental
3300027915 Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) Environmental
3300028381 Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) Host-Associated
3300028536 Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) Environmental
3300031199 Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S Environmental
3300031545 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26 Environmental
3300031546 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f23 Environmental
3300031547 Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 Environmental
3300031561 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26 Environmental
3300031781 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20 Environmental
3300031893 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f28 Environmental
3300031941 Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 Environmental
3300031942 Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 Environmental
3300031947 Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H Environmental
3300031981 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f25 Environmental
3300032010 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22 Environmental
3300032012 Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3 Environmental
3300032039 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f21 Environmental
3300032052 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19 Environmental
3300032075 Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 Environmental
3300032094 Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25 Environmental
3300032174 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 Environmental
3300032179 Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D2 Environmental
3300032205 Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 Environmental
3300032261 Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) Environmental
3300032770 Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 Environmental
3300032805 Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 Environmental
3300032892 Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 Environmental
3300033289 Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108 Environmental
3300033433 Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN Environmental
3300034090 Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N Environmental

Geographical Distribution (interactive map, powered by OpenStreetMap)




Family Sequences

Protein ID, Sample Taxon ID, Habitat, Sequence
INPgaii200_094760712228664022SoilMVIVFQIVITVVFGLLWWWVFNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMISLVIIGILAVIWATGLPSLGAVLASIFIASIFAPLFVRLICCAYES
INPhiseqgaiiFebDRAFT_10045253913300000364SoilMPQIIITIVFGFLWWWVYNRVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDDYRACIGRCFWQTLEPNIALVVIGILVVLYTTGAFPTGVVLTNILLAAIFAPLFVRIICCAWE
INPhiseqgaiiFebDRAFT_10045618813300000364SoilMPQIFITVVFGLLWWRVFTYVGADLSYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIARCVWQTLALIIALFIIGILVVFYTTGAFPSGVLLTKIVLAAVAAPLVVRAICCAYE
INPhiseqgaiiFebDRAFT_10168234333300000364SoilMVIVFQIVITVVFGLLWWFIFSRVGEDFEYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACXRRCLLATVVLXXSLXIIGXLAVIWXTGLPSXGAVXXXIXIAXIFAPLXVRLICCAYES*
INPhiseqgaiiFebDRAFT_10168307633300000364SoilMVIVFQIVITVVFGLLWWWVFNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACXRRCLLATVVLXXSLXIIGXLAVIWXTGLPSXGAVXXXIXIAXIFAPLXVRLICCAYES*
INPhiseqgaiiFebDRAFT_10168602213300000364SoilMVIVFQIVITVVFGLLWWWVFNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMISLVIIGILAVIWATGLPSLGAVLASIFIASIFAPLFVRLICCAYES*
JGI25616J43925_1004245423300002917Grasslands SoilMINVIRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEPQLEWRWDVYFTCIRRCWLATVVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0062595_10133411923300004479SoilSMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD*
Ga0062594_10163426423300005093SoilMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAI
Ga0066672_1016701923300005167SoilMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVVFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE
Ga0066388_10008745423300005332Tropical Forest SoilMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIQRCRWAVLMLTISLFIIGILVVFYTTGAFPSGVMLTNIVLVAIGAPLFVRAICCAYD*
Ga0066388_10020038323300005332Tropical Forest SoilMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIRRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD*
Ga0066388_10062016823300005332Tropical Forest SoilMNVIRLVITVVFGFLWWWIYNRVGAGLDYLMILGSVLAVCACCCKGGEPQREWRWDVYFLCLRRCWLATVVLMVALFIIGVLVVLFLTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0066388_10128272723300005332Tropical Forest SoilMPQIIITIVFGFLWWWVFRHVGPGLSYLMILGSVLAVCVCCCKGGEQQEWSWWRVNWDTYRACIGRCFWPTLALVIALVVVGILVVLYTTHAFPSGVELTNILLAAIFAPVFVRIICCAYE*
Ga0066388_10205154413300005332Tropical Forest SoilMPQIIITVVFGLLWWWVYNRIGASLDYLMILGSVLAVCVCCCKGGTEPPWGWWRFNWDNYLACIRRCWWATLMLMISLFIIGVLVVIYTTGAAPSGVVLTNIVLAAIAAPWLIRLICCAYE*
Ga0066388_10290656313300005332Tropical Forest SoilMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGVLVVFYTSGAFPTGVVSTNILLAGIFAPLFVRIICCAYE
Ga0070708_10022508413300005445Corn, Switchgrass And Miscanthus RhizosphereMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWKNYRACIGRCVWQTLALIIALVVIGILVVLFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE
Ga0070663_10015997943300005455Corn RhizosphereSNIGRKARSVSNLDIDGFRPDIWGRSMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0066905_10199474513300005713Tropical Forest SoilIITVVFGLLWWWVYNRVGVDLQYLMILGSVLAVCVCCCKGGSDPSSEWNWQRYLACIRRCWLPTLTLIVSLFIIGVLAVIIATGAGSVGAVIVNILLAAIFAPLLVRLICCAYENP*
Ga0066903_10050928123300005764Tropical Forest SoilMPQIIITIVFGFLWWLVFNYVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNLDTYRACIGRCFWPTLALIIALVVIGILVVLYTTHAFPSAAELTKILLAAIFAPLFVRIICCAYE
Ga0068863_10086005013300005841Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0075024_10049601513300006047WatershedsMTIAFQILITVVFGLLWWWVFNHVGEDLEYIMILGSVLAVCVCCCKGGAEPQREWNWKNYIACIRRCWLATLVLMVSLAIIGILAVIWATGSPPAGVLLTNIFIAAIFAPLFVRLICCAYES*
Ga0075434_10042470413300006871Populus RhizosphereMPQILITIVFGFLWWWVFNRVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDKYLACIRRCVWQTLALIIALVVIGILVVLFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAWE
Ga0075436_10045505213300006914Populus RhizosphereDIWGRSMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0099791_1000662343300007255Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGAGLDYLMILGSVLAVCACCCKGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0099793_1033941913300007258Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNHIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATCVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIG
Ga0099794_1003719223300007265Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNRIGAGLDYLMILGSVLAVCACCCKGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0099795_1001077833300007788Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYDRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0066710_10318671913300009012Grasslands SoilWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVVFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE
Ga0099830_1005347213300009088Vadose Zone SoilSMINVIRLVITVVFGFLWWWVYNRIGAGLDYLMILGSVLAVCACCCKGGEPQREWNWDVYVACIKRCWVATLVLMVALFIIGVLVVLFVTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0099827_1109135223300009090Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0111539_1141813523300009094Populus RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLT
Ga0075418_1180854213300009100Populus RhizosphereLVLAELGITAFALTYGGRSMPQIVITVVFGLLWWWVYNRVGASLDYLMILGSVLAVCVCCCKGGEPQWGWWRFNWDNYVACIYRCWWVTLMLIIALFIIGVLVVLYTTGAAPSGVVLTNIVLAAIGAPLFIRLICCAYE*
Ga0105247_1008360413300009101Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMVSLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0117941_112398613300009120Lake SedimentMTIVMRIIITVVFGFLWWWVYNRVGMGLQYLMILGSVLAICVCCCKGSSDPGEWNWQRYLACIRRCWLATAILIVSLFIIGVLVVFYLTGAAPAGVVLVNILLAAIFAPLFVRIICCA
Ga0099792_1001277113300009143Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0099792_1043918013300009143Vadose Zone SoilMINVIRLVITVVFGFLWWWVYNRIGAGLDYLMILGSVLAVCVCCCKGGEPQREWNWDVYVACIKRCWLATFVLMVALFIIGVLVVLFVTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0105252_1007787923300009678SoilMTIVMRIIITVVFGFLWWWVYNRVGMGLQYLMILGSVLAVCVCCCKGSNDPGEWNWQRYLACIRRCWLATAILIVSLFIIGVLVVFYLTGAAPAGVVLVNILLAAIFAPLFVRIICCAYENP*
Ga0126380_1084560113300010043Tropical Forest SoilMPQIIITILFGFLWWLVFNYVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWRVNWDAYRACIGRCFWPTLALVIALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0126380_1103800413300010043Tropical Forest SoilRKASALLRLSRLSPSHMGESMINVIRLVITVAFGFLWWWIYNRIGAGLDYLMILGSVLAVCVCCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALFIIGILVVVVLTGAPPGGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0126382_1151168513300010047Tropical Forest SoilMPQIIITIVFGFLWWLVFNQVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNLDAYRACIGRCFWPTLALIIALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0099796_1006747223300010159Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNHIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVALFIIGVLVVLFMTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Ga0134088_1044060223300010304Grasslands SoilMIIVIRLVITVVFGFLWWWIYNRVGAGLEYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVVFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE*
Ga0134071_1060048523300010336Grasslands SoilWIYNRVGAGLEYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVVFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE*
Ga0126370_1016822723300010358Tropical Forest SoilFGFLWWWIYNRVGAGLDYLMILGSVLAVCVCCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALLIIGVLVVIFMTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0126370_1069859223300010358Tropical Forest SoilMGESIMNVIRLVITVVFGFLWWWIYNRVGAGLDYLMILGSVLAVCACCCKGGEPQREWRWDAYVACIYRCWLATLVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0126370_1158821113300010358Tropical Forest SoilMPQILITIVFGLLWWWVFDRVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALAVIGILVVLFTSGAFPTGVVLTNILLAAIFAPLFVRLICCAYE
Ga0126376_1020031013300010359Tropical Forest SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNWDGYRACIGRCFWPTLALIIALVVIGILVVFYTTHVFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0126376_1043468433300010359Tropical Forest SoilFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIRRCRWATLTLTISLLIIGILVVFYTTGTFPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0126378_1129703513300010361Tropical Forest SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNWDGYRACIGRCFWPTLALIIALVIIGILVVLYTTHVFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0126377_1033308423300010362Tropical Forest SoilMTIVFRIIITVVFGLLWWWVYNRVGVDLQYLMILGSVLAVCVCCCKGGSDPSSEWNWQRYLACIRRCWLPTLTLIVSLFIIGVLAVIIATGAGSVGAVIVNILLAAIFAPLLVRLICCAYENP*
Ga0126377_1104445913300010362Tropical Forest SoilMINVIRLVITVVFGFLWWWIYNRIGAGLDYLMILGSVLAVCVCCCKGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVLAMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0126377_1339442513300010362Tropical Forest SoilRWWVYNHVGVDLQYIMILGSVLAVCVCCCKGTGDPQREWNWQRYFACIRRCLLVTVVLMVSLAIIGILAVIWATGSAPAGATLVNIFIASIFAPLFVRLICCAYES*
Ga0126383_1214947513300010398Tropical Forest SoilSMIIVFRLVITVVFGLLWWWVYNRVGAGLEYLMILGSVLAVCVCCCKGGEPQREWRWDVYFLCLRRCWLATVVLMVALFIIGVLVVLFLTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0105246_1029126913300011119Miscanthus RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0137391_1009625223300011270Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0137391_1049695013300011270Vadose Zone SoilMTIAFQVAITVVFGLLWWWVYNRVGAGLEYIMILGSVLAVCVCCCKGGGEPQREWNWQHYFACIRRCWLATVVLMVSLAIIGIIAVTWATGSPPAGAVLTNIFIAAIFAPLCVRLICCAYEN*
Ga0137440_102080513300011410SoilMTIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENP*
Ga0137441_102962013300011425SoilMIIAFRILITVVFGLLWWWVYNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENP*
Ga0137455_123195013300011429SoilMTIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAY
Ga0137427_1021889413300011445SoilMTIVMRIIITVVFGFLWWWVYNRVGMGLQYLMILGSVLAVCVCCCKGSNDPGEWNWQRYLACIRRCWLATAILIVSLFIIGVLVVFYLTGAGPAGVVLVNILLAAIFAPLFVRFICCAYENP*
Ga0137389_1042753413300012096Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAA
Ga0137389_1177095013300012096Vadose Zone SoilMINVIRLVITVVFGFLWWWVYNRIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWEVYVACIYRCWLATLVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCA
Ga0137328_102647313300012113SoilIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENP*
Ga0137336_109701613300012161SoilMTIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFI
Ga0137388_1038526923300012189Vadose Zone SoilMINVIRLVITVVFGFLWWWVYNRIGAGLEYLMILGSVLAVCACCCKGGEPQREWNWNVYVACIRRCWIATLVLMVALFIIGVLVVLFTTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Ga0137364_1007212343300012198Vadose Zone SoilMPQIIITIVFGFLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNLDGFRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPSGVELTNILLTAIFAPLFVRIICCAYE
Ga0137363_1006547013300012202Vadose Zone SoilMGESMMNVIRLLITVVFGFLWWWIYNHIGAGLEYLMILGSVLAVCACCCKGGEPQRDWRWDVYFACIRRCWLATLVLMVALFIIGILVLLFMTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE*
Ga0137363_1094072223300012202Vadose Zone SoilMTIAFQVAITVVFGLLWWWVYNRVGAGLEYIVILGSVLAVCVCCCKGGGEPQREWNWQHYFACIRRCWLATVVLMVSLAIIGIIAVTWATGSPPAGAVLTNIFIAVIFAPLCVRLICCAYEN*
Ga0137380_1108229113300012206Vadose Zone SoilMIIVIRLVITVVFGFLWWWIYNRVGAGLEYLMILGSVLAVCVCCCKGGEPRWEWNWGVYFACIWRCWLATLVLMVALFIIGVLVVLYMTGAPPGGAVLTNIALAAIGAPLFVRLICCAYE
Ga0137465_124708713300012231SoilSMTIVFRILITVVFGLLWWWVYNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENP*
Ga0137372_1013547723300012350Vadose Zone SoilMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVVFTSGAFPTGVVLTNILLAAIFAPLFVRIICCTYE
Ga0137386_1047760813300012351Vadose Zone SoilIVIRLVITVVFGFLWWWIYNRVGAGLEYLMILGSVLAVCVCCCKGGEPRWEWNWGVYFACIWRCWLATLVLMVALFIIGVLVVLYMTGAPPGGAVLTNIALAAIGAPLFVRLICCAYE*
Ga0137366_1115384713300012354Vadose Zone SoilMIIVIRLVITVVFGFLWWWVYNRVGAGLDYLMILGSVLAVCVCCCKGGEPRWEWNWNVYFACLWRCWLATLVLMVALFIIGVLVVLYMTGAPPGGAVLTNIVLAAIGAPLFVRLICCAYE
Ga0137360_1041269523300012361Vadose Zone SoilYNRIGTGLEYLMILGSVLAVCACCCKGGEPQREWNWDVYVACIKRCWVATLVLMVALFIIGVLVVLFVTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0137360_1066891213300012361Vadose Zone SoilMINVFRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWEVYVACIYRCWLATLVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0137361_1057503623300012362Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0137390_1159954413300012363Vadose Zone SoilMTIAFQVAITVVFGLLWWWVYNRVGAGLEYIMILGSVLAVCVCCCKGGGEPQREWNWQQYLACIRRCWLATVVLMVSLAMIGIIAVTWATGSPPAGAVLTNIFIAAIFAPLCVRLICCTYES*
Ga0157315_102186613300012508Arabidopsis RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVYCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLF
Ga0137358_1000878853300012582Vadose Zone SoilMMNVIRLVITVVFGFLWWWIYNHIGAGLEYLMILGSVLAVCACCCKGGEPQREWRWDVYFACIRRCWIATLVLMVALFIIGVLVVLFMTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Ga0137397_1019041513300012685Vadose Zone SoilIRLVITVVFGFLWWWIYNRIGAGLDYLMILGSVLAVCACCCKGGEPKREWSWAVYFACIMRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0137395_1003783823300012917Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0137394_1126812713300012922Vadose Zone SoilMVIVFQIVITVVFGLLWWWVFNRVGEDFEFIMILGSVLAVCVCCCKGTGDPHSEWSWQRYIACIRRCWLATVVLIVSLFIIGVLAVIWATGAAPAGAVLFNILIASIFAPLFVRLICCAYEN*
Ga0137404_1038416123300012929Vadose Zone SoilMAVAAFARNIGRDSMVIVFQIVITVVFGLLWWWVFNRVGEDFEYIMILGSVLAVCVCCCKGTGDPQREWSWQRYIACIRRCWLATVVLMVSLFIIGVLAVIWATGAPPAGAVLFNILIASIFAPLFVRLICCAYES*
Ga0137407_1129100723300012930Vadose Zone SoilMAVAAFARNIGRDSMVIVFQIVITVVFGLLWWWVFNRVGEDFEYIMILGSVLAVCVCCCKGTGDPQREWSWQRYIACIRRCWLATVVLMVSLFIIGVLAVIWATGAPPAGAVLFNILIASIFAPLFVRLICCAY
Ga0137410_1001814443300012944Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0137410_1077445523300012944Vadose Zone SoilMAVAAFARNIGRDSMVIVFQIVITVVFGLLWWWVYNRVGAGLEYIVILGSVLAVCVCCCKGTGDPHSEWSWQRYIACIRRCWLATVVLIVSLFIIGVLAVIWATGAAPAGAVLFNILIASIFAPLFVRLICCAYEN*
Ga0126375_1166227313300012948Tropical Forest SoilMGGENMPQIIITIVFGFLWWWVFNHVGPGLTYLMILGSVLAVCVCCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALFIIGVLVVIFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE*
Ga0126369_1066546913300012971Tropical Forest SoilMINVIRLVITVVFGFLWWWIYNRVGAGLDYLMILGSVLAVCVCCCRGGEQQRGWSWDVYFACIRRCWIATLVLMVALFIIGVLVVIFMTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE
Ga0164305_1027885923300012989SoilMPIRILITVVFGLLWWWLYNHVGASLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD*
Ga0163162_1043124013300013306Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCA
Ga0157375_1303243813300013308Miscanthus RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGASLFVRAICCAYD*
Ga0163163_1056024733300014325Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFV
Ga0180065_103897513300014878SoilMTIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENS*
Ga0157379_1064813813300014968Switchgrass RhizosphereSVSNLDIDGFRPDIWGRSMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMVSLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0137405_104940513300015053Vadose Zone SoilWWWVFNRVVGEDFEYIMILGSVLAVCVCCCKGTGDPQREWSWQRYIACIRRCWLATVVLMVSLAIIGVLAVIWATGAPPAGAVLFNILIASIFAPLFVRLICCAYES*
Ga0132258_1221854723300015371Arabidopsis RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD*
Ga0132257_10020751413300015373Arabidopsis RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLSYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0132255_100014215163300015374Arabidopsis RhizosphereVSNLDIDGFRPDIWGRSIPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD*
Ga0132255_10025108413300015374Arabidopsis RhizosphereMPIRILITVVFCLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD*
Ga0182036_1035353023300016270SoilMPQIIITVVFGFLWWWVFRHVGPGLTYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAY
Ga0182041_1007220623300016294SoilMPQIIITIVFGFLWWWVFRHVGPGLTYLMVLGSVLAVCVCCCKGGEPREWGWWSVNWDGYRACIARCFWPTLALVIALVVIGILVVVYTTHAFPGGAELTNILLAAIFAPLFVRIICCAY
Ga0182040_1161570213300016387SoilMPQIIITILAGFLWWFIFHQVGPGLTYLMILGSVLAVCVCCCKGGEQREWGWWSVNWDGYRACIARCFWPTLALVIALIVIGILVVLYTTHAFPSSVELR
Ga0182038_1028930323300016445SoilMPQIIITTVFGFLWWWVFRHVGPGLTYLMVLGSVLAVCVCCCKGGEPREWGWWSVNWDGYRACIGRCFWPTLALVIGLVVLGILVVIYTTHAFPGGAELTNILLAAIFAPLCVRIICCAY
Ga0187779_1009238833300017959Tropical PeatlandMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSVNLDGYRACIGRCLWPTLALIIALVVIGILVVLYTTHVFPSGGVLTNILLAGICAPLFVRIICCAYE
Ga0187778_1001487643300017961Tropical PeatlandMSAQSPISASNLDLAALAYWGTNMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKAGEPQWGWWSVNLDGYRACVGRCFWPTLALIIALVVIGILVVLYTTHVFPSGGVLTNILLAGICAPLFVRIICCAYE
Ga0187778_1048409523300017961Tropical PeatlandMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSVNLDGYRACIGRCLWPTLALIIALVVIGILVVLYTTHVFPSGVELTNILLAGIFAPLFVRIICCAYE
Ga0184626_1000137783300018053Groundwater SedimentMTILLRIIITVVFGLLWWWVYNRVGVGLQYLVILGSVLAVCVCCCKGSSDPGEWNWQRYFACIRRCWLPTVVLMVSLFIIGVLVVFWATGAAPAGAVLVNILLAAIFAPAFVRFICCAYENP
Ga0184629_1030634813300018084Groundwater SedimentMTIVFRILITVVFGLLWWWVFNRVGVSLQYLMILGSVLAVCVCCCKGGSDPQGEWNWQRYLACIRRCWLPTAILIVSLFIIGVLAVFIATGMAPPGAVLVNILLAAIFAPLFVRFICCAYENP
Ga0187769_1003018033300018086Tropical PeatlandMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKAGEPQWGWWSVNLDGYRACVGRCFWPTLALIIALVVIGILVVLYTTHVFPSGGVLTNILLAGICAPLFVRIICCAYE
Ga0180112_133323213300019238Groundwater SedimentMTIVMRIIITVVFGFLWWWVYNRVGMGLQYLMILGSVLAVCVCCCKGSNDPGEWNWQRYLACIRRCWLATAILIVSLFIIGVLVVFYLTGAAPAGVVLVNILLAAIFAPLFVRIICCAYENP
Ga0179592_1052006113300020199Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0179596_1000349743300021086Vadose Zone SoilMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0179596_1032415113300021086Vadose Zone SoilVVFGLLWWWVYNRVGAGLEYIMILGSVLAVCVCCCKGGGEPQREWNWQQYLACIRRCWLATVVLMVSLAMIGIIAVTWATGSPPAGAVLTNIFIAAIFAPLCVRLICCAYEN
Ga0126371_1031428923300021560Tropical Forest SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLSVCVCCCKGGEPQWGWWSVNLDGYRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPTGVLLTNILLAAIWHLYSCASSAALTNKVIPRGER
Ga0242660_112610013300022531SoilMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDKYRACIGRCVWQTLALIIALVVIGILVVLFTGGALTGVVLTNILLAGIFAPLFVRIICCAYE
Ga0179591_115099853300024347Vadose Zone SoilVVWIYNHIGAGLEYLMILGSVLAVCACCCKGGEPQREWRWDVYFACIRRCWIATLVLMVALFIIGVLVVLFMTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Ga0207710_1018368313300025900Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMVSLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAY
Ga0207684_1096405413300025910Corn, Switchgrass And Miscanthus RhizosphereMPQILITIVFGLLWWWVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWNFNWDNYRACIGRCVWQTLALIIALVVIGILVVLFTSGAFPTGVVLTNILLAAIFAPLFVRIICCAYE
Ga0207661_1213648113300025944Corn RhizosphereLITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD
Ga0207641_1088359913300026088Switchgrass RhizosphereMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAY
Ga0209240_107273013300026304Grasslands SoilMGESMINVIRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEPQLEWRWDVYFTCIRRCWLATVVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209131_103261023300026320Grasslands SoilMNVIRLVITVVFGFLWWWIYNHIGAGLEYLMILGSVLAVCACCCKGGEPQREWRWDVYFACIRRCWIATLVLMVALFIIGVLVVLFMTGAPPSGVVLTNILLAAIGAPLFVRLICCAYE
Ga0257166_100294423300026358SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0257171_103339613300026377SoilLSAAVAWAQSLGPLLRLSPSHMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWGVYFACIRRCWIATLVLMVSLFIIGVLVVLFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209474_1050435413300026550SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNLDAYRACIGRCFWPTLALIIALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0209648_1002924343300026551Grasslands SoilSHMGESMINVIRLVITVVFGFLWWWIYDHIGAGLEYLMILGSVLAVCACCCKGGEPQLEWRWDVYFTCIRRCWLATVVLMVALFIIGVLVVLFMTGTPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209588_104548613300027671Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGAGLDYLMILGSVLAVCACCCKGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209180_1008543823300027846Vadose Zone SoilMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209465_1031077123300027874Tropical Forest SoilMINVIRLVITVVFGFLWWWIYNRVGAGLDYLMILGSVLAVCVCCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALFIIGVLVVLFLTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209590_1079692013300027882Vadose Zone SoilPSHMGESMINVIRLVITVVFGFLWWWIYNRIGAGLEYLMILGSVLAVCACCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALFIIGVLVVLFMTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209068_1040366113300027894WatershedsMVIAFQVAITVVFGLLWWWVYNRVGADLEYIMILGSVLAVCVCCCRGGGEPQREWNWQHYLACIRRCWLATVVLMVSLAIIGILAVIWATGSPPVAAMLTNIFIAAIFAPLFVRLICCAYEN
Ga0209488_1033045323300027903Vadose Zone SoilMINVIRLVITVVFGFLWWWVYNRIGAGLDYLMILGSVLAVCACCCKGGEPQREWNWDVYVACIRRCWIATFVLMVALFIIGVLVVLFMTGAPPGGVVLTDILLAAIGAPLFVRLICCAYE
Ga0209069_1044995613300027915WatershedsMTIAFQILITVVFGLLWWWVFNHVGEDLEYIMILGSVLAVCVCCCKGGAEPQREWNWKNYIACIRRCWLATLVLMVSLAIIGILAVIWATGSPPAGVLLTNIFIAAIFAPLFVRLICCAYES
Ga0268264_1243895613300028381Switchgrass RhizosphereRPDIWGRSMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMVSLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAYD
Ga0137415_1000328693300028536Vadose Zone SoilLSAAVAWAQSLGPLLRLSPSHMGESMINVIRLVITVVFGFLWWWIYNRIGTGLEYLMILGSVLAVCACCCRGGEPQREWSWDVYFACIRRCWIATLVLMVSLFIIGVLVVIFTTGAPPSGVVLTDILLAAIGAPLFVRLICCAYE
Ga0307495_1001995323300031199SoilMVIVFQIVITVVFGLLWWWVFNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMVSLVIIGILAVIWATGLPSLGAVLVNILIAAIFAPL
Ga0318541_1023650123300031545SoilMPQIIITIVFGFLWWWVFRHVGPGLAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAY
Ga0318538_1003988463300031546SoilNSMPQIIITIVFGFLWWWVFRHVGPGLAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0310887_1044485013300031547SoilRSMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD
Ga0318528_1075596313300031561SoilNVIRLVITVVFGFLWWWIYNRVGAGLDYLMILGSVLAVCVCCCRGGEQQREWSWDVYFACIRRCWIATLVLMVALFIIGVLVVIFMTGSAPTGVVLTDILLAAIGAPLFVRLICCAYE
Ga0318547_1024581413300031781SoilMPQIIITIVFGFLWWWVFRHVGPGLAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAI
Ga0318536_1066717313300031893SoilTIVFGFLWWWVFRHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSFNLDTYRACIGRCFWPTLALIIALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0310912_1086391823300031941SoilMPQIIITIVFGFLWWWVFRHVGPGLTYLMVLGSVLAVCVCCCKGGEPREWGWWSVNWDGYRACIGRCFWPTLALVIGLVVLGILVVIYTTHAFPGGAELTNILLAAIFAPLCVRIICCAY
Ga0310916_1150212623300031942SoilMPQIIITILAGFLWWFIFHQVGPGLTYLMILGSVLAVCVCCCKGGEQREWGWWSVNWDGYRACIARCFWPTLALVIALIVIGILVVLYTTHAFPSSVELRNIL
Ga0310909_1006401053300031947SoilMPQIIITIVFGFLWWWVFRHVGPGLAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIARCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAY
Ga0318531_1041644723300031981SoilAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0318569_1043889813300032010SoilMPQIIITIVFGFLWWWVFRHVGPGLTYLMVLGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAY
Ga0310902_1011103423300032012SoilMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGKFPSGVDLTNIVLVAIGAPLFVRAICCAY
Ga0318559_1019598423300032039SoilMPQIIITIVFGFLWWWVFRHVGPGLAYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIGRCLWPTLALIVALVVIGILVVLYTTHAFPGGAELTNILLAAIFAPLFVRIICCAY
Ga0318506_1050900213300032052SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSVNLDGYRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPTGVLLTNILLAAILAPLF
Ga0310890_1127494913300032075SoilSVRVEPRHDGFRPDTWGEKMPIRILITVVFGLLWWWLYNHVGAGLTYLMILGSVLAVCVCCCKGNAPPEWGYNFNWDNYFACIQRCRWATLTLTISLFIIGILVVFYTTGAVPSGVVLTNIVLVAIGAPLFVRAICCAYD
Ga0318540_1014707833300032094SoilYLVILGSVLAVCVCCCKGGEQQEWGWWRVNWDTYRTCIARCLWPTLALIVALVVIGILVVLYTTHAFPSGVELTNILLAAIFAPLFVRIICCAYE
Ga0307470_1003753923300032174Hardwood Forest SoilMTIVFRIIITVVFGLLWWWVYNRVGVDLQYLMILGSVLAVCVCCCKGGSDPSSEWNWQRYLACIRRCWLPTLVLIVSLFIIGVLAVIIATGAGSVGAVILNILLAAIFAPLFVRFICCAYENP
Ga0310889_1032616123300032179SoilMPIRILITVVFGLLWWWLYNHVGAGLNYLMILGSVLAVCVCCCKGGEPPQWGYNFNWDNYFACIRRCRWATLMLMISLFIIGILVVFYTTGAVPSGVVLTNIVLV
Ga0307472_10217518013300032205Hardwood Forest SoilMVIVFQIVITVVFGLLWWWVFNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMVSLVIIGILAVIWATGLPSLGAVLASIFIASIFAPLFVRLICCAYES
Ga0306920_10114141513300032261SoilQIIITIVFGFLWWWVFRHVGPGLTYLMVLGSVLAVCVCCCKGGEPREWGWWSVNWDGYRACIGRCFWPTLALVIGLVVLGILVVIYTTHAFPGGAELTNILLAAIFAPLCVRIICCAYE
Ga0306920_10443796913300032261SoilILAGFLWWFIFHQVGPGLTYLMILGSVLAVCVCCCKGGEQREWGWWSVNWDGYRACIARCFWPTLALVIALIVIGILVVLYTTHAFPSSVELRNILFAAIFAPLCVRVICCAYE
Ga0335085_1190029713300032770SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSVNLDGYRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPSGVELTNILLAGIFAPLFVRIICCAYE
Ga0335078_1086695413300032805SoilPGIWGTNMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWSSVNLDGYRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPSGVVLTNILLAGICAPLFVRIICCAYE
Ga0335081_1055391823300032892SoilMPQIIITIVFGFLWWLVFNHVGAGLTYLMILGSVLAVCVCCCKGGEPQWGWWSVNLDGYRACIGRCFWPTLALIIALVVIGILVVLYTTHVFPSGVVLTNILLAGICAPLFVRIICCAYE
Ga0310914_1135908613300033289SoilMPQIIITILAGFLWWFIFHQVGPGLTYLMILGSVLAVCVCCCKGGEQREWGWWSVNWDGYRACIARCFWPTLALVIALIVIGILVVLYTTHAFPSSVELRN
Ga0326726_1001947533300033433Peat SoilMVIVFQIVITVVFGLLWWLVYNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMVSLAIIGILAVIWATGLPSLGAVFVNILIAAIFAPLLVRLICCAYES
Ga0326723_0014394_2740_31503300034090Peat SoilMAVAAFARNIGRNSMVIVFQIVITVVFGLLWWLVYNRVGVDLHYIMILGSVLAVCVCCCKGTGDPQREWNWQRYIACIRRCLLATVVLMVSLAIIGILAVIWATGLPSLGAVFVNILIAAIFAPLLVRLICCAYES




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice