NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F033241

Metagenome / Metatranscriptome Family F033241

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F033241
Family Type Metagenome / Metatranscriptome
Number of Sequences 178
Average Sequence Length 66 residues
Representative Sequence VRDTGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDEGDHP
Number of Associated Samples 142
Number of Associated Scaffolds 178

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 78.65 %
% of genes near scaffold ends (potentially truncated) 16.85 %
% of genes from short scaffolds (< 2000 bps) 75.28 %
Associated GOLD sequencing projects 134
AlphaFold2 3D model prediction Yes
3D model pTM-score0.40

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (90.449 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(11.798 % of family members)
Environment Ontology (ENVO) Unclassified
(33.146 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(36.517 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical) Signal Peptide: No Secondary Structure distribution: α-helix: 47.42%    β-sheet: 0.00%    Coil/Unstructured: 52.58%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.40
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 178 Family Scaffolds
PF05378Hydant_A_N 18.54
PF13561adh_short_C2 7.87
PF08241Methyltransf_11 5.62
PF05685Uma2 4.49
PF03729DUF308 3.93
PF01266DAO 3.93
PF00378ECH_1 2.81
PF12725DUF3810 2.25
PF04909Amidohydro_2 1.69
PF12697Abhydrolase_6 1.12
PF00296Bac_luciferase 1.12
PF01494FAD_binding_3 1.12
PF12832MFS_1_like 1.12
PF00072Response_reg 1.12
PF05639Pup 0.56
PF01810LysE 0.56
PF00884Sulfatase 0.56
PF03972MmgE_PrpD 0.56
PF01934HepT-like 0.56
PF13450NAD_binding_8 0.56
PF01965DJ-1_PfpI 0.56
PF00701DHDPS 0.56
PF02300Fumarate_red_C 0.56
PF02776TPP_enzyme_N 0.56
PF13289SIR2_2 0.56
PF12706Lactamase_B_2 0.56
PF01471PG_binding_1 0.56
PF05992SbmA_BacA 0.56
PF14384BrnA_antitoxin 0.56
PF04972BON 0.56
PF00535Glycos_transf_2 0.56
PF069833-dmu-9_3-mt 0.56
PF12680SnoaL_2 0.56
PF13350Y_phosphatase3 0.56
PF02538Hydantoinase_B 0.56
PF01520Amidase_3 0.56
PF09992NAGPA 0.56
PF13847Methyltransf_31 0.56
PF01063Aminotran_4 0.56
PF13191AAA_16 0.56
PF00106adh_short 0.56

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 178 Family Scaffolds
COG0145N-methylhydantoinase A/oxoprolinase/acetone carboxylase, beta subunitAmino acid transport and metabolism [E] 37.08
COG4636Endonuclease, Uma2 family (restriction endonuclease fold)General function prediction only [R] 4.49
COG3247Acid resistance membrane protein HdeD, DUF308 familyGeneral function prediction only [R] 3.93
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 2.25
COG0146N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunitAmino acid transport and metabolism [E] 1.12
COG03294-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyaseCell wall/membrane/envelope biogenesis [M] 1.12
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 1.12
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 1.12
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 1.12
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 1.12
COG0115Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyaseAmino acid transport and metabolism [E] 1.12
COG0860N-acetylmuramoyl-L-alanine amidaseCell wall/membrane/envelope biogenesis [M] 0.56
COG20792-methylcitrate dehydratase PrpDCarbohydrate transport and metabolism [G] 0.56
COG2361HEPN domain protein, predicted toxin of MNT-HEPN systemDefense mechanisms [V] 0.56
COG2445Uncharacterized HEPN domain protein YutE, UPF0331/DUF86 familyGeneral function prediction only [R] 0.56
COG2764Zn-dependent glyoxalase, PhnB familyEnergy production and conversion [C] 0.56
COG3029Fumarate reductase subunit CEnergy production and conversion [C] 0.56
COG3865Glyoxalase superfamily enzyme, possible 3-demethylubiquinone-9 3-methyltransferaseGeneral function prediction only [R] 0.56


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms90.45 %
UnclassifiedrootN/A9.55 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2189573004|GZGWRS401A4X45All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium505Open in IMG/M
3300000443|F12B_10458436All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1431Open in IMG/M
3300000550|F24TB_10837885All Organisms → cellular organisms → Bacteria → Proteobacteria2565Open in IMG/M
3300000559|F14TC_100933217All Organisms → cellular organisms → Bacteria2922Open in IMG/M
3300000891|JGI10214J12806_12327095All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium520Open in IMG/M
3300000956|JGI10216J12902_105334560All Organisms → cellular organisms → Bacteria3264Open in IMG/M
3300001431|F14TB_100364804All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1450Open in IMG/M
3300004013|Ga0055465_10307962All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium546Open in IMG/M
3300004047|Ga0055499_10006438All Organisms → cellular organisms → Bacteria1235Open in IMG/M
3300004052|Ga0055490_10255458All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium541Open in IMG/M
3300004062|Ga0055500_10040611All Organisms → cellular organisms → Bacteria905Open in IMG/M
3300004114|Ga0062593_100305834All Organisms → cellular organisms → Bacteria → Proteobacteria1358Open in IMG/M
3300005295|Ga0065707_11092604Not Available517Open in IMG/M
3300005341|Ga0070691_10019599All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium3123Open in IMG/M
3300005341|Ga0070691_10135608All Organisms → cellular organisms → Bacteria1250Open in IMG/M
3300005444|Ga0070694_100677657All Organisms → cellular organisms → Bacteria837Open in IMG/M
3300005445|Ga0070708_100005910All Organisms → cellular organisms → Bacteria9726Open in IMG/M
3300005445|Ga0070708_100444196All Organisms → cellular organisms → Bacteria1223Open in IMG/M
3300005457|Ga0070662_100889149All Organisms → cellular organisms → Bacteria760Open in IMG/M
3300005459|Ga0068867_100052947All Organisms → cellular organisms → Bacteria → Proteobacteria2996Open in IMG/M
3300005468|Ga0070707_100039420All Organisms → cellular organisms → Bacteria → Proteobacteria4515Open in IMG/M
3300005468|Ga0070707_100794368All Organisms → cellular organisms → Bacteria910Open in IMG/M
3300005545|Ga0070695_100040241All Organisms → cellular organisms → Bacteria2958Open in IMG/M
3300005545|Ga0070695_101134924All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales641Open in IMG/M
3300005546|Ga0070696_100448426All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1018Open in IMG/M
3300005549|Ga0070704_100391208All Organisms → cellular organisms → Bacteria1184Open in IMG/M
3300005616|Ga0068852_100842921All Organisms → cellular organisms → Bacteria932Open in IMG/M
3300005713|Ga0066905_101851657All Organisms → cellular organisms → Bacteria557Open in IMG/M
3300005718|Ga0068866_10639148Not Available723Open in IMG/M
3300005890|Ga0075285_1051603All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium553Open in IMG/M
3300005937|Ga0081455_10000080All Organisms → cellular organisms → Bacteria104190Open in IMG/M
3300005937|Ga0081455_10061159All Organisms → cellular organisms → Bacteria3170Open in IMG/M
3300006041|Ga0075023_100013442All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2163Open in IMG/M
3300006163|Ga0070715_10341572All Organisms → cellular organisms → Bacteria815Open in IMG/M
3300006806|Ga0079220_10632440All Organisms → cellular organisms → Bacteria769Open in IMG/M
3300006847|Ga0075431_100439430All Organisms → cellular organisms → Bacteria1302Open in IMG/M
3300007258|Ga0099793_10338804All Organisms → cellular organisms → Bacteria734Open in IMG/M
3300007265|Ga0099794_10022614All Organisms → cellular organisms → Bacteria → Proteobacteria2868Open in IMG/M
3300009053|Ga0105095_10389851All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium769Open in IMG/M
3300009078|Ga0105106_11330215All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium510Open in IMG/M
3300009088|Ga0099830_10068217All Organisms → cellular organisms → Bacteria2570Open in IMG/M
3300009088|Ga0099830_10241897All Organisms → cellular organisms → Bacteria → Proteobacteria1425Open in IMG/M
3300009089|Ga0099828_10672946All Organisms → cellular organisms → Bacteria930Open in IMG/M
3300009089|Ga0099828_11610104Not Available572Open in IMG/M
3300009090|Ga0099827_10676654All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. CCGUVB23891Open in IMG/M
3300009098|Ga0105245_11983043All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium635Open in IMG/M
3300009147|Ga0114129_10012145All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria12262Open in IMG/M
3300009157|Ga0105092_10523091All Organisms → cellular organisms → Bacteria681Open in IMG/M
3300009171|Ga0105101_10542958All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium573Open in IMG/M
3300009174|Ga0105241_11489167All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium651Open in IMG/M
3300009806|Ga0105081_1008005All Organisms → cellular organisms → Bacteria1119Open in IMG/M
3300010047|Ga0126382_10124470All Organisms → cellular organisms → Bacteria1717Open in IMG/M
3300010362|Ga0126377_13122190All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium535Open in IMG/M
3300010371|Ga0134125_10409489All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1504Open in IMG/M
3300010400|Ga0134122_10006211All Organisms → cellular organisms → Bacteria8867Open in IMG/M
3300010400|Ga0134122_10480244Not Available1119Open in IMG/M
3300010400|Ga0134122_10943636All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. CCGUVB23839Open in IMG/M
3300010400|Ga0134122_12455346All Organisms → cellular organisms → Bacteria569Open in IMG/M
3300011269|Ga0137392_10006707All Organisms → cellular organisms → Bacteria → Proteobacteria7381Open in IMG/M
3300011269|Ga0137392_10138018All Organisms → cellular organisms → Bacteria1959Open in IMG/M
3300011438|Ga0137451_1123052All Organisms → cellular organisms → Bacteria802Open in IMG/M
3300012133|Ga0137329_1034645All Organisms → cellular organisms → Bacteria643Open in IMG/M
3300012202|Ga0137363_10100242All Organisms → cellular organisms → Bacteria2199Open in IMG/M
3300012202|Ga0137363_11615695All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium540Open in IMG/M
3300012203|Ga0137399_10680902Not Available865Open in IMG/M
3300012207|Ga0137381_11021825All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium712Open in IMG/M
3300012355|Ga0137369_11015024All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium549Open in IMG/M
3300012357|Ga0137384_10362601All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1200Open in IMG/M
3300012361|Ga0137360_10669891All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria890Open in IMG/M
3300012898|Ga0157293_10115271All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300012901|Ga0157288_10413901All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium508Open in IMG/M
3300012910|Ga0157308_10046070All Organisms → cellular organisms → Bacteria1114Open in IMG/M
3300012912|Ga0157306_10070844Not Available936Open in IMG/M
3300012927|Ga0137416_10202972All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1592Open in IMG/M
3300012929|Ga0137404_10843043Not Available834Open in IMG/M
3300012931|Ga0153915_10257231All Organisms → cellular organisms → Bacteria1937Open in IMG/M
3300012931|Ga0153915_11432756All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium808Open in IMG/M
3300013102|Ga0157371_10101754All Organisms → cellular organisms → Bacteria2038Open in IMG/M
3300013307|Ga0157372_10100117All Organisms → cellular organisms → Bacteria3307Open in IMG/M
3300014318|Ga0075351_1015165All Organisms → cellular organisms → Bacteria1147Open in IMG/M
3300014326|Ga0157380_10923644Not Available900Open in IMG/M
3300015200|Ga0173480_10405872All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Rhodospirillaceae → unclassified Rhodospirillaceae → Rhodospirillaceae bacterium792Open in IMG/M
3300015371|Ga0132258_11812964All Organisms → cellular organisms → Bacteria1537Open in IMG/M
3300017927|Ga0187824_10026154Not Available1739Open in IMG/M
3300017966|Ga0187776_10003142All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales8439Open in IMG/M
3300017997|Ga0184610_1017926All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1871Open in IMG/M
3300017997|Ga0184610_1019720Not Available1802Open in IMG/M
3300017997|Ga0184610_1027081All Organisms → cellular organisms → Bacteria1584Open in IMG/M
3300017997|Ga0184610_1054461All Organisms → cellular organisms → Bacteria1191Open in IMG/M
3300018027|Ga0184605_10132541All Organisms → cellular organisms → Bacteria → Proteobacteria1114Open in IMG/M
3300018028|Ga0184608_10358282All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. CCGUVB23639Open in IMG/M
3300018031|Ga0184634_10298372All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium740Open in IMG/M
3300018052|Ga0184638_1027390All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria2043Open in IMG/M
3300018052|Ga0184638_1034674All Organisms → cellular organisms → Bacteria1825Open in IMG/M
3300018052|Ga0184638_1128280All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Planctomycetales → Planctomycetaceae → unclassified Planctomycetaceae → Planctomycetaceae bacterium924Open in IMG/M
3300018053|Ga0184626_10022750All Organisms → cellular organisms → Bacteria2554Open in IMG/M
3300018053|Ga0184626_10048029All Organisms → cellular organisms → Bacteria1784Open in IMG/M
3300018053|Ga0184626_10400850All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium548Open in IMG/M
3300018056|Ga0184623_10099925All Organisms → cellular organisms → Bacteria1340Open in IMG/M
3300018056|Ga0184623_10430355All Organisms → cellular organisms → Bacteria575Open in IMG/M
3300018059|Ga0184615_10319065All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium864Open in IMG/M
3300018063|Ga0184637_10136508All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → unclassified Gemmatimonadales → Gemmatimonadales bacterium1513Open in IMG/M
3300018074|Ga0184640_10257826All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria791Open in IMG/M
3300018075|Ga0184632_10243629All Organisms → cellular organisms → Bacteria786Open in IMG/M
3300018076|Ga0184609_10141436All Organisms → cellular organisms → Bacteria → Proteobacteria1104Open in IMG/M
3300018422|Ga0190265_10856388All Organisms → cellular organisms → Bacteria1032Open in IMG/M
3300018422|Ga0190265_13091476All Organisms → cellular organisms → Bacteria556Open in IMG/M
3300018429|Ga0190272_10046677All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2476Open in IMG/M
3300018429|Ga0190272_10123760All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1720Open in IMG/M
3300018429|Ga0190272_10132130All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1680Open in IMG/M
3300019458|Ga0187892_10016261All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales7899Open in IMG/M
3300019487|Ga0187893_10044668All Organisms → cellular organisms → Bacteria → Proteobacteria4631Open in IMG/M
3300019879|Ga0193723_1020475All Organisms → cellular organisms → Bacteria2030Open in IMG/M
3300019889|Ga0193743_1018888All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria3606Open in IMG/M
3300021073|Ga0210378_10039494All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1873Open in IMG/M
3300021086|Ga0179596_10015501All Organisms → cellular organisms → Bacteria2637Open in IMG/M
3300021090|Ga0210377_10060413All Organisms → cellular organisms → Bacteria2599Open in IMG/M
3300021170|Ga0210400_10171993All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1752Open in IMG/M
3300021344|Ga0193719_10405850All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. CCGUVB23560Open in IMG/M
3300022195|Ga0222625_1577662All Organisms → cellular organisms → Bacteria → Acidobacteria875Open in IMG/M
3300022756|Ga0222622_10688246Not Available743Open in IMG/M
3300025885|Ga0207653_10424663All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium521Open in IMG/M
3300025922|Ga0207646_10325258All Organisms → cellular organisms → Bacteria1389Open in IMG/M
3300025922|Ga0207646_11081028All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium706Open in IMG/M
3300025927|Ga0207687_11299835All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium625Open in IMG/M
3300025933|Ga0207706_10828301All Organisms → cellular organisms → Bacteria785Open in IMG/M
3300025954|Ga0210135_1026312Not Available612Open in IMG/M
3300025971|Ga0210102_1128900All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium544Open in IMG/M
3300026089|Ga0207648_10136398All Organisms → cellular organisms → Bacteria2161Open in IMG/M
3300026142|Ga0207698_11802416All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium627Open in IMG/M
3300026351|Ga0257170_1008360All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1257Open in IMG/M
3300026377|Ga0257171_1022273All Organisms → cellular organisms → Bacteria1072Open in IMG/M
3300026515|Ga0257158_1107358All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium554Open in IMG/M
3300027122|Ga0207538_1001591Not Available1143Open in IMG/M
3300027378|Ga0209981_1014363Not Available1104Open in IMG/M
3300027462|Ga0210000_1064341All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium595Open in IMG/M
3300027614|Ga0209970_1003780All Organisms → cellular organisms → Bacteria2532Open in IMG/M
3300027651|Ga0209217_1133885All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium CG07_land_8_20_14_0_80_59_28693Open in IMG/M
3300027665|Ga0209983_1003844All Organisms → cellular organisms → Bacteria3182Open in IMG/M
3300027717|Ga0209998_10000481All Organisms → cellular organisms → Bacteria11005Open in IMG/M
3300027815|Ga0209726_10069387All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria2297Open in IMG/M
3300027846|Ga0209180_10008696All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria5204Open in IMG/M
3300027894|Ga0209068_10041826All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2305Open in IMG/M
3300028536|Ga0137415_10016483All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria7324Open in IMG/M
3300028793|Ga0307299_10188969All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. CCGUVB23775Open in IMG/M
3300028803|Ga0307281_10013929All Organisms → cellular organisms → Bacteria2256Open in IMG/M
3300028828|Ga0307312_10091569All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1874Open in IMG/M
3300030619|Ga0268386_10025065All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales4709Open in IMG/M
(restricted) 3300031150|Ga0255311_1010942Not Available1825Open in IMG/M
(restricted) 3300031150|Ga0255311_1029574All Organisms → cellular organisms → Bacteria → Proteobacteria1141Open in IMG/M
(restricted) 3300031150|Ga0255311_1097557All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria635Open in IMG/M
(restricted) 3300031197|Ga0255310_10072656All Organisms → cellular organisms → Bacteria → Proteobacteria909Open in IMG/M
(restricted) 3300031197|Ga0255310_10089196All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium823Open in IMG/M
(restricted) 3300031197|Ga0255310_10091206All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium814Open in IMG/M
(restricted) 3300031197|Ga0255310_10108885All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium748Open in IMG/M
3300031229|Ga0299913_10372249All Organisms → cellular organisms → Bacteria1416Open in IMG/M
(restricted) 3300031248|Ga0255312_1061280All Organisms → cellular organisms → Bacteria904Open in IMG/M
3300031455|Ga0307505_10122565All Organisms → cellular organisms → Bacteria → Proteobacteria1177Open in IMG/M
3300031720|Ga0307469_10220927All Organisms → cellular organisms → Bacteria → Proteobacteria1498Open in IMG/M
3300031720|Ga0307469_10462240All Organisms → cellular organisms → Bacteria1102Open in IMG/M
3300031720|Ga0307469_12162908All Organisms → cellular organisms → Bacteria541Open in IMG/M
3300031908|Ga0310900_11198895All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium631Open in IMG/M
3300031949|Ga0214473_10533030All Organisms → cellular organisms → Bacteria → Proteobacteria1305Open in IMG/M
3300032174|Ga0307470_10017145All Organisms → cellular organisms → Bacteria3171Open in IMG/M
3300032174|Ga0307470_10698727All Organisms → cellular organisms → Bacteria773Open in IMG/M
3300032770|Ga0335085_10041754All Organisms → cellular organisms → Bacteria6282Open in IMG/M
3300033412|Ga0310810_10007291All Organisms → cellular organisms → Bacteria12599Open in IMG/M
3300033432|Ga0326729_1028816All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium885Open in IMG/M
3300033433|Ga0326726_10005436All Organisms → cellular organisms → Bacteria11536Open in IMG/M
3300033433|Ga0326726_10541904All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium CG07_land_8_20_14_0_80_59_281115Open in IMG/M
3300033433|Ga0326726_11738013All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium607Open in IMG/M
3300033485|Ga0316626_11059704All Organisms → cellular organisms → Bacteria721Open in IMG/M
3300033500|Ga0326730_1010173All Organisms → cellular organisms → Bacteria2077Open in IMG/M
3300033502|Ga0326731_1020608Not Available1608Open in IMG/M
3300033513|Ga0316628_100327305All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1918Open in IMG/M
3300033513|Ga0316628_101488210All Organisms → cellular organisms → Bacteria902Open in IMG/M
3300033813|Ga0364928_0086385Not Available727Open in IMG/M
3300034257|Ga0370495_0306119All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium528Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil11.80%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment11.24%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil10.67%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.43%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil4.49%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.37%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil3.37%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment2.25%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.25%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil2.25%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil2.25%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.81%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.81%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.81%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.69%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.69%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.12%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.12%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.12%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.12%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.12%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.12%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.12%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere1.12%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere1.12%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.12%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere1.12%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.12%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere1.12%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.56%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.56%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.56%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil0.56%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.56%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil0.56%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.56%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.56%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil0.56%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.56%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.56%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.56%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand0.56%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.56%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze0.56%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.56%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.56%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.56%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.56%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2189573004Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen)EnvironmentalOpen in IMG/M
3300000443Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemlyEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300001431Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.4TB clc assemlyEnvironmentalOpen in IMG/M
3300004013Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_CattailA_D2EnvironmentalOpen in IMG/M
3300004047Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300004052Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005457Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300005616Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2Host-AssociatedOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005718Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2Host-AssociatedOpen in IMG/M
3300005890Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006041Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009053Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009078Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009171Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015EnvironmentalOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009806Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011438Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2EnvironmentalOpen in IMG/M
3300012133Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT121_2EnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012898Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S194-509B-1EnvironmentalOpen in IMG/M
3300012901Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S119-311C-1EnvironmentalOpen in IMG/M
3300012910Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S198-509B-2EnvironmentalOpen in IMG/M
3300012912Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S163-409C-2EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300013102Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C4-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300014318Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rdEnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300015200Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017927Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4EnvironmentalOpen in IMG/M
3300017966Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025954Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025971Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026142Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300027122Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A2-11 (SPAdes)EnvironmentalOpen in IMG/M
3300027378Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027462Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027614Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027651Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027665Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027717Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033485Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D5_AEnvironmentalOpen in IMG/M
3300033500Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF7AN SIP fractionEnvironmentalOpen in IMG/M
3300033502Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fractionEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
FG2_082982602189573004Grass SoilVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLALVGHRRDARDGDHDE
F12B_1045843623300000443SoilMRDVWLVGVGVVLGWLASKTWSDPGCHPXXXXXXXXXXXXXLGFALAVVGHGRNPRTEDPDEGDREDHP*
F24TB_1083788543300000550SoilMRDVWLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRTEDPDEGDREDHP*
F14TC_10093321743300000559SoilMRDVWLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRHPRTEDPDEGDREDQP*
JGI10214J12806_1232709523300000891SoilVRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLVAVVGHGRDPDPGDRDEDDEGNPR*
JGI10216J12902_10533456013300000956SoilMRDAGLVGVGIVLGWLASKTWTDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRAEDPDEGDREDHP*
F14TB_10036480423300001431SoilMRDVWLVGVGVVLVWLASKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRHPRTEDPDEGDREDQP*
Ga0055465_1030796213300004013Natural And Restored WetlandsMRNAGLVGVGMVLGWLASKTWSDPGCQLGLVVFLVATAAMALGFLLAVVGHGKDPGAGDPDEDDEGDDR*
Ga0055499_1000643833300004047Natural And Restored WetlandsVRDTGLLGVGVVLGWLASKTWSDPGCQLGLVVFLIVTAVMTLGFLLAVVGHGRDSVAGDPDDDDESHQP*
Ga0055490_1025545823300004052Natural And Restored WetlandsVRNTGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEP*
Ga0055500_1004061123300004062Natural And Restored WetlandsMRDAGFVGVGAVLGWLASKAWSDPGCQLGLVVFLIATAVMALGCLLAVVGHGRDPGAGDPDDDDKREHS*
Ga0062593_10030583423300004114SoilVRDTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHQP*
Ga0065707_1109260413300005295Switchgrass RhizosphereVRDTGLLGVGVVLGWLASKAWSDPDCQLGLVVFLIAAAVMSLGFLLAVVGHGRDPGAGDPDDEDDRHQH*
Ga0070691_1001959963300005341Corn, Switchgrass And Miscanthus RhizosphereMRDGGLVGAGIVLGWLVSKTWSDPGCQAGLVIFLVATAVMVLGFVFAVVGHGRGAGGEDRDEDDEE*
Ga0070691_1013560833300005341Corn, Switchgrass And Miscanthus RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLALVGHGRDVRAGDDDEDDEGDRP*
Ga0070694_10067765713300005444Corn, Switchgrass And Miscanthus RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDDDEDDEGDRP*
Ga0070708_10000591073300005445Corn, Switchgrass And Miscanthus RhizosphereVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDEGDRS*
Ga0070708_10044419613300005445Corn, Switchgrass And Miscanthus RhizosphereMRDAGFVGVGVVLGWLASKTWSDPGCQLGLVVFSVATAVMTRGFLLAVVGHGRDPDAGDPDKDDKGIIPEAESRRCATRA*
Ga0070662_10088914923300005457Corn RhizosphereRAVRASGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLAVVGHGRDARAGDDDEDDEGDRP*
Ga0068867_10005294723300005459Miscanthus RhizosphereVRDTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPGAGDPDDEDDRHQP*
Ga0070707_10003942023300005468Corn, Switchgrass And Miscanthus RhizosphereMRDGGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGAGDPDEDDKEDHS*
Ga0070707_10079436813300005468Corn, Switchgrass And Miscanthus RhizosphereVWFASIGFVLGWLISKTWQDPRCQIGLVVFLVATAALLLAFVVAVVGHGRGREEAEPDDTDEEGRR*
Ga0070695_10004024153300005545Corn, Switchgrass And Miscanthus RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDHDEDDEGDRP*
Ga0070695_10113492423300005545Corn, Switchgrass And Miscanthus RhizosphereVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLVAVVGHGRDPDPGDRDEDDEGNPR*
Ga0070696_10044842623300005546Corn, Switchgrass And Miscanthus RhizosphereVRESGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAVMTLGFLFAVVGHGRDPRAGDPDDEDEGDHS*
Ga0070704_10039120823300005549Corn, Switchgrass And Miscanthus RhizosphereMRDAGLVGVGVVLGWLASKTWSDPACQLGLVVFLIAAAVMTLGLLVAVVGYGRTPDPGGRDEDDEGDDHGSPRA*
Ga0068852_10084292133300005616Corn RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDARAGDDDEDDEGDRP*
Ga0066905_10185165723300005713Tropical Forest SoilMRDVWLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAIMTLGFVLAVVGHGRNPRTEDPDEGDREDHP*
Ga0068866_1063914823300005718Miscanthus RhizosphereVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHQP*
Ga0075285_105160323300005890Rice Paddy SoilMRDGGLVGAGIVLGWLVSKTWSDPGCQAGLVVFLEATAVMVLGFVFAVVGHGRGAGGEDRDEDDEE*
Ga0081455_1000008033300005937Tabebuia Heterophylla RhizosphereVRDSGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALGFLLAVVGHGRDARADDHDEDDEGDRP*
Ga0081455_1006115933300005937Tabebuia Heterophylla RhizosphereMRDVWLVGVGVVLGWLASKTWTDPGCQLGLVVFLVATALMTLGFALAVVGHGRNPRTGDPDEGDREDHP*
Ga0075023_10001344223300006041WatershedsVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPQAGDPDDDDESDRS*
Ga0070715_1034157213300006163Corn, Switchgrass And Miscanthus RhizosphereVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDARGDDHDE
Ga0079220_1063244013300006806Agricultural SoilMRDGGLVGAGIVLGWLVSKTWSDPGCQLGLVVFLVATAVMVLGFVLAVVGHGRGSAEADREDDDEDE*
Ga0075431_10043943023300006847Populus RhizosphereVRNSGLVGIGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGALLAVVGHGRDPDPRDRDEDDEGDHR*
Ga0099793_1033880433300007258Vadose Zone SoilVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPHAGDPDDDDES
Ga0099794_1002261423300007265Vadose Zone SoilVRESGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPHAGDPDDDDESDRP*
Ga0105095_1038985123300009053Freshwater SedimentVRNTGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEQ*
Ga0105106_1133021513300009078Freshwater SedimentVRDTGLLGVGVVLGWLASKTWSDPGCQLGLVVFLIATAVMTLGFLLAVVGHGRDPGAGDPDDDDESHQP*
Ga0099830_1006821723300009088Vadose Zone SoilVRESGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDESDRP*
Ga0099830_1024189723300009088Vadose Zone SoilMRDAGLVGVGVVLGWLGSKIWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPRAGDSDEDDEGDHP*
Ga0099828_1067294613300009089Vadose Zone SoilMRDVGLVAVGIVLGWLVSKTWSDSGCQLGLVVFLVATSVMTLGFALAVFGHGRNPRGVGPR*
Ga0099828_1161010423300009089Vadose Zone SoilWLGSKIWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDKGDHP*
Ga0099827_1067665423300009090Vadose Zone SoilMRDAGFVGIGVVLGWLASKLWSDPGCQLGLVVFLVATAVMVLGFVLAVVGHGRDPDPGAGDPDEDDK*
Ga0105245_1198304323300009098Miscanthus RhizosphereVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHQP*
Ga0114129_1001214583300009147Populus RhizosphereMSAAGFVGVGAVLGWLASKLWSDPGCQLGLVVFLIATAVMVLGFVLAVVGHGRDPDPGAGDPDEDDK*
Ga0105092_1052309133300009157Freshwater SedimentVGVVIGWLASKTWSDPGCQLGLAVFLIASAVMTLGVRGAVVGHGREPVAGDPGEDDEDDRPGSPRS*
Ga0105101_1054295813300009171Freshwater SedimentVRNTGFIGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEP*
Ga0105241_1148916713300009174Corn RhizosphereVRDTGLLGVGVVLGWLASKTWSDPDCQLGLVVFLIAAAVMTLGFLLAVVGHGRDPGAGDP
Ga0105081_100800523300009806Groundwater SandMRDAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVTALMTLGLLLAVVGHGRDPGAGDPDDDDKEDHS*
Ga0126382_1012447023300010047Tropical Forest SoilMEGGSRMRDVWLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAIMTLGFVLAVVGHGRNPRTEDPDEGDREDHP*
Ga0126377_1312219013300010362Tropical Forest SoilMRDAGLVGVGIVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRGPLAGGQDDDDGEDRP*
Ga0134125_1040948913300010371Terrestrial SoilVRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVSAVMALGFLFAVVGHGRDSGAGDPDDDDEGGRR*
Ga0134122_1000621123300010400Terrestrial SoilVRNTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHQP*
Ga0134122_1048024423300010400Terrestrial SoilVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGCLLAVVGHGRDPESGDPDDDDPRA*
Ga0134122_1094363633300010400Terrestrial SoilVVLGWLASKTWSDPGCQLGLVVFLVVSAVMALGFLFAVVGHGRDPRAGDPNDDDEGDHS*
Ga0134122_1245534623300010400Terrestrial SoilLASKTWSDPECQLGLVVFLIAAAVMTLGFLLAVVGHGRDPGAVDPDDDDESHQP*
Ga0137392_1000670763300011269Vadose Zone SoilVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDESDRP*
Ga0137392_1013801833300011269Vadose Zone SoilMRDAGLVGVGVVLGWLGSKIWSDPGCQLGLVVFLVATAVLTLGFLLAVVGHGRDPRAGDSDEDDEGDHP*
Ga0137451_112305213300011438SoilMRDAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLLAVVGHGRDPRAGDS
Ga0137329_103464523300012133SoilVRDTGLVGVGVVLGWLASKTCSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGDPDEDDEGDHP*
Ga0137363_1010024233300012202Vadose Zone SoilVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDEGDRS*
Ga0137363_1161569513300012202Vadose Zone SoilSGLVGVGVVLGWLASKTWSDPGCQLGLVVLLVATAVMTLGFSLALVGHRRDARDGDHDEDDEGDGP*
Ga0137399_1068090213300012203Vadose Zone SoilVGVVLGWLASKAWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDESDRP*
Ga0137381_1102182523300012207Vadose Zone SoilVVLGWLASKTWSDPGCQLGLVVLLVATAVMTLGFPLALVGHRRDARDGDHDEDDEGDGP*
Ga0137369_1101502413300012355Vadose Zone SoilVRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLVAVVGHGRNPDPGDRDEDDEGDHR*
Ga0137384_1036260123300012357Vadose Zone SoilVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVLLVATAVMTLGFPLALVGHRRDARDGDHDEDDEGDGP*
Ga0137360_1066989113300012361Vadose Zone SoilGVVLGWLASKTWSDPGCQLGLVVLLVATAVMTLGFSLALVGHRRDARDGDHDEDDEGDGP
Ga0157293_1011527123300012898SoilVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRASDHDEDDEGDRP*
Ga0157288_1041390113300012901SoilVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDVRAGDDDE
Ga0157308_1004607033300012910SoilVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDHDEDDEGDRP*
Ga0157306_1007084413300012912SoilVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDVRAGDDDEDDEGDRP*
Ga0137416_1020297233300012927Vadose Zone SoilVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPHAGDPDDDDESDRP*
Ga0137404_1084304313300012929Vadose Zone SoilMRAAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVAAVMTLGFLLAVVGHRRAPDAGDPDDDDEGDHR*
Ga0153915_1025723123300012931Freshwater WetlandsMRDAGLVGIGIVLGWLTSKTWSDPGCQLGLAVFLVATAVMTLGFVLAVVGHGRDPGPGDPDDDDEEER*
Ga0153915_1143275623300012931Freshwater WetlandsMRDGGLVGIGVVLGWLVSKTWSDPGCQMGLVVFLVATAVMVLGFVFAVVGHGRDSGGGDREDDDDEE*
Ga0157371_1010175413300013102Corn RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDDDEDDEG
Ga0157372_1010011763300013307Corn RhizosphereVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHARDARAGDDHEDDEGDRP*
Ga0075351_101516523300014318Natural And Restored WetlandsMRDAGFVGVGAVLGWLASKAWSDPGCQLGLVVFLIATAVMALGCLLAVVGHGRDPGAGDPDDDDKRDHS*
Ga0157380_1092364413300014326Switchgrass RhizosphereGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNSHQP
Ga0173480_1040587233300015200SoilVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLAVVGHGRDARAGDDDEDDEGDRP*
Ga0132258_1181296423300015371Arabidopsis RhizosphereVGVGIVLGWLGSKTWSDPGCQLGLVVFLVATAVMVLGFVLAVVGHGRAPGVGGPGDDDDEEER*
Ga0187824_1002615443300017927Freshwater SedimentMRDGGLVGAGIVLGWLVSKTWSDPGCQLGLVVFLVATAVMVLGFVLAVVGHGRGSAEADREDDDEDE
Ga0187776_1000314253300017966Tropical PeatlandMRDGGLVGVGVILGWLASKTWSDPGCRLGLAIFLVATAAMTLGFALAVVGHGRGPREPDRDDDDDRE
Ga0184610_101792623300017997Groundwater SedimentMRDVGLVGVGIVLGWLVSKTWSDPACQLGLVVFLVATAVMTLGFALAVVGHGRNPGAGDPDEGDD
Ga0184610_101972023300017997Groundwater SedimentMRDVGLVGVGTVLGWLVSKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRAGDPDEGDHGDYP
Ga0184610_102708123300017997Groundwater SedimentMRGAGFVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHRRDPGDPDEDDEGDHP
Ga0184610_105446113300017997Groundwater SedimentMRDVGLVGIGIVIGWLVSKTWSDPGCQMGLVVFLIATAVMTLGFALAVVGHGRTPGAGGPDAGDD
Ga0184605_1013254133300018027Groundwater SedimentMMAAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVIGHRRDPDVGDDDDEGDHR
Ga0184608_1035828223300018028Groundwater SedimentMRGAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVTAVMTLGFLLAVVGHRREQPGAGDPDEDDESDHR
Ga0184634_1029837213300018031Groundwater SedimentMRDVGLVGVGIVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRAGDPDNDDHGDNP
Ga0184638_102739043300018052Groundwater SedimentMRDVGLVGVGIVLGWLVNKTWSDPECQLGLVVFLVATAVMTLGFALTVVGHGRNPRAGDSDEGDHGDHP
Ga0184638_103467413300018052Groundwater SedimentMRDVGLVGIGVVIGWLVSKTWSDPGCQLGLVVFLIATAVMTLGFALAVVGHGRNPGAGGPDEGDD
Ga0184638_112828013300018052Groundwater SedimentMRDVGLVGVGIVLGWLVSKTWSDPGCQLGLVVFLIATAVMTLGFALAVVGHGRNPRAGGPDEGDD
Ga0184626_1002275023300018053Groundwater SedimentMRDVGLVGIGIVIGWLVSKTWSDPGCQLGLVVFLVATALMTLGFALAVVGHGRNPSGGDPDEDDKGDHS
Ga0184626_1004802923300018053Groundwater SedimentMRGAGFVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDKGDHP
Ga0184626_1040085013300018053Groundwater SedimentMRDMGLVGVGIVLGWLVSKTWNDPGCQLGLVVFLVATAIMTLGFALAFVAHGGNPRAGDPDEGDHGDHP
Ga0184623_1009992533300018056Groundwater SedimentMRAAGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVVTAVMTLGFLLAVVGHGRNPGAGDPDDDDKRDHS
Ga0184623_1043035523300018056Groundwater SedimentGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDEGDHR
Ga0184615_1031906523300018059Groundwater SedimentVRDTGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDEGD
Ga0184637_1013650833300018063Groundwater SedimentMRDVGLVVVGIVLGWLVSKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRAGDPDEGDHGDYP
Ga0184640_1025782623300018074Groundwater SedimentMRGAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLIDTALMTLGFLVAVVGHRRHPDAGDPDEDDKADHS
Ga0184632_1024362923300018075Groundwater SedimentMRNAGFVGVGVVLGWLASKIWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPRAGDSDEDDEGDHP
Ga0184609_1014143633300018076Groundwater SedimentMRGAGFVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGDPDEDDEGDHP
Ga0190265_1085638833300018422SoilVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLVAVVGHGREPDPGDRDEDDEGDHR
Ga0190265_1309147623300018422SoilVRDTGLLGVGVVIGWLASKTWSDPGCQLGLVVFLIASAVMTLGFLVAVVGYRRDPVAD
Ga0190272_1004667733300018429SoilMRDVGLVGVGIVLGWLVSKTWSDPACQLGLVVFLVATAVMTLGFALAVVGHGRNPDAGDPDEGDD
Ga0190272_1012376023300018429SoilVRSAGFVGVGVVLGWLASKTWSDPGCQLGLVVFLIAAAVMTLGFLLAVVGHGRDPDPADPDDDEGDHR
Ga0190272_1013213023300018429SoilMRDAGLVGVGVVLGWLASKTWSDPACQLGLVVFLIAAAVMTLGFLIAVVGHGRDPGPDDRDQDDEGDGPGTPRA
Ga0187892_1001626133300019458Bio-OozeMRDAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGAGDPDEDDKEDHS
Ga0187893_1004466823300019487Microbial Mat On RocksMRDAGLVGAGIVFGWLTSKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGRHPGDAGDSDEDKGDHS
Ga0193723_102047543300019879SoilMRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVAAAVMALGCLLAVVGHRREPDTGDPDEDDEGDRR
Ga0193743_101888823300019889SoilMILGWLASKTWSDPGCQLGLVVFLVATAAMALGFLLAVVGHGRDPGAGDPDEDDEGDHR
Ga0210378_1003949423300021073Groundwater SedimentMRNAGFVGVGVVLGWLASKTWSDPGCQLGLVVFLVAAAVMTLGFLIAVVGHGRDPDAGDRDEDDKGDHP
Ga0179596_1001550123300021086Vadose Zone SoilVRESGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGPGDPDDDDESDRP
Ga0210377_1006041333300021090Groundwater SedimentVRDTGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPGAGDPDEDDEGDHP
Ga0210400_1017199313300021170SoilVRESGLVGVGVILGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDARGDD
Ga0193719_1040585023300021344SoilVRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVAAVMTLGFLLAVVGHRRDPDAGDPDEDDEGHPR
Ga0222625_157766213300022195Groundwater SedimentMRNAGLVGVGVVLGWLASKTWSDPACQMGLVVFLIAAAVMTLGFLVAVVGYGRTPDPGGRDEDDEGDDHGSPRA
Ga0222622_1068824623300022756Groundwater SedimentLVLGWLASKTWSDPGCQLGLVVFLVVAAVMTLGFLLAVVGHRRDPDAGDPDEDDEGHPR
Ga0207653_1042466323300025885Corn, Switchgrass And Miscanthus RhizosphereVRDTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHQP
Ga0207646_1032525823300025922Corn, Switchgrass And Miscanthus RhizosphereMRDGGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGAGDPDEDDKEDHS
Ga0207646_1108102823300025922Corn, Switchgrass And Miscanthus RhizosphereMRDAGFVGVGVVLGWLASQTWSDPGCQLGLVVFSVATAVMTRGFLLAVVGHGRDPDAGDPDEDDKGIHS
Ga0207687_1129983513300025927Miscanthus RhizosphereVRDTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPDAGDPDDEDNGHPP
Ga0207706_1082830123300025933Corn RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDDDEDDEGDRP
Ga0210135_102631223300025954Natural And Restored WetlandsGVVLGWLASKTWSDPGCQLGLVVFLIVTAVMTLGFLLAVVGHGRDSVAGDPDDDDESHQP
Ga0210102_112890023300025971Natural And Restored WetlandsVRNTGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEP
Ga0207648_1013639823300026089Miscanthus RhizosphereVRDTGLLGVGVVLGWLASKTWSDPSCQLGLAVFLIAAAVMTLGFLLAVVGHGRDPGAGDPDDEDDRHQP
Ga0207698_1180241633300026142Corn RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDARAGDDDEDDEGDRP
Ga0257170_100836033300026351SoilVRGAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVTAVMTLGFLLAVVGHRRDPDAGDPDDDDEG
Ga0257171_102227323300026377SoilVGVVLGWLASKTWSDPGCQLGLVVFLIAAAVMTLGFLLAVVGHGRDPRAGDSDEDDEGDH
Ga0257158_110735823300026515SoilVRESGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPHADDP
Ga0207538_100159113300027122SoilVRASGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLAVVGHGRDVRAGDDDEDDEGDRP
Ga0209981_101436313300027378Arabidopsis Thaliana RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLALVGHGRDVRAGDHDEDDEGDRP
Ga0210000_106434113300027462Arabidopsis Thaliana RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLAVVGHGRDVRAGDHDEDDEGDRP
Ga0209970_100378023300027614Arabidopsis Thaliana RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFLVAVVGHGRDPDPGDRDEDDEGNPR
Ga0209217_113388533300027651Forest SoilVRESGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDARDG
Ga0209983_100384413300027665Arabidopsis Thaliana RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCRLGLVVFLVATAAMALAFLLALVGHGRDVRAGDHDEDDEGDRP
Ga0209998_1000048193300027717Arabidopsis Thaliana RhizosphereVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDVRAGDDDEDDEGDRP
Ga0209726_1006938733300027815GroundwaterVGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGDPDEDDEGDHP
Ga0209180_1000869663300027846Vadose Zone SoilMRDAGLVGVGVVLGWLGSKIWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPRAGDSDEDDEGDHP
Ga0209068_1004182623300027894WatershedsVRDSGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPQAGDPDDDDESDRS
Ga0137415_1001648373300028536Vadose Zone SoilVRDTGLLGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHGRDPHAGDPDDDDESDRP
Ga0307299_1018896923300028793SoilMRGAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFLLAVVGHRRDPDVGDDDDEGDHR
Ga0307281_1001392923300028803SoilMRDVGLVGVGIVLGWLVSKTWSDPGCQLGLVVFLIATAVMTLGFALAVVGHGRNPRAGDPDEGDD
Ga0307312_1009156913300028828SoilMRGAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLIAAAVMTLGLLVAVVGYGRTPDPGGRDEDDEG
Ga0268386_1002506533300030619SoilMRAAGLVGVGAVLGWLASKTWSDPGCQLGLVVFLIATAVMTLGVLLAVVGHGRDRDAGDRDEDDEGDHP
(restricted) Ga0255311_101094233300031150Sandy SoilVRNTGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEQ
(restricted) Ga0255311_102957423300031150Sandy SoilVRNAGFVGMGVVLGWLASKTWSDPGCQLGLVVFLIAAAVMTLGVLLAVVGHGRAPDAGDPDEGDEGDHPGSPRA
(restricted) Ga0255311_109755713300031150Sandy SoilMRDVGLVGAGIVFGWLTSKTWSDPGCQLGLVVFLVATAVMILGFALAVVGHGRHSGGGDDDDNGDHS
(restricted) Ga0255310_1007265623300031197Sandy SoilVRDTGLLGVGVVLGWLASKAWSDPDCQLGLVVFLIAAAVMTLGFLLAVVGHGRDPGAGDPDDEDDRHQP
(restricted) Ga0255310_1008919623300031197Sandy SoilMRDVGLVGAGIVFGWLTSKTWSDPGCQLGLVVFLIATAVMTLGFALTVVGHGRNPGAGDPDEGDD
(restricted) Ga0255310_1009120623300031197Sandy SoilVRDIGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGREPGGGDRDDDDGEE
(restricted) Ga0255310_1010888523300031197Sandy SoilMRDAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLIATAAMTLGFLLAVVGHGRGPGPGAGDPDEDDK
Ga0299913_1037224953300031229SoilMRAAGLVGVGAVLGWLASKTWSDPGCQLGLVVFLIAAAVMTLGFLLAVVGHGRDRDAGDRDED
(restricted) Ga0255312_106128013300031248Sandy SoilVGLVGAGIVFGWLTSKTWSDPGCQLGLVVFLIATAVMTLGFALTVVGHGRNPGAGDPDEGDD
Ga0307505_1012256523300031455SoilVRDTGLLGVGVVLGWLASKTWSDPDCQLGLVVVLIAAAVMTLGFLLAVVGHGRDPGAGDPDDEDDRHQP
Ga0307469_1022092723300031720Hardwood Forest SoilVRNTGLLGVGVVLGWLASKTWSDPSCQLGLVVFLIAAAVMTLGFLLAVVGHGRDPGAGDPDDEDDRHQP
Ga0307469_1046224023300031720Hardwood Forest SoilMRNAGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVVTAVMALGVLLAVVGHGRESDAGDPDDDDPRA
Ga0307469_1216290823300031720Hardwood Forest SoilMRDVWLVGVGVVLGWLASKTWTDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRTQDPDE
Ga0310900_1119889533300031908SoilVRASGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATAAMALAFLLAVVGHGRDARAGDHDEDDEDDRP
Ga0214473_1053303023300031949SoilMRDVGLVGVGIVLGWLVSKTWSDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRAGDPDEGDHGDHP
Ga0307470_1001714543300032174Hardwood Forest SoilVRDTGFLGVGVVLGWLASKTWSDPECQLGLVVFLIAAAVMTLGALLAVVGHGRDPVAGDPGEDDEDDRPGSPRA
Ga0307470_1069872713300032174Hardwood Forest SoilMRDVWLVGVGVVLGWLASKTWTDPGCQLGLVVFLVATAVMTLGFALAVVGHGRNPRTQDPDEGDREDHP
Ga0335085_1004175473300032770SoilMRDVGLVGIGIVLGWLVSKTWSDPGCQLGLVVFLVAAAVMTLGFALAVVGHGENPRTGDPDESDHEDHP
Ga0310810_1000729163300033412SoilMRDGGLVGAGIVLGWLGSKTWSDPGCQLGLVVFLVATAVMVLGFVLAVVGHGRASADRDREDDDQDE
Ga0326729_102881633300033432Peat SoilVRDIGFVGAGVVLGWLTSKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGRDPGGGDRDDDDGEE
Ga0326726_1000543653300033433Peat SoilMRDGGLVGIGIVLGWLASKSWSDPECRLGLAVFLVAAAAMTLVFALAVVGHGREPHGPDHDEDDDRD
Ga0326726_1054190433300033433Peat SoilVRDIGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGRDPGGGDRDDDDGEE
Ga0326726_1173801323300033433Peat SoilVRDIGFVGAGVVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFVFAVVGHGRDPGGGDRDDDDGEEC
Ga0316626_1105970413300033485SoilWLTSKTWSDPGCQLGLVVFLVATAVMALGFVFAVVGPRRDPRAGEPDGDDEDEQ
Ga0326730_101017323300033500Peat SoilVRDIGFVGAGMVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGRDPGVGGPDDDDEEER
Ga0326731_102060813300033502Peat SoilGFVGAGMVLGWLASKTWSDPGCQLGLVVFLVATAVMTLGFVLAVVGHGRDPGVGGPDDDDEEER
Ga0316628_10032730533300033513SoilMRDGGLVGIGVVLGWLVSKTWSDPGCQMGLVVFLVATAVMVLGFVFAVVGHGRDSGGGDREDDDDEE
Ga0316628_10148821023300033513SoilMRDGGLVGVGIVLGWLASKSWSDPECRLGLIVFLVATAAMTLVFALAVVGHGRGPRGPDRDEDDDRE
Ga0364928_0086385_382_5913300033813SedimentMRDGGLVGVGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGADDPDEDDKGDHS
Ga0370495_0306119_194_4183300034257Untreated Peat SoilVRDVGFLGIGVVLGWLASKTWSDPGCQLGLVVFLVATALMTLGFLLAVVGHGRDPGPSDPDADDEGDHPGSPRA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.