NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F043531

Overview

Basic Information
Family ID F043531
Family Type Metagenome / Metatranscriptome
Number of Sequences 156
Average Sequence Length 141 residues
Representative Sequence MTNIKTLFGLLAAVAAVFGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTAWTGELRLAPASA
Number of Associated Samples 124
Number of Associated Scaffolds 156

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 71.79 %
% of genes near scaffold ends (potentially truncated) 38.46 %
% of genes from short scaffolds (< 2000 bps) 80.77 %
Associated GOLD sequencing projects 118
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70
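
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is a share of the 156 family sequences reported under Basic Information; the helper name `percent_to_count` is hypothetical and not part of NMPFamsDB.

```python
# Sketch only: assumes every percentage on this page is a share of the
# 156 family sequences. `percent_to_count` is a hypothetical helper.
FAMILY_SIZE = 156

reported_percentages = {
    "valid RBS motif": 71.79,
    "near scaffold end": 38.46,
    "short scaffold (< 2000 bp)": 80.77,
    "assigned to Bacteria": 79.487,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Round a percentage of `total` to the nearest whole number of genes."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE}")
```

Under that assumption, roughly 126 of the 156 genes sit on scaffolds shorter than 2000 bp, which matches the number of scaffolds below that length in the Associated Scaffolds table.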


Most Common Taxonomy
Group Bacteria (79.487 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (15.385 % of family members)
Environment Ontology (ENVO) Unclassified (28.846 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (48.077 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary structure distribution: α-helix 6.90%, β-sheet 50.00%, coil/unstructured 43.10%

Predicted 3D Structure

pTM-score: 0.70

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
b.159.2.1: SO1590-like | d2q03a1 | 2q03 | 0.78
b.159.1.1: Allene oxide cyclase-like | d2q4ia_ | 2q4i | 0.74
f.4.1.1: OMPA-like | d1qjpa_ | 1qjp | 0.72
f.4.1.1: OMPA-like | d1p4ta_ | 1p4t | 0.70
b.159.2.1: SO1590-like | d2ooja1 | 2ooj | 0.70



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 156 Family Scaffolds
PF00072 | Response_reg | 9.62
PF00196 | GerE | 8.97
PF14026 | DUF4242 | 3.21
PF03704 | BTAD | 2.56
PF02728 | Cu_amine_oxidN3 | 1.92
PF01179 | Cu_amine_oxid | 1.28
PF02518 | HATPase_c | 1.28
PF04542 | Sigma70_r2 | 1.28
PF00440 | TetR_N | 1.28
PF08281 | Sigma70_r4_2 | 1.28
PF12802 | MarR_2 | 1.28
PF00561 | Abhydrolase_1 | 0.64
PF00325 | Crp | 0.64
PF02580 | Tyr_Deacylase | 0.64
PF01391 | Collagen | 0.64
PF01266 | DAO | 0.64
PF12697 | Abhydrolase_6 | 0.64
PF07730 | HisKA_3 | 0.64
PF02574 | S-methyl_trans | 0.64
PF00069 | Pkinase | 0.64
PF13649 | Methyltransf_25 | 0.64
PF13424 | TPR_12 | 0.64
PF13517 | FG-GAP_3 | 0.64
PF00392 | GntR | 0.64
PF02481 | DNA_processg_A | 0.64
PF06356 | DUF1064 | 0.64
PF09278 | MerR-DNA-bind | 0.64
PF01321 | Creatinase_N | 0.64
PF00730 | HhH-GPD | 0.64
PF12831 | FAD_oxidored | 0.64
PF13191 | AAA_16 | 0.64

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 156 Family Scaffolds
COG3733 | Cu2+-containing amine oxidase | Secondary metabolites biosynthesis, transport and catabolism [Q] | 3.21
COG0515 | Serine/threonine protein kinase | Signal transduction mechanisms [T] | 2.56
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 2.56
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 2.56
COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 1.28
COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 1.28
COG0758 | Predicted Rossmann fold nucleotide-binding protein DprA/Smf involved in DNA uptake | Replication, recombination and repair [L] | 1.28
COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 1.28
COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 1.28
COG1490 | D-aminoacyl-tRNA deacylase | Translation, ribosomal structure and biogenesis [J] | 0.64
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 0.64
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.64
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 0.64
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 0.64
COG2231 | 3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily | Replication, recombination and repair [L] | 0.64
COG2040 | Homocysteine/selenocysteine methylase (S-methylmethionine-dependent) | Amino acid transport and metabolism [E] | 0.64
COG0006 | Xaa-Pro aminopeptidase | Amino acid transport and metabolism [E] | 0.64
COG1194 | Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs | Replication, recombination and repair [L] | 0.64
COG1059 | Thermostable 8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 0.64
COG0789 | DNA-binding transcriptional regulator, MerR family | Transcription [K] | 0.64
COG0646 | Methionine synthase I (cobalamin-dependent), methyltransferase domain | Amino acid transport and metabolism [E] | 0.64
COG0177 | Endonuclease III | Replication, recombination and repair [L] | 0.64
COG0122 | 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 0.64



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 79.49 %
Unclassified | root | N/A | 20.51 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000364|INPhiseqgaiiFebDRAFT_104737047 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 549
3300000955|JGI1027J12803_100221824 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales | 1723
3300000956|JGI10216J12902_101462415 | Not Available | 641
3300000956|JGI10216J12902_107578316 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 670
3300000956|JGI10216J12902_109367376 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 907
3300000956|JGI10216J12902_110195551 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 832
3300000956|JGI10216J12902_114312941 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales | 1029
3300000956|JGI10216J12902_114768392 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 732
3300000956|JGI10216J12902_117437429 | Not Available | 752
3300002120|C687J26616_10006911 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 4611
3300002120|C687J26616_10206458 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 606
3300002407|C687J29651_10233925 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 585
3300002568|C688J35102_120989227 | All Organisms → cellular organisms → Bacteria | 10461
3300004114|Ga0062593_100293826 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1379
3300004156|Ga0062589_102634287 | Not Available | 522
3300004157|Ga0062590_100495127 | All Organisms → cellular organisms → Bacteria | 1039
3300004479|Ga0062595_100208015 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales | 1220
3300004643|Ga0062591_101976714 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 600
3300005093|Ga0062594_100999912 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 804
3300005181|Ga0066678_11097819 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 512
3300005332|Ga0066388_104384067 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 719
3300005332|Ga0066388_105237767 | Not Available | 658
3300005332|Ga0066388_106007358 | Not Available | 613
3300005435|Ga0070714_100926778 | Not Available | 846
3300005437|Ga0070710_10973346 | Not Available | 617
3300005451|Ga0066681_10507816 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 744
3300005535|Ga0070684_102106043 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 532
3300005546|Ga0070696_100842871 | Not Available | 757
3300005553|Ga0066695_10145672 | All Organisms → cellular organisms → Bacteria | 1473
3300005556|Ga0066707_10181345 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1350
3300005569|Ga0066705_10973839 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 503
3300005764|Ga0066903_100216386 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 2896
3300005764|Ga0066903_100861827 | Not Available | 1632
3300005764|Ga0066903_103658930 | Not Available | 827
3300005764|Ga0066903_105780727 | Not Available | 650
3300005764|Ga0066903_106384547 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 615
3300006046|Ga0066652_100113271 | All Organisms → cellular organisms → Bacteria | 2211
3300006046|Ga0066652_100256978 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1532
3300006175|Ga0070712_100178105 | All Organisms → cellular organisms → Bacteria | 1655
3300006574|Ga0074056_11849876 | All Organisms → cellular organisms → Bacteria | 5696
3300006575|Ga0074053_11804626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1133
3300006791|Ga0066653_10408298 | Not Available | 694
3300006876|Ga0079217_10541152 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 737
3300006954|Ga0079219_10882935 | All Organisms → cellular organisms → Bacteria | 719
3300007004|Ga0079218_11454146 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 737
3300007004|Ga0079218_13362584 | Not Available | 542
3300009038|Ga0099829_10807400 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 779
3300009137|Ga0066709_100284622 | All Organisms → cellular organisms → Bacteria | 2235
3300009176|Ga0105242_12960066 | Not Available | 525
3300009822|Ga0105066_1149061 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 536
3300010037|Ga0126304_10365869 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 960
3300010039|Ga0126309_10583939 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 701
3300010039|Ga0126309_10694941 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 652
3300010039|Ga0126309_11053333 | Not Available | 549
3300010048|Ga0126373_10346000 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1497
3300010166|Ga0126306_10458830 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1003
3300010322|Ga0134084_10196879 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 703
3300010322|Ga0134084_10406273 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 532
3300010337|Ga0134062_10374194 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 691
3300010396|Ga0134126_10600317 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales | 1259
3300011003|Ga0138514_100088543 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 663
3300011107|Ga0151490_1484740 | All Organisms → cellular organisms → Bacteria | 2780
3300011107|Ga0151490_1641432 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 783
3300011270|Ga0137391_10304983 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1373
3300011417|Ga0137326_1135506 | Not Available | 568
3300012096|Ga0137389_11003074 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 716
3300012189|Ga0137388_10194745 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1824
3300012198|Ga0137364_10199215 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1467
3300012200|Ga0137382_10040706 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2849
3300012201|Ga0137365_10437329 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 963
3300012204|Ga0137374_10061495 | All Organisms → cellular organisms → Bacteria | 3765
3300012208|Ga0137376_10193894 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1756
3300012210|Ga0137378_10409470 | All Organisms → cellular organisms → Bacteria | 1258
3300012211|Ga0137377_10769246 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 896
3300012358|Ga0137368_10023972 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales | 5688
3300012360|Ga0137375_10049760 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micrococcales | 4588
3300012360|Ga0137375_10250812 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1634
3300012363|Ga0137390_10115681 | All Organisms → cellular organisms → Bacteria | 2655
3300012960|Ga0164301_10120664 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1548
3300012977|Ga0134087_10550675 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 589
3300012987|Ga0164307_11527586 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 565
3300012988|Ga0164306_11175850 | Not Available | 642
3300012989|Ga0164305_11463081 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 604
3300014166|Ga0134079_10009550 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2928
3300014267|Ga0075313_1002134 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Methylococcales → Methylococcaceae → Methylocucumis → Methylocucumis oryzae | 4344
3300015356|Ga0134073_10209422 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 652
3300015359|Ga0134085_10352589 | Not Available | 654
3300015371|Ga0132258_10712002 | All Organisms → cellular organisms → Bacteria | 2528
3300015371|Ga0132258_13950418 | All Organisms → cellular organisms → Bacteria | 1007
3300015372|Ga0132256_103694997 | Not Available | 515
3300017657|Ga0134074_1407882 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 505
3300018027|Ga0184605_10012961 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 3232
3300018056|Ga0184623_10333897 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 681
3300018071|Ga0184618_10430318 | Not Available | 557
3300018077|Ga0184633_10153710 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1193
3300018078|Ga0184612_10584563 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 531
3300018422|Ga0190265_10274082 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1746
3300018422|Ga0190265_13079006 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 557
3300018429|Ga0190272_12205418 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 591
3300018432|Ga0190275_10262011 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1674
3300018432|Ga0190275_10457090 | Not Available | 1301
3300018465|Ga0190269_10960870 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 640
3300018466|Ga0190268_10168818 | Not Available | 1147
3300018466|Ga0190268_11548555 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 580
3300018469|Ga0190270_10470254 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1188
3300018469|Ga0190270_12165932 | Not Available | 616
3300018481|Ga0190271_12575822 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 610
3300018482|Ga0066669_10345285 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1233
3300018482|Ga0066669_10769988 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 850
3300019362|Ga0173479_10713418 | Not Available | 543
3300019377|Ga0190264_11080023 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 652
3300021560|Ga0126371_10452849 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1429
3300022756|Ga0222622_10636132 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 772
3300025160|Ga0209109_10477839 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 570
3300025165|Ga0209108_10321248 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 773
3300025289|Ga0209002_10392830 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 791
3300025313|Ga0209431_10114697 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2125
3300025313|Ga0209431_10172770 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1711
3300025324|Ga0209640_10051690 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 3571
3300025325|Ga0209341_10087849 | All Organisms → cellular organisms → Bacteria | 2625
3300025325|Ga0209341_10203153 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1656
3300025903|Ga0207680_10082664 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2022
3300025915|Ga0207693_11347361 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 532
3300025927|Ga0207687_10340860 | All Organisms → cellular organisms → Bacteria | 1219
3300025935|Ga0207709_11432135 | Not Available | 572
3300026306|Ga0209468_1027172 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2012
3300026343|Ga0209159_1228258 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 572
3300027775|Ga0209177_10436695 | Not Available | 533
(restricted) 3300027799|Ga0233416_10008342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 3357
3300027964|Ga0256864_1174167 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 613
(restricted) 3300027995|Ga0233418_10292053 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 565
(restricted) 3300028043|Ga0233417_10402669 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 631
3300028592|Ga0247822_11150067 | Not Available | 646
3300028717|Ga0307298_10067475 | All Organisms → cellular organisms → Bacteria | 994
3300028812|Ga0247825_10534707 | All Organisms → cellular organisms → Bacteria | 836
3300028878|Ga0307278_10003616 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 7522
3300028878|Ga0307278_10014031 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 3730
3300028880|Ga0307300_10137008 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 762
3300031229|Ga0299913_10778363 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 931
3300031576|Ga0247727_10183161 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1952
3300031740|Ga0307468_100901281 | Not Available | 767
3300031740|Ga0307468_101805672 | Not Available | 579
3300031938|Ga0308175_100002322 | All Organisms → cellular organisms → Bacteria | 12998
3300031939|Ga0308174_10656882 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 872
3300031949|Ga0214473_10162227 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2598
3300031965|Ga0326597_10096715 | All Organisms → cellular organisms → Bacteria | 3603
3300031965|Ga0326597_10349611 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → unclassified Geodermatophilaceae → Geodermatophilaceae bacterium | 1662
3300031996|Ga0308176_11294889 | Not Available | 774
3300032075|Ga0310890_11845437 | Not Available | 503
3300032122|Ga0310895_10733731 | Not Available | 516
3300033407|Ga0214472_10000855 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 36284
3300033407|Ga0214472_10061923 | All Organisms → cellular organisms → Bacteria | 3765
3300033417|Ga0214471_10753306 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 762
3300033814|Ga0364930_0145117 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 810
3300034151|Ga0364935_0262736 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 564
3300034165|Ga0364942_0011535 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2750

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
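
Each scaffold identifier in the table above appears to combine a numeric prefix and a scaffold name joined by "|". The numeric prefixes also occur as Taxon OIDs in the Associated Samples table, so the sketch below assumes that reading. The names `ScaffoldRef`, `parse_scaffold_id` and `is_short` are hypothetical helpers for illustration, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # assumed to be the IMG/M Taxon OID of the sample
    scaffold_name: str   # scaffold name within that sample
    restricted: bool     # True if the row carried the "(restricted)" flag


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split a scaffold cell such as '3300000364|INPhiseqgaiiFebDRAFT_104737047'."""
    text = raw.strip()
    restricted = text.startswith("(restricted)")
    if restricted:
        text = text[len("(restricted)"):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


def is_short(length_bp: int, threshold_bp: int = 2000) -> bool:
    """Apply the page's 'short scaffold' cutoff (< 2000 bp)."""
    return length_bp < threshold_bp


example = parse_scaffold_id("3300000364|INPhiseqgaiiFebDRAFT_104737047")
restricted_example = parse_scaffold_id("(restricted) 3300027799|Ga0233416_10008342")
print(example.taxon_oid, example.scaffold_name, is_short(549))
print(restricted_example.taxon_oid, restricted_example.restricted)
```

Keeping the restricted flag as a separate field makes it easy to exclude those rows before any downstream use, in line with the JGI data usage note.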




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.38%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 12.18%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 9.62%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.13%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 5.13%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 5.13%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.49%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.49%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 3.85%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.21%
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 3.21%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.21%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 3.21%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.56%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.92%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.92%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.92%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.28%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.28%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.28%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.64%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.64%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.64%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.64%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.64%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.64%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.64%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.64%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.64%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.64%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.64%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.64%




Associated Samples

Note: Some of these datasets are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature below for a restricted dataset requires obtaining a license from that dataset's corresponding author(s).

Taxon OID Sample Name Habitat Type IMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002120Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2EnvironmentalOpen in IMG/M
3300002407Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1EnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005535Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006574Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAA (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300006575Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLAA (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009822Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40EnvironmentalOpen in IMG/M
3300010037Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25EnvironmentalOpen in IMG/M
3300010039Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot56EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010166Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot27EnvironmentalOpen in IMG/M
3300010322Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300011003Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015EnvironmentalOpen in IMG/M
3300011107Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAC (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011417Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT500_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012988Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014267Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailC_D1EnvironmentalOpen in IMG/M
3300015356Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018432Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 TEnvironmentalOpen in IMG/M
3300018465Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 ISEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019362Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2)EnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025289Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025903Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026306Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)EnvironmentalOpen in IMG/M
3300026343Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027964Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeqEnvironmentalOpen in IMG/M
3300027995 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MGEnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028717Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300028880Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_181EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300031939Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R2EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032122Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D4EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M
3300034151Sediment microbial communities from East River floodplain, Colorado, United States - 2_s17EnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature below for a restricted sequence requires obtaining a license from the corresponding author(s) of its source dataset.

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10473704713300000364SoilVLGPSPNCVXRLVIGLVAVVGVGXFGAAQGSASSAPIVITYAKTCVEATGHCLGTAGSGGTFEMQVTSFRATGKAAQLTVTESITVGAISFTAEMNAHASPSGFILLNGRVTEGSFAGAQVHQRSNLVGGNATTTDW
JGI1027J12803_10022182413300000955SoilMSTIKAIFGLLLATVATVFAGLALGATQASASSAPIVIPYAKTCNEFGHCKGTVNGVTIEMQITGFRATGDAAQLTLTEWITAGGISFTAVMNGHSSPEGFIVLNGTVTDGSFAGAQVHQRSNYVRGPANASEWTGELQLMPASA*
JGI10216J12902_10146241523300000956SoilAFGAAQASASSAPIVITYEKTCNEPAGYCASSPGASVTIVMQVDPTTFRATGDAFQFTLTEWITVGDVSFTAVMNAHRSPDGFIVLNGMITKGSFTGAQVHQRSNYVSGPPTASKWTGELQLIPASACA*
JGI10216J12902_10757831613300000956SoilSIMKRTLALATVVGVVAFGPAQASASNPPTVISYAKTCDAAAGHCLGTAGNGGTIEMQVTSFRVTGKAAQLTLTEWITVGNISFTAEMNGAVSPAGFIVLNGTVTQGSFAGAQIHQRSNYVGGVAFTSWVGELRLVPASA*
JGI10216J12902_10936737623300000956SoilASASSAPIVIPYAKTCIGTVCDGTAGNGGTLHMEVTGYRATGDAAQLTLTEWITVGNISFRAEMNGTVSPDGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTAWTGKLQLVSATA*
JGI10216J12902_11019555123300000956SoilVLGPSPNFVARLVIGLVAVVGVGAFGAAQGSASSAPIVITYAKTCVEATGHCLGTAGSGGTFEMQVTSFRATGKAAQLTVTESITVGAISFTAEMNAHASPSGFIVLNGRVTEGSFAGAQVHQRSNLVGGNATTTDWIGQLQLMPASA*
JGI10216J12902_11431294113300000956SoilMRQQPFMAIAAVLAVVGIVAFGAARASASSAPIVIAYEKTCDATVGHCVGTVNGVTIEMQVTSFRPSGNAAQLTFTEEITVGNISFTAEMNGHASPAGFIVLNGKVTEGSFAGANVHQRSNLVSGGGTTTTVWTGQLQVMPASA*
JGI10216J12902_11476839223300000956SoilMKKLLVLVVAVVAVVAPGAVQASASSAPIVIAYAKTCDQGGHCVGTAGNGGTFEMQVTSFRSTGDAAQMTATESITVGDISFTAEMSGHVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGNATTTAWTGELQLVPASA*
JGI10216J12902_11743742913300000956SoilVVGVVAFGAAQASASSAPIVIQYEKTCTGGVCDGPAGDGGTLHMEVTGYRSTGDAAQLTVTERITDRDISFTVEMSGHFSPAGFIVLNGTVTSGSFAGARVHQRSNFVSADATSDSWVGELQLMPASA*
C687J26616_1000691113300002120SoilMTNLKTIIGLLVAAVAVVGVAAFGAVQASASSAPIVIPYAKTCDETVGKCVGTAGNGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVADDSDSFAGAQVHQRSNLVGIDGTTTSWTGELRLMPASA*
C687J26616_1020645823300002120SoilMTNIKTLFGLLAAVAAVFGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTAWTGELRLAPASA*
C687J29651_1023392513300002407SoilMTNLKTIIGLLVAAVAVVGVAAFGAVQASASSAPIVIPYAKTCDETVGKCVGTAGNGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVH
C688J35102_12098922743300002568SoilMIKLKTIGGLLGALAGVVGVVALGAAQASASSAPVVIQYQKTCNEFGHCHGTANGVTIDMQIMSFRATGDAAQLTLTETVTTGRISFTAALSGTSSPDGFIVLNGRVTDGSFAGAQVHQRSNYMSGPPNASLWSGELQLVPATA*
Ga0062593_10029382623300004114SoilMNNIKARFGLLVAVFALVAFGAAQASASSAPIVIPYAKTCGPTGHCAGSAGNGGTLEMQVTSLRPSGDAAQLTLTEWITVGGISFTAEMNGTVSPAGFIVLNGTVTAGSYAGAQVHQRSNLVGAAGTTTSWTGQLQLMPATA*
Ga0062589_10263428713300004156SoilVVAFGAAQASASSAPVVIPYQKTCDETNHCVGTLNGVTIDMQVTGFRATGGGGDLSVTETITLGGGIWFKAEMDGHLSPAGFIVLNGTVTEGKAFVGAQVHQRSNYVSGPANASVWTGELQLMPASA*
Ga0062590_10049512723300004157SoilMKRTLALAVVVGVVAFAAAQASASAAPIVISYAKTCDETIGHCAGTAGNGGTLEMQVTSFRASGHGAVLTLTEWITVGNVSFRAEMNGHRSPAGFIVLNGTVTEGSFVGAEIHQRSNLLGGPPTASQWTGKLQLMPASA*
Ga0062595_10020801513300004479SoilMNRSGRTIVGFLAAAVMAGVFAAAQASASSAPVVIPYAKTCDLTVGHCVGSAGNGGTIEMQVTSFRPTGDDAQLTLTEWITVGRISITAEMNGHWSPAGFIVLNGTVTDGPFAGAQVHQRSDYVGGDPTASRWAGELQLAPASA*
Ga0062591_10197671423300004643SoilMNNIKARFGLLVAVFALVAFGAAQASASSAPIVIPYAKTCGPTGHCAGSAGNGGTLEMQVTSLRPSGDAAQLTLTEWITVGGISFTAEMNGTVSPAGFIVLNGTVTAGSYAGAQVHQRSNLVGAAGTTTS
Ga0062594_10099991223300005093SoilMTNIKSLIGLTVAVVAAVVGPVAFGAAQAFASGAPVAIPYAKTCDETVGHCVGTAGAGGTLEMQVTSFRGTGKAAHLTLTEWITVGGISFTAEMSGNRSPAGFIVLNGTVTEGSFVGAQVHQRSNLMGGSATASEWIGKLRLVLGDDD*
Ga0066678_1109781913300005181SoilTRFGLLAVVGVVVLVAFGAAQASASSAPIVITYDKTCVGTVCTGTAGNGGTIEMQITSYRGTGSAAQLTLTEKITVGDISFTAEMSGHLSPAGFIVLDGTVTADSFAGAQVHQRSNLVGGTPAATDWTGELQLVPASA*
Ga0066388_10438406723300005332Tropical Forest SoilMTSKTLFGLLVVVVAAVVGVLGLGAEQASASSSPIVISYAKTCDETAGHCVGTAGDGGTLEMQVTSFRASGSAAQLTLTEWITVGNISFAAEMSGHQTPAGFIVLNGTVTEGSFAGAEVHQRSNLVGGDATTTQWTGQL
Ga0066388_10523776723300005332Tropical Forest SoilQASASSAPIVISYAKTCDETTGHCLGTAGDGGTFEMQVTSFRATGKAGQLTMTEWITVGDISFTAELNAHASPSGFIVLNGRVTEGSFAGAEVHQRSNLVGGNATTTDWTGKLQLMPASG
Ga0066388_10600735813300005332Tropical Forest SoilMTRLKTIVGLLGAVVVLIGIVALGAAQASASSAPIVIRYAKTCGDTPGHCIGTAGNGGTIEMQITSFRPTGDGAELKLTEWITVGNISFTAEMNGHSSPAGFIVLDGRVTKGSFVGSQVHQRSNLTGA
Ga0070714_10092677813300005435Agricultural SoilMRSKKLLIPILLVVAVAAGMFATAQASASSAPIVITYAKTCDLTVGHCVGTAGNGGKIEMQVTSFQATGGAAQLTLTEWITVGAISFTAEMNGHQSPAGFIVLNGTVTEGSFTGAHIQQRSDFVGGPLTASVWAGQLQLLPASA*
Ga0070710_1097334613300005437Corn, Switchgrass And Miscanthus RhizosphereMRSKKLLIPILLVVAVAAGMFATAQASASSAPIVITYAKTCDLTVGHCVGTAGNGGKIEMQVTSFQATGGAAQLTLTEWITVGAISFTAEMNGHQSPAGFIVLNGTVTEGSFAGAHIHQRSDYVGGPLTA
Ga0066681_1050781613300005451SoilMFGRNIYRRLTALIAIVVGVLAFGAAQASASSAPVVIAYHKTCDASGHCFGSAGNGGTIEMWITSFRATGDAAQLTLRETIKSGNISFTAVMNGTMSPAGFIVLNGTVTDGSFAGAQIHQRSNYDSGPADASVWNGELQLLPASS*
Ga0070684_10210604313300005535Corn RhizosphereASSAPIVISYHKTCDLTVGHCIGTAGNGGTIVMQVTSLRGSGDAAQLTLTEWITVGNISFTAEMNGHQSPDGFIVLNGTVTDGSFLGAQIHQRSNYMGGPLTASEWSGKLQLVPASA*
Ga0070696_10084287113300005546Corn, Switchgrass And Miscanthus RhizosphereMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDQTVGRCVGTAGHGGTIEMQITSFRATGKAAKLTLTEKITVGDIMFTAEMSGNVSPAGFIVLNGTVTKGSFAGARVHQRSNLTSAHETTTEWTGELRLMPASA*
Ga0066695_1014567213300005553SoilMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVITYKKTCEAGYCAGTAGDGGTLRMQITSYRVTGNDAQLTLTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTEGSFVGAQVHQRSNLMGGDAITQWTGELQLLPASA*
Ga0066707_1018134523300005556SoilMTKLKTIVGLLVALAGVVGVVALGAAQASASSAPIVIRYEKTCNQFGYCKGTVNGVTIEMQITSFRATGDAAQLTLTESITTPQISFTAVMNGTSSPDGFIVLNGTVTAGSFEGAQVHQRSNYVRGPANASEWTGQLQLVSATT*
Ga0066705_1097383913300005569SoilGHKSYGRLAALIAIVVGVVAFGAAQASASSAPIVISYHKQCDAFGHCVGSAGNGGTIEMWITSFRATGDAAQLTLRETIKSGNISFTAVLNGTMTPAGFIVLNGTVTDGSFAGAQIHQRSNYDSGPADASVWNGELQLLPASS*
Ga0066903_10021638623300005764Tropical Forest SoilMKGAMEAMTNKTRFGLLAVVGVVGLAAFGSAQASASGAPIVIPYAKVCDETVGHCVGTAGTGGTLEMQITSFRATGDGAQLTLTEWITVGNIKFTAEMNGHRSPAGFIVLNGTVTEGSFAGAQVHQRSNLLGGPVTASQWTGELQLVPASA*
Ga0066903_10086182733300005764Tropical Forest SoilMTSKTLFGLLVVVFAAVVGVLGLGAEQASASSSPIVISYAKTCDEIAGHCAGTAGDGGTLEMQVTSFRVSGSAAQLTLTEWITVGNISFTAEMSGHQTPAGFIVLNGRVTEGSYAGAEVHQRSNLVGGDATTTQWTGQLQLMPASG*
Ga0066903_10365893023300005764Tropical Forest SoilMAAVVGVAAFGAGQASASSAPIVISYAKACDETAGHCVGTAGNGGTLEMQVTSFRATGHAGQLTLTEWITVGDISFTAEMNAHSSPAGFIVLNGRITEGSFAGAEIHQRSNLVGGDPATTDFWIGELRIMPASG*
Ga0066903_10578072713300005764Tropical Forest SoilESEEESNERKGAEEAMIKTKTGLALLVAVLAAVIGVVAFGAARASATSAPILISYAKTCDETTGHCAGTAGNGGTLEMQVTSFRATGNDAAQLTLTEWITVGDISFTAEMYAHASPSGFIVLNGRVTEGSFAGAEIHQRSNFDGLVGGDPNRDHWLGELQLMPASG*
Ga0066903_10638454713300005764Tropical Forest SoilVLRPSPRWVAVVVIALAAAVGAVALGVTRASASNAPIVISYAKTCDLTVGHCVGTAGNGGTIEMQITSFRGSGDAAQLTLTEWITVGNTSFTAEMNGHESPDGFIVLNGTVTDGSFLGARIHQRSNYVGGSLTASDWTGELQLIPASW*
Ga0066652_10011327123300006046SoilMKRTMALAAVLGVVAFAAAQASASSGPITIPYAKTCEVVNSVLQCSGTAGNGGTIFMQVTSLRASGDAAQLTLTEWITLGGGIGFEAEMSGHLTPAGFIVLNGTVTDGWYVGAEVHQRSNLIGGAATTTEWTGELQLMPASA*
Ga0066652_10025697823300006046SoilMFGHKSYGHLAALIPIVVGVVAFGAAQAAASSAPIVIQYHKTCDAGGHCVGTVNGVTIDMQITGFRATGDAAHLTLNETITTPEMSFKAALSGTMTPAGFIVLNGTVKEGSFAGAQIHQRSNYVSGPPDASVWDGALQLLPATA*
Ga0070712_10017810513300006175Corn, Switchgrass And Miscanthus RhizosphereVLGPSPNCVARLVIGLVAVVGVGAFGAAQGSASSAPIVITYAKTCVEATGHCLGTAGSGGTFEMQVTSFRATGKAAQLTVTESITVGAISFTAEMNAHASPSGFIVLNGRVTEGSFAGAQVHQRSNLVGGNATTTDWIGQLQLMPASA*
Ga0074056_1184987663300006574SoilMTKINAIVGLLVAAVAAVGVVAFGAAQASASSEPIVIPYAKTCDTPTHCAGTAGVGGTLEMWVTGFRPTGNAAQLTLTERITVGDISFTAQMNGDVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGTATTTTWTGELRLMPASA*
Ga0074053_1180462613300006575SoilMTKINAIVGLLVAAVAAVGVVAFGAAQASASSEPIVIPYAKTCDTPTHCAGTAGVGGTLEMWVTGFRPTGNAAQLTLTERITVGDISFTAQMKGDVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGTATTTTWTGELRLMPASA*
Ga0066653_1040829813300006791SoilGTSRKRTGSSRRHRHSSIESEEESNESKGAEEAMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVITYKKTCEAGYCAGTAGDGGTLRMQITSYRATGNDAQLTLTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTEGSFVGAQVHQRSNLMGGDETTTSWTGELQLLPASA*
Ga0079217_1054115223300006876Agricultural SoilMTHIKKTVGLLLAAVALVAFGAAPASASTAPIVITYEKDCNQLIGHCEGSAGNGGTFEMQVTSFRPTGRAAQLTMTEWITVGDISFTAVMTAHSSPAGFIVLNGTVTEGDFLGAQVHQRSNFVSAAGFMTSWTGQLMLMPSS*
Ga0079219_1088293523300006954Agricultural SoilAAAQASASSAPIVISYTKTCQVVNGVLQCSGTAGNGGTIFMQVTGFRSSGHAAQLTLTEWVTAGNISFTAEMSGHQTPDGFIVLNGTVTEGPFTGAEVHQRSNLTGGTAAVSEWSGALQLLPATR*
Ga0079218_1145414623300007004Agricultural SoilMTNIKAIFALRAAVVAVVGVVSFGAAQASASSAPIVIPYAKTCDETVGHCVGTAGDSGTLEMQVTSFRATGKAAQLTFTEWVTVGDISFTAEMSGHASPAGFIVLNGTVTEGSFAGARVHQRSNLASTAGTTTSWTGELRLMPASA*
Ga0079218_1336258413300007004Agricultural SoilRQIGEGPSQRGGVMSNVKAIFGLLVAVVVGVVAFGAAQPSASSAPILIPYAKTCNEATGRCVGTAGAGGTIVMQVTSFRATGSAAQLTATETITVGGISFTAEMNLHVSPAGFIVLNGTVTEGSFAGAQVHQRSNYVSGPLTASVWTGELQLTPASA*
Ga0099829_1080740013300009038Vadose Zone SoilNERKGPEEAMTKLKTSIVAVVAVVGVVAFGAAQASASRAPTVISYAKTCNEATGHCSGTAGNGGTLEMQITSFRVTGNDAQLTLTEWIVVGDISFTAEMSGHVSPAGFIVLNGTVTDGSFTGAQVHQRSNLVGVNGTTTSWTGELQVLPASA*
Ga0066709_10028462223300009137Grasslands SoilMTNLKTIIGLLVATVAVVGVLAFGAAQASASSAPIVITYVKTCEAGYCAGTAGSGGTLRMQITSYRVTGNDAQLTLTEWITVGDISFTAEMNGHFSPAGFIVLNGTVIEGSFAGAEVHQRSNLVGADGTTTSWTGEYHLLPASA*
Ga0105242_1296006613300009176Miscanthus RhizosphereAAQASASSAPLVIPYAKTCGAGHCLGTAGAGGRIEMQITSFRATGNAAQLTLTEWITVGDVSFTAEMSGTVSPAGFIVLSGTVTEGPFAGAQVHQRSNLVGGNATTTVWTGELRLMPASA
Ga0105066_114906113300009822Groundwater SandMPRSWLSVTDRQRREEVMTKIKAIFGLLVVVGIVAFGAAQASASSAPIVIPYAKTCDETVGHCVGSAGDGGTFEMQVTSFRATGNAAQLTLTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVSLVGTTSAWTGELRIMPASA*
Ga0126304_1036586913300010037Serpentine SoilMSNIKTLFGLLVAVVAAVVGVVAYGAVQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQITSFRATGKGAKLTLTEWITVGDISFTAEMEGDVSPAGFIMLNGTVTDTKGSFAGAQVHQRSNLVGAAGTTTSWTGELRLIPASA*
Ga0126309_1058393913300010039Serpentine SoilMTNLKTIVGLLGAVVGVVGVVALGAAQASASSAPIVIPYAKTCDETGHCVGSAGNGGTFEMQVTSFRASGNAAQLTLTEWITVGDISFTAKMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIAGTTTAWTGELQLLPASA*
Ga0126309_1069494123300010039Serpentine SoilMTNIKRICGFLAAVVGLVGVVAFGAAQASASSAPIVIPYEKTCAAGHCLGTAGDGGSFEMQVTSPPQATGKAAQLTVTEWITVGEISFTAEMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSDLVGSDGTTTVWTGELRLMPASA*
Ga0126309_1105333313300010039Serpentine SoilVKRNFILLVAVLAAVGVVAFGAAQASASSAPIVIPYAKTCNQAGHCMGTAGSGGTLEMQVTSFRATGKGAELTLTEWITVGDISFTAEMNGHVSPAGFILLNGTVTEGSFAGARIHQRSNLVGGDAITT
Ga0126373_1034600013300010048Tropical Forest SoilVSAFGATQASASSAPIVISYAKTCDVNTGHCVGTAGNGGTFEMQVTSFTGTGNAGQLTMTEWITVGDISFTAELNAHASPSGFIVLNGTVTEGSFAGAQVHQRSNLVGGDATTTEWTGELQIMPASG*
Ga0126306_1045883023300010166Serpentine SoilMTNIKTMLGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGRCVGSAGDGGTFEMQVTSFRATGNAAQMTATEKITVGGISFTAEMNGHVSPYGFIVLNGTVTEGSFAGAQVHQRSNLVGAAGTTTSWTGELRLIPASA*
Ga0134084_1019687913300010322Grasslands SoilMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQITSFQATGNGAQLTLTEWITVGDISFTAEMEGDVSPAGFIVLNGTVTEGSFAGAQVHQRSNLV
Ga0134084_1040627313300010322Grasslands SoilMFGRNIYRRLTALIAIVVGVLAFGAAQASASSAPVVIAYHKTCDASGHCFGSAGNGGTIEMWITSFRATGDAAQLTLRETIKSGNISFTAVMNGTMSPAGFIVLNGTVTDGSFAGAQIHQRSNYDSGPADASVW
Ga0134062_1037419413300010337Grasslands SoilMTSIKVMLRLLVTAVAAVVVFGAAQASASSAPIVIPYAKTCNELAGHCEGTANGVSIVMQVTGFRPTGKAAQLTLTESITVGDISFTAVMKGDRSPAGFIVLNGNVTEGSFAGAQVHQRSNYVGGPATASEWTGELRIVPASA*
Ga0134126_1060031733300010396Terrestrial SoilMRSKKLLIPILLVVAVAAGMFATAQASASSAPIVITYAKTCDLTVGHCVGTAGNGGKIEMQVTSFQATGGAAQLTLTEWITVGAISFTAEMNGHQSPAGFIVLNGTVTDGPFAGAQVHQRSDYVGGDPTASRWAGELQLAPASA*
Ga0138514_10008854323300011003SoilMTRVKAIFGLLLATVAAVFGVVAFGAGQASASSAPIVIPYAKTCDETVGHCVGSAGNGGTFEMQVTSFRASGNAAQLTVTEWITVGDISFTAEMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTEWIGELQLLPASA*
Ga0151490_148474033300011107SoilMTKINAIVGLLVAAVAAVGVVAFGAAQASASSEPIVIPYAKTCDTPTHCAGTAGVGGTLEMWVTGFRPTGNAAQLTLTERITVGDISFTAVMNGHRSPAGFIVLNGTVTQGSFAGAQVHQRSNYVSGPATASVWTGELQLMPSSA*
Ga0151490_164143223300011107SoilMTNLKKIVGLLGAVVGVVGVVAFGAAQASASSAPIVIPYAKTCEAGHCVGTAGDGGTFEMQVTSFRATGSAAQMTATEKITVGGISFTAEMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSDLVGGNATTTQWTGELQLLPASA*
Ga0137391_1030498313300011270Vadose Zone SoilNPIEGGSEMKGAMEVMTNIKTRFGLLAVVGVVGLAAFGAAQASASSAPIVIPYAKICDETVGHCVGTAGNGGTLEMQITSFQVTGNDAQLTLTEWIVVGDISFTAEMNGHVSPAGFIVLNGTVTDGSFTGAQVHQRSNLVGVNGTTTSWTGELQLLSASA*
Ga0137326_113550623300011417SoilVKRNFILLVAVVGVVAFGAAQASASSAPIVIPYEKTCNELVGHCKGTAGNVTIEMQVDPTSFRATGNAVQFTLTEWITVEDISFIAVMNAHRSPAGFIVLNGTVTEGSFAGAQVHQRSNY
Ga0137389_1100307413300012096Vadose Zone SoilDPSNERKGAEEAMTKLKTIIVAVVAVVGVVAFGAAQASASRAPTVISYAKTCNEATGHCSGTAGNGGTLEMQITSFRVTGNDAQLTLTEWIVVGDISFTAEMSGHVSPAGFIVLNGTVTDGSFTGAQVHQRSDLVGVNGTTTSWTGELQVLPASA*
Ga0137388_1019474513300012189Vadose Zone SoilNIKTRFGLLAVVGVVGLAAFGAAQASASSAPIVIPYAKICDETVGHCVGTAGNGGTLEMQITSFQVTGNDAQLTLTEWIVVGDISFTAEMNGHVSPAGFIVLNGTVTDGSFTGAQVHQRSNLVGVNGTTTSWTGELQLLSASA*
Ga0137364_1019921513300012198Vadose Zone SoilMTKLKTIVGLLVALVGVVGVVALGAAQASASSAPIVIRYEKTCNEFGYCKGTVNGVTIEMQITSFRATGDAAQLTLTESITVGDISFTAVMSGNASPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTAWTGELRIVPASS*
Ga0137382_1004070623300012200Vadose Zone SoilMTNLKTIIGLLVAAVAVVGVVAFGAPQASASSAPLVISYEKTCVVATGHCEGSAEGSGTFEMQVTSFRATGDAAQLTVTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTNGSFAGAQVHQRSNLVGGSESTTAWTGELQLVPASA*
Ga0137365_1043732923300012201Vadose Zone SoilMKRTFVLMAVLGVVSFAAAQASAFSAPIVISYEKTCDLTVGHCVGTAGNGGTIEMQVTSLRGSGDAAQLTLTEWITIGNSSFTAEMNGHESPAGFIVLNGTVTEGAFLGAQVHQRSNFMGGPLTASEWGGELQLAPASA*
Ga0137374_1006149553300012204Vadose Zone SoilMPRSWLNVKRKGAEEAMTNLKTIVGLLVAAVAVVGVVAFGAVQASASSAPIVIPYAKTCDETIGHCVGTAGDGGKLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEDSDSFAGAQIHQRSNLVGINGTTTSWTGELRLMPATG*
Ga0137376_1019389413300012208Vadose Zone SoilMTNLKAIIGLLVAAMAVVGVVAFGAPQASASSAPLVISYEKTCVVATGHCEGSAEGSGTFEMQVTSFRATGDAAQLTVTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTNGSFAGAQVHQRSNLVGGSESTTAWTGELQLVPASA*
Ga0137378_1040947033300012210Vadose Zone SoilAVVGVVAFGAAQASASSAPIVIPYAKTCTAGVCVGTAGDGGTLRMQVTSYRTTGNAAQLTLTERITVGDISFTAEMSGHVSPAGFIVLNGKVTEDSFAGAQVHQRSNLVGGNETTTSWTGELQLLPASA*
Ga0137377_1076924633300012211Vadose Zone SoilGVVAFGAPQASASSAPLVISYEKTCVVATGHCEGSAEGSGTFEMQVTSFRATGDAAQLTVTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTNGSFAGAQVHQRSNLVGGNETTTSWTGELQLLPASA*
Ga0137368_1002397223300012358Vadose Zone SoilMTNLKTIVGLLVAAVAVVGVVAFGAVQASASSAPIVIPYAKTCDETIGHCVGTAGDGGKLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEDSDSFAGAQIHQRSNLVGINGTTTSWTGELWLMPATG*
Ga0137375_1004976023300012360Vadose Zone SoilMPRSWLNVKRKGAEEAMTNLKTIVGLLVAAVAVVGVVAFGAVQASASSAPIVIPYAKTCDETIGHCVGTAGDGGKLEMQVTSFRATGRAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEDSDSFAGAQIHQRSNLVGINGTTTSWTGELRLMPATG*
Ga0137375_1025081223300012360Vadose Zone SoilEAMINLKAIVGLRVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCNEAGYGEGPAGDGGKLKMQVTSLRGTGNAAQLTLTEWITVGDISFTAEMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGNEKTTEWTGELQLLPASA*
Ga0137390_1011568123300012363Vadose Zone SoilMEVMTNIKTRFGLLAVVGVVGLAAFGAAQASASSAPIVIPYAKICDETVGHCVGTAGNGGTLEMQITSFQVTGNDAQLTLTEWIVVGDISFTAEMNGHVSPAGFIVLNGTVTDGSFTGAQVHQRSNLVGVNGTTTSWTGELQLLSASA*
Ga0164301_1012066433300012960SoilMTKINAIVGLLVAAVAAVGVVAFGAAQASASSEPIVIPYAKTCDTPTHCAGTAGVGGTLEMWVTGFRPTGNAAQLTLTERITVGDISFTAQMNGHVSPAGFIVLNGTVTTGSYAGAQVHQRSNLVGGD
Ga0134087_1055067513300012977Grasslands SoilMKKLLVLVVAVVAVVVPGTVQASASNAPIVIAYAKTCDETVGHCVGTAGNGGTLEMQVTSFRATGDGAALTLTEWITVGNIKFTAEMSGHRSPAGFIVLNGTVTEGSFAGAQVHQRSDLVGGVGTTTEWIGKLQLVPASA*
Ga0164307_1152758613300012987SoilMSKKGKTIFGILVAAIVAGAFAAAQASASNAPIVISYAKTCDLTVGHCVGSAGNGGTIEMQVTSFRQTGADAQLTLTEWVTVGHISFTAEMNGHWSPAGFIVLDGTVTAGSLAGARVHQRSNFVGGDPTASKWAGELRLAPATA*
Ga0164306_1117585013300012988SoilMKKLLVLVAAIVTAAGISVTGATASSAPISISYTKTCNALIGHCFGSANGVTIEMWITSFTATGDAAQLTLRESVTAGNISFTAVMNGHNSPAGFIVLNGTVTDGSFKGAQVHQRSNLVGGTATTTTWTGELRLMPASA*
Ga0164305_1146308123300012989SoilGAAQGSASSAPIVITYAKTCVEATGHCLGTAGSGGTFEMQVTSFRATGKAAQLTVTESITVGAISFTAEMNAHASPSGFIVLNGRVTEGSFAGAQVHQRSNLVGGNATTTDWIGQLQLMPASA*
Ga0134079_1000955053300014166Grasslands SoilMKKLLVLVVAVATAAGISVAIATASGAPIVITYEKTCDETVGHCLSTPGSEATLEMQVTSFRATGDGAQLTLTESITVGDISFTAEMNGHRSPAGFIVLNGTVTEGSFAGAQVHQRSNYLGGPATASAWNGELQLVPAS
Ga0075313_100213443300014267Natural And Restored WetlandsVALGAAQASAANAPIVIQYAKTCDEAVGHCVGTAGGSGTLEMQVTSFRATGKRAELSLTEWITVGGISFTAEMDGHVSPAGFIVLNGTVTVGSFAGAQVHQRSNLASAAGTTTTWTGELRLMPASA*
Ga0134073_1020942223300015356Grasslands SoilMFGRNIYRRLTALIAIVVGVLAFGAAQASASSAPVVIAYHKTCDASGHCFGSAGNGGTIEMWITSFRATGDAAQLTLTESITVGEISFTAVMNGTRSPAGFIVLNGTVTEGSFAGAQVHQRSNYVRGPASASEWIGELQLMPASA*
Ga0134085_1035258923300015359Grasslands SoilVAVVGVVAFGAAQASASSAPIVITYKKTCEAGYCAGTAGDGGTLRMQITSYRVTGNDAQLTLTEWITVGDISFTAEMNGHFSPAGFIVLNGTVIEGSFAGAQVHQRSNLMGGDEITQWTGELQLLPASA*
Ga0132258_1071200223300015371Arabidopsis RhizosphereMKKLLVLVATVAGIVAFGAAQAFASSAPIVIPYAKTCDETVGHCLGTAGNGGTLEMQITSFQATGNAAQLTLTEWITVGDISFTAEMEGSVSPHGFIVLNGTVTEGSFAGAQVHQRSNFASAAGTTTSWIGELQLTPASA*
Ga0132258_1395041813300015371Arabidopsis RhizosphereMINSKTLIGSLATVASVVVLGTGQASASSAPIVIPYAKTCNEAVGHCVGTAGSGGTLEMQVTSFRATGDGAELSLTEWITVGSISFTAEMDGRVSPAGFIVLNGTVTRGSFAGAQVHQRSNLVSAAGTTTTWTGQLSLMPASA*
Ga0132256_10369499713300015372Arabidopsis RhizosphereGAAQAAASNAPIVISYGKACAAGHCVGKAGDGGKIEMQITSFRATGTAAQLTLTEKIEVGDIKFTAEMNGHVSPAGFIVLNGRVTEGSFAGAEVHQRSNLVGGSATTTDWTGELRLMPASA*
Ga0134074_140788213300017657Grasslands SoilSSAPIVIPYTKTCDETVGHCVGTAGDGGTLEMQITSFRATGNAAQLTLTEWITVGDISFTAEMEGDVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGIDGTTTSWTGELQLLPASA
Ga0184605_1001296133300018027Groundwater SedimentMTNLKTIVGLLVAAVAVVGVIAFGTAQASASSAPIVISYAKTCTGGVCDGPAGDGGTLHMEVTGYRSTGDAAQLTVTERITDRDIAFTAEMSGHFSPAGFIVLNGTVTSGSFAGARVHQRSNFVSADATSDSWVGELQLMPASA
Ga0184623_1033389713300018056Groundwater SedimentMKKLLVLVATVAGVVAFGAVQASASSAPIVIPYTKTCNAAGHCEGTAGDGGTLEMQVTSVRATGSAGQLTLTEWITVGDISFTAEMNGHASPAGFIVLNGTVTDGSFKGAQVHQRSDFVGFVGGNPTATAWTGELRLMPASA
Ga0184618_1043031813300018071Groundwater SedimentASASSAPIVSSYAKTCTGGVCDGPAGDGGTLHMEVTGYRSTGDAAQLTVTERITDRDIAFTAEMSGHFSPAGFIVLNGTVTSGSFAGARVHQRSNFVSADATSDSWVGELQLMPASA
Ga0184633_1015371013300018077Groundwater SedimentMTNIKTLFGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCSGTTGDGRTFEMQITSFRATVKAAQLTATVTVGNISFTAEMKGHVSPAGFIVLTGTVTEDSDSFAGAQVHQRSDLVGADGTTTTWTGELRLIPA
Ga0184612_1058456313300018078Groundwater SedimentMTNIKTLFGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCTGNAGGVELDMQVTSFRATGNAAQLTFTEKIKVGAISFTAEMNGHASPAGFIVLNGTVTDGSFTGAQVHQRSNLVSVDGTTTFWSGEL
Ga0190265_1027408223300018422SoilMKKLRVLVAAVVGVVAIGAAPVSASSAPIVITYAKTCNELIGLCEGSAGDGGTFSMQVTGFRPTGKAVQLTMTEEITVGDISFTAQMTAHASPAGFIVLNGTVTEGDFLGAQVHQRSNFVSAVGFMTSWVGELRLMPKSG
Ga0190265_1307900613300018422SoilMTNIKAVFGWLAAVAVFVGVVTLGAAQASASSAPIVIPYAKTCDETVGHCVGTAGGGGTLEMQVTGFRATGKAAQLTFTEWITVGDISFTAEMNGHVSPAGFILLNGTVTSGSFAGAQIHQRSNLVGV
Ga0190272_1220541813300018429SoilFASSAPIVITYAKTCNESTGHCVGSAGNGGTFEMQVTGFRATGKAAQLTFTEEITVGDISFTAEVSGHVSPAGFIVLNGTVTEGSYVGAQIHQRSNLVGIEGSTSSWTGELQLMPASA
Ga0190275_1026201113300018432SoilMKKLRMLVAAVVGVVAIGAAPASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGKAAQLTFTEEITVGDISFTAELSGHVSPAGFIVLNGTVTEGSFAGAQVHQRSDLVGIDGTTTAWTGELRLMPASA
Ga0190275_1045709033300018432SoilVVAVSFEIEAAPASASSAPIVITYAKTCNESIGLCVGSAGNGGTFEMQVTGFRATGKAAQLTVTEEITVGDISFTAEMRGHASPAGFIVLNGTVTEGSFLGAQVHQRSNLVSADGSMTSWIGELRLMPESG
Ga0190269_1096087013300018465SoilMGGHNIYRRLAALIAIVVGVVAFGAAQASASSAPIVIPYAKTCDETGHCVGSAGNGGTFEMQVTSFRPSGKAAQLTVTEWVRVGNISFRAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSNLAGLAGTTTAWTGALQLVPASA
Ga0190268_1016881813300018466SoilMKKLRMLLAAVVGVVAFGAAPASADNAPIEITYAKTCNQSIGLCEGTAGTDGTFRMQVTSLRATGKAAQLTMTEWITVGDISLTAQMTAHASPAGFIVLNGTVTEGSFLGAQIHQRSNLEGAFVSNGILHTTWVGELSL
Ga0190268_1154855523300018466SoilMTKLKTIVGLLGAVVGVVGVVALGAAQASASSAPIVIPYAKTCDETRGHCVGSAGNGGTLEMQVTSFRPSGKAAQLTVTEWVRVGNISFRAEMNGHASPAGFVVLNGTVTEGSFAGAQVHQRSNLVGIAGTTTAWTGALQLLPASA
Ga0190270_1047025423300018469SoilMTNVKAIFGLLVAVVVGVVAFGAAQASASSAPIVIPYAKTCNEATGHCVGNAGGVEIDMQVTSFRATGNAAQLTFTEKIKVGAISFTAEMNGHVSPAGFIVLNGRVTEGSFAGAQVHQRSNLVGATATTNSWTGELRIMPASA
Ga0190270_1216593213300018469SoilLVAVVGVVAFGAAQASASSAPIVIPYEKTCNELIGRCVSPAGNAVTLEMQVDPTTFRATGKAVQFTLTEWITVGGISFTAVMNAHRSPAGFIVLNGTVTEGSFAGAQVHQRSNYVSGPATASVWTGELRLMLASA
Ga0190271_1257582213300018481SoilMTNVKAIFGLLVAVVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGSAGAGGTFVMQVTSFRATGSAAQMTATETITVGGISFTAEMNGHVSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGGGTTTTVWTGQLQLMPASA
Ga0066669_1034528513300018482Grasslands SoilMFGRNIYRRLTALIAIVVGVLAFAAAQASASSAPVVIAYHKTCDASGHCFGSEGNGGTIEMWITSFRATGDAAQLTLRETIKSGNISFTAMMNGTMSPAGFIVLNGTVTDGSFAGAQIHQRSNYDSGPVDASVWNGELQLLPASS
Ga0066669_1076998823300018482Grasslands SoilMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCNELAGHCEGTANGVSIVMQVTGFRPTGKAAQLTLTESITVGDISFTAVMKGDRSPAGFIVLNGNVTEGSFAGAQVHQRSNYVGGPATASEWTGELRIVPASA
Ga0173479_1071341813300019362SoilMRLYKSAVFALVAFGAAQASASSAPIVIPYAKTCGPTGHCAGSAGNGGTLEMQVTSLRPSGDAAQLTLTEWITVGGISFTAEMNGTVSPAGFIVLNGTVTAGSYAGAQVHQRSNLVGAAGTTTSWTGQLQLMPATA
Ga0190264_1108002323300019377SoilMKRRFIHLVAVVAAVVGVVAFGAGQASASSAPIVIPYAKTCDETVGHCLGTAGDGGALEMQVTSFRATGNAAQLTFTEEITVGDISFTAEMTGHASPAGFIVLNGTVTEGSFVGAQVHQRSNFVSASPDGSMTSWIGELRLMPKSG
Ga0126371_1045284923300021560Tropical Forest SoilMAGVVGVSAFGATQASASSAPIVISYAKTCDVNTGHCVGTAGNGGTFEMQVTSFTGTGNAGQLTMTESITVGDISFTAELNAHASPSGFIVLNGTVTEGSFAGAQVHQRSNLVGGDATTTDWTGELQIMPASG
Ga0222622_1063613223300022756Groundwater SedimentMTNTKAIFGLLVALVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQITSFRATGKGAHLTLTEWITVGDISFTAEMEGEVSTAGFIVLNGTVTATQGSFVGAQVHQRSNLVGAAGTTTSWTGQLRLMPASA
Ga0209109_1047783913300025160SoilMTNIKTLFGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVADDSDSFAGAQVHQRSNLVGIDGTTTSWTGELRLMPASA
Ga0209108_1032124823300025165SoilVVAAVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGSAGNGGTLEMQVTSFRATGKAAQLTVTEWITVGDISFTAEMSGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVGADATTTAWTGELRLLPASA
Ga0209002_1039283013300025289SoilMTTIKAIFGLLLAAVAAVFGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTFEMQVTSFRATGNAAQLTLTEWITVGDISFTAEMEGDVSPAGFIVLNGTVTEGSFAGAQVHQRSDLVGADGTTTEWTGELRLAPASA
Ga0209431_1011469743300025313SoilMTNLKTIIGLLVAAVAVVGVAAFGAVQASASSAPIVIPYAKTCDETVGKCVGTAGNGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVADDSDSFAGAQVHQRSNLVGIDGTTTSWTGELRLMPASA
Ga0209431_1017277013300025313SoilAQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQITSFRATGKAAQLTLTEWIRVGDISFTAEMSGHVSPAGFIVLNGTVTEGSFAGAQIHQRSNLVGIDGTTTAWTGELQLLPAS
Ga0209640_1005169023300025324SoilMPFHIKPLAVVAAVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGSAGNGGTLEMQVTSFRATGKAAQVTFIEWTTVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDFVGFVGGDATATA
Ga0209341_1008784923300025325SoilMTNLKTIIRLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQITSFRATGKAAQLTLTEWIRVGDISFTAEMSGHVSPAGFIVLNGTVTEGSFAGAQIHQRSNLVGIDGTTTAWTGELQLLPASA
Ga0209341_1020315313300025325SoilMTNIKTLFGLLAAVAAVFGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVGADGTTTEWTGELRLAPASA
Ga0207680_1008266443300025903Switchgrass RhizosphereVRSSDIFPSKFLRAADLNGHAPIVIPYAKTCDLTVGHCVGTAGDGGTIEMQVTSFRPTGADAQLTLTEWVAVGDISFTAKMNGHWSPAGFIVLDGRVTEGSFAGAQVHQRSNYVGGPSTASAWTGKLQLLPASA
Ga0207693_1134736113300025915Corn, Switchgrass And Miscanthus RhizosphereVLGPSPNCVARLVIGLVAVVGVGAFGAAQGSASSAPIVITYAKTCVEATGHCLGTAGSGGTFEMQVTSFRATGKAAQLTVTESITVGAISFTAEMNAHASPSGFIVLNGRVTEGSFAGAQVHQRSNLVGGNATTTDWIGQLQLMPASA
Ga0207687_1034086023300025927Miscanthus RhizosphereMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDQTVGRCVGTAGHGGTIEMQITSFRATGKAAKLTLTEKITVGDIMFTAEMSGNVSPAGFIVLNGTVTKGSFAGARVHQRSNLTSAHETTTEWTGELRLMPASA
Ga0207709_1143213513300025935Miscanthus RhizosphereMTNLKTIIGSLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDQTVGRCVGTAGHGGTIEMQITSFRATGKAAKLTLTEKITVGDIMFTAEMSGNVSPAGFIVLNGTVTKGSFAGARVHQRSNLTSAHETTTEWTGELRLMPASA
Ga0209468_102717223300026306SoilMFGRNIYRRLTALIAIVVGVLAFGAAQASASSAPVVIAYHKTCDASGHCFGSAGNGGTIEMWITSFRATGDAAQLTLRETIKSGNISFTAVMNGTMSPAGFIVLNGTVTDGSFAGAQIHQRSNYDSGPADASVWNGELQLLPASS
Ga0209159_122825813300026343SoilSSIESEEESNESKGAEEAMTNLKTIIGLLVAAVAVVGVVAFGAAQASASSAPIVITYKKTCEAGYCAGTTGDGGTLRMQITSYRVTGNDAQLTLTEWITVGDISFTAEMNGHFSPAGFIVLNGTVTEGSFVGAQVHQRSNLMGGDAITQWTGELQLLPASA
Ga0209177_1043669513300027775Agricultural SoilAFAAAQASASSAPIVISYTKTCQVVNGVLQCSGTAGNGGTIFMQVTGFRSSGHAAQLTLTEWVTAGNISFTAEMSGHQTPDGFIVLNGTVTEGPFTGAEVHQRSNLTGGTAAVSEWSGALQLLPATR
(restricted) Ga0233416_1000834223300027799SedimentMTNSKASFGLFVAVVAAVVGVIAFGAAQASASSAPIVIPYEKTCDEIAGHCVGTAGDGGTIEIQVTSFRANGKAPGFAGRLTLTEWIEVGDISFTAEMNGYASPDGTTLLNGTVTEGSFAGAEVHQRSDFVSASPDGTTTVWRGQLQLMPASA
Ga0256864_117416713300027964SoilMKKLRMLVAAVVGVVAFGAAPASASSAPIVITYAKTCDESIGLCVGSAGTGGTFEMQITGFRATGKAAQLTVTEDITVGDISFTAEMSGHVSPAGFIVLNGTVTEGSFAGAQVHQRSDFDGLVGGDATKTKWIGELSLMPESG
(restricted) Ga0233418_1029205313300027995SedimentMTNSKASFGLFVAVVAAVVGVIAFGAAQASASSAPIVIPYEKTCDEIAGHCVGTAGDGGTIEIQVTSFRANGKAPGFAGRLTLTEWIEVGDISFTAEMNGYASPDGTTLLNGTVTEGSFAGAEVHQRSDFVSASPDGTTTVWRGQLQL
(restricted) Ga0233417_1040266913300028043SedimentVAVVAAVVGVIAFGAAQASASSAPIVIPYEKTCDEIAGHCVGPAGDGGTIEIQVTSFRANGKAPGFAGRLTLTEWIEVGDISFTAEMNGYASPDGTTLLNGTVTEGSFAGAEVHQRSDFVSASPDGTTTVWRGQLQLMPASA
Ga0247822_1115006713300028592SoilMKKLLVVAALVVGVVAFGAVQASASSAPSVIPYAKTCSAGHCLGTAGDGGSIEMRVTSFVATGKAAQLTLTEWITVGDISFTAEMSGTVSPAGFIVLNGTVTRGPFAGAQIHQRSNLVGGNATTTAWTGELRLMPASA
Ga0307298_1006747533300028717SoilVKRNFMLLVAVVGVVAFGAAQASASSAPSVIPYAKTCDETVGHCKGTAGNVTIEMQITGFRATGDAYQLTLTEWITVGDISFTAVMNGHRSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGPVTASEWTGELRLMPASA
Ga0247825_1053470713300028812SoilMANIKTLFGLLVAVAAVVGVVAFGVAQASASSAPIVISYAKTCAVGQCVGTAGDGGTLEMQVTGFRATGKAGQLTATERITVGGDSFTAEMIGHVSPDGFIVLNGTVTDTEGSFDALAGAQVHQRSNLASAAGTTTAWTGELRLMPASA
Ga0307278_1000361623300028878SoilMTNIKTIFGLLAAVVATVVGVVAFGAAQASASSTPIVITYAKTCNEATGQCVGTAGDGGTLEMQITSFRATGSAAQLTLTEKIRVEDISFTAEMKGLVSPAGFIVLNGTVTDGSFVGAQVHQRSNFVSGPATASVWTGELQLLPASA
Ga0307278_1001403133300028878SoilMKKLLVLVATAGVVAFGAAQASASNAPIVIPYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTLTEWITVGDISFTAKMNGHVSPDGFILLNGTVTEGSFAGAQVHQRSDLVGIAGTTTSWTGKLQLVPVSA
Ga0307300_1013700813300028880SoilVKRNFIVLVAVVGVVAFGAAQASASSAPSVIPYAKTCDETVGHCKGTAGNVTIEMQITGFRATGDAYQLTLTEWITVGDISFTAVMNGHRSPAGFIVLNGTVTEGSFAGAQVHQRSNLVGGPVTASEWTGELRLMPASA
Ga0299913_1077836323300031229SoilMKKLRMLLAVVVGVMPVGAAPASASSEPIVITYEKTCNELTGHCVGSTGDGGTLEMQVTGLRPTGRAAQLTMNEEVVGDVSFRAEMRAHFSPAGFIVLNGTVTEGSFLGAQIHQRSNLVGVSADGTMTSWIGELRLMPKSS
Ga0247727_1018316153300031576BiofilmMRRLKTSRRFAGLIAVFVVAAVAAQASASSAPIVIPYEKTCDETVGHCLGSAGNGGTFEMQVTGFQGTGKAAQLTATVWITVGNISFTAEMSGHASPAGFIVLNGTVTEGSFAGAQVHQRSNFDGLVGGDQTKTHWIGELQLMPASA
Ga0307468_10090128113300031740Hardwood Forest SoilMKKLLVLAVTLVGVVAFGTAQASASSAPIVIPYAKTCAAGHCMGTAGNGGSIEMRITSFVATGKAAQLTLTEWITVGGISFTAELSGVVSPAGFIVLNGRVTSGSFAGAEVHQRSNLVGGNAGTTAWTGELRIMPASA
Ga0307468_10180567213300031740Hardwood Forest SoilMKKLFVLAVTLTVAVVVGLVAFGAAQASASSAPIVITYTKTCEPAKGHCAGTTGDGAKLEMQITDFRATGKAAQLTLTESITVRGAVWFTAVMEGHVSPAGFIVLNGTVTAGPYAGAQVHQRSNLMGGDATTTDWIGELRIMPASA
Ga0308175_10000232273300031938SoilMKRTLALLAVLGVVAFAAAQASASSAPIVISYHKTCDLTVGHCVGTAGNGGTIVMQVTSLRGSGDAAQLTLTEWITVGNLSFTAQMNGHQSPDGFIVLNGTVTDGSFLGAQIHQRSNYMGGPLTASDWSGQLHLVPASA
Ga0308174_1065688233300031939SoilMKRTFVLMAVLGIVAFAAAQASASSTPIVISYEKTCDLTVGHCVGTAGNGGTIEMQVTSLRDSGDAAQLTLTEWITIGNVSFTAEMKGSQSPAGFIVLNGTVTEGAFFGAQVHQRSNFLGGALTASQWSGELQLVPAS
Ga0214473_1016222733300031949SoilMTNIKTLFGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCSGTTGDGRTFEMQITSFRATGKAAQLTATVTVGNISENNSFTAEMKGHVSPAGFIVLNGTVTEGSFAGAQVHQRSDFDGFVGGNENMTAWVGELRLVPASA
Ga0326597_1009671543300031965SoilMTNTKAIFGLLAAVVAVVGGVAFGAAQASASSAPIVISYEKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTFTEWITDGDISFTAEMNGHVSPAGFIVLNGRVTEGSFVGAQIHQRSNLVGIDGTTTAWTGELQLMPASG
Ga0326597_1034961133300031965SoilMKKLRMLVAAVVGVVAFGAAPASASSAPIVIPYEKTCDETVGHCVGTAGDGGTIEMQVTSFRATGKAAQLTFTEEITVGDISFTAEVSGHVSPAGFIVMNGTVTEGSFAGAQVHQRSNFVGIDGTTSSWIGELRLNAQSG
Ga0308176_1129488933300031996SoilMKRTFVLMAVLGIVAFAAAQASASSTPIVISYEKTCDLTVGHCVGTAGNGGTIEMQVTSLRDSGDAAQLTLTEWITIGNVSFTAEMKGSQSPAGFIVLNGTVTEGAFFGAQVHQRS
Ga0310890_1184543713300032075SoilNIKTLFGLLVAVAAVVGVVAFGVAQASASSAPIVISYAKTCAVGQCVGTAGDGGTLEMQVTGFRATGKAGQLTATERITVGGDSFTAEMIGHVSPDGFIVLNGTVTDTEGSFDALAGAQVHQRSNLASAAGTTTAWTGELRLMPASA
Ga0310895_1073373113300032122SoilMKRLLVLVATVVGVVAYGAAQASASSAPIVIPYAKTCGAGHCLGTAGAGGTIEMQITSFRATGDAAQLTLTEWITVGDISFTAEMSGTVSPAGFIVLNGTVTRGSFAGAEIHQRSNLVGGNATTTAWTGELRLMPATA
Ga0214472_10000855403300033407SoilMTNIKTLFGLLVAVVAAVLGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVGIDGTTTTWTGELRLMPASG
Ga0214472_1006192343300033407SoilMTNIKAIFGLLAAVVAVVGGVAFGAAQASASSAPIVISYEKTCDETVGHCVGTAGDGGTLEMQVTSFRATGNAAQLTFTEWITDGDISFTAEMNGHVSPAGFIVLNGRVTEGSFVGAQIHQRSNLVGIDGTTTAWTGELQLMPASG
Ga0214471_1075330613300033417SoilMTNIKTLFGLLVAVVAAVLGVVAFGAAQASASSAPIVISYAKTCDETVGHCVGTAGDGGTLEMQVTSFRATGKAAQLTFTEWITVGDISFTAEMSGHVSPAGFIVLNGTVAEDSDSFAGAQIHQRSNLVGIDGTTTTWI
Ga0364930_0145117_355_7983300033814SedimentMTNIKTLFGLLVAVVAAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCVGSAGDGGTLEMQITSFRATGKGAQLTLTEWIRVGDISFTAEMSGHVSPAGFIVLNGTVTEGSFVGAQVHQRSDLVGIDGTTTAWTGELRLMPASA
Ga0364935_0262736_61_4803300034151SedimentVKRNFILLVAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCTGNAGGVELDMQVTSFRATGNAAQLTFTEKIKVGAISFTAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVGINGTTTSWTGELRLTPASA
Ga0364942_0011535_1975_24153300034165SedimentMTNLKTIIRLLVAAVAVVGVVAFGAAQASASSAPIVIPYAKTCDETVGHCSGTTGDGGTLEMQVTSFRATGNAAQLTFTEWITVGDISFRAEMNGHASPAGFIVLNGTVTEGSFAGAQVHQRSDLVGIDGTTTAWTGELQLLPASA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice