| Basic Information | |
|---|---|
| Family ID | F013430 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 271 |
| Average Sequence Length | 64 residues |
| Representative Sequence | MRRLTQVALLGVLVLVVASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYHT |
| Number of Associated Samples | 154 |
| Number of Associated Scaffolds | 271 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 92.28 % |
| % of genes near scaffold ends (potentially truncated) | 88.93 % |
| % of genes from short scaffolds (< 2000 bps) | 85.61 % |
| Associated GOLD sequencing projects | 146 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.44 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (86.716 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (33.948 % of family members) |
| Environment Ontology (ENVO) | Unclassified (35.055 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Unclassified (39.483 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | Yes | Secondary Structure distribution: | α-helix: 50.59% β-sheet: 0.00% Coil/Unstructured: 49.41% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.44 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 271 Family Scaffolds |
|---|---|---|
| PF05685 | Uma2 | 2.58 |
| PF01521 | Fe-S_biosyn | 1.85 |
| PF00486 | Trans_reg_C | 1.11 |
| PF02580 | Tyr_Deacylase | 1.11 |
| PF13191 | AAA_16 | 0.74 |
| PF00472 | RF-1 | 0.37 |
| PF08359 | TetR_C_4 | 0.37 |
| PF03992 | ABM | 0.37 |
| PF02518 | HATPase_c | 0.37 |
| COG ID | Name | Functional Category | % Frequency in 271 Family Scaffolds |
|---|---|---|---|
| COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 2.58 |
| COG0316 | Fe-S cluster assembly iron-binding protein IscA | Posttranslational modification, protein turnover, chaperones [O] | 1.85 |
| COG4841 | Uncharacterized conserved protein YneR, related to HesB/YadR/YfhF family | Function unknown [S] | 1.85 |
| COG1490 | D-aminoacyl-tRNA deacylase | Translation, ribosomal structure and biogenesis [J] | 1.11 |
| COG0216 | Protein chain release factor RF1 | Translation, ribosomal structure and biogenesis [J] | 0.37 |
| COG1186 | Protein chain release factor PrfB | Translation, ribosomal structure and biogenesis [J] | 0.37 |
| COG1309 | DNA-binding protein, AcrR family, includes nucleoid occlusion protein SlmA | Transcription [K] | 0.37 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 86.72 % |
| All Organisms | root | All Organisms | 13.28 % |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 33.95% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 13.65% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 9.96% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.12% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 5.17% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.06% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.06% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 2.95% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.58% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.85% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.85% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.85% |
| Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 1.85% |
| Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 1.11% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.11% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.11% |
| Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 0.74% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 0.74% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.74% |
| Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 0.37% |
| Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.37% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.37% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.37% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.37% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.37% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.37% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300003678 | Groundwater microbial communities from S. Glens Falls, New York, USA - water-only treatment rep 1 (Metagenome Metatranscriptome, Counting Only) | Environmental | Open in IMG/M |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
| 3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental | Open in IMG/M |
| 3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental | Open in IMG/M |
| 3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental | Open in IMG/M |
| 3300005842 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 | Host-Associated | Open in IMG/M |
| 3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated | Open in IMG/M |
| 3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated | Open in IMG/M |
| 3300006194 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 | Host-Associated | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006846 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 | Host-Associated | Open in IMG/M |
| 3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated | Open in IMG/M |
| 3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated | Open in IMG/M |
| 3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental | Open in IMG/M |
| 3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated | Open in IMG/M |
| 3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental | Open in IMG/M |
| 3300009777 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water | Environmental | Open in IMG/M |
| 3300009811 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 | Environmental | Open in IMG/M |
| 3300009817 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 | Environmental | Open in IMG/M |
| 3300009821 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 | Environmental | Open in IMG/M |
| 3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010065 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010087 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010088 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010100 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010101 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010103 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010106 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010119 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010121 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010134 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010140 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010141 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010142 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300010905 | Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300011444 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2 | Environmental | Open in IMG/M |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
| 3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012212 | Combined assembly of Hopland grassland soil | Host-Associated | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
| 3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012371 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012373 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012381 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_24_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012382 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012384 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012390 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012400 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012407 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012409 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated | Open in IMG/M |
| 3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated | Open in IMG/M |
| 3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300017792 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG | Host-Associated | Open in IMG/M |
| 3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental | Open in IMG/M |
| 3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental | Open in IMG/M |
| 3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental | Open in IMG/M |
| 3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental | Open in IMG/M |
| 3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental | Open in IMG/M |
| 3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental | Open in IMG/M |
| 3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental | Open in IMG/M |
| 3300019208 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT231_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019212 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT25_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019228 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019229 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT660_1_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019232 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT530_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019233 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019238 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT466_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019254 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019255 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019263 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019269 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019279 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300020065 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300020068 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300022195 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
| 3300023208 (restricted) | Freshwater microbial communities from Lake Towuti, South Sulawesi, Indonesia - Watercolumn_Towuti2014_125_MG | Environmental | Open in IMG/M |
| 3300025149 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes) | Environmental | Open in IMG/M |
| 3300025173 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes) | Environmental | Open in IMG/M |
| 3300025961 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental | Open in IMG/M |
| 3300027511 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes) | Environmental | Open in IMG/M |
| 3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300030563 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Dnb6 (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030576 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Cb9 (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030592 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Ab1 (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030628 | Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Bnb6 (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030829 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_357 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030830 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_368 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030902 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_356 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030903 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030904 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_202 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030986 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_143 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030987 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_144 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030988 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_157 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030990 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300030993 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_185 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031058 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_184 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031081 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_159 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031092 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031093 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031094 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031095 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_158 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031096 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_194 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031098 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_186 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031099 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_152 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031124 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_140 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031125 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_153 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031422 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_181 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031424 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_150 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental | Open in IMG/M |
| 3300034447 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_119 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034643 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_120 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034660 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R2 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034661 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R3 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034662 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R4 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034667 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R1 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034670 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R4 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034675 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R1 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034676 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R2 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034677 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R3 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034678 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48R4 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034680 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_116 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300034681 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_121 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| Ga0003067J53075_1180462 | 3300003678 | Groundwater | MRRCIQALVVVVVLLALGSLALAQQTPQPVVRMGNWIEVG |
| Ga0066675_111717212 | 3300005187 | Soil | MRRLTQVALLGVLVLAMASLATAQQVPQPVVRVGNFIEVGNDVFMHIMASADIRYHTTTNFDFDS |
| Ga0065705_109754961 | 3300005294 | Switchgrass Rhizosphere | MKRLTQIALVGIFLLAVVSIAGAQQALQPVYRLGNFLEVGNDVFMHIIATVDARYVTVQNRDFEQNVRDRTNS |
| Ga0066388_1005185851 | 3300005332 | Tropical Forest Soil | MKRLPQIALVGIFLLAVVSIAGAQQALQPVYRLGNFVEVGNDVFMHFIA |
| Ga0066682_109082541 | 3300005450 | Soil | MRERLAQVALIGGLILVVASMATAQQVPQPVVRTGNFIEVGNDVFMHIIASADIRYKTAHNYDFDDKVRDR |
| Ga0070686_1007799591 | 3300005544 | Switchgrass Rhizosphere | MEDIMMRRMTWGTLLAGLLLVCASLAGAQQVPQPAVRLGNAWEVSNDVFMKIIATADIRYKTVENYDFENRVR |
| Ga0066670_103990342 | 3300005560 | Soil | MQRLLQIAMVGVLLLAGLSIATAQQEPQPVVRLGNFIEVGNDVWMHILATGDIRYRTTENWDFE |
| Ga0068858_1014693602 | 3300005842 | Switchgrass Rhizosphere | MPKHLTQVVLVGVIVLVVAVMAAAQQPPQPVVRLGNFLEVGNDIFMHILASADIRYRTTQNWDFENKVRDRPASRNPSSTSVHEGDSDMSYAE |
| Ga0068860_1022228201 | 3300005843 | Switchgrass Rhizosphere | MPKHLTQVVLVGVIVLVVAVMAAAQQPPQPVVRLGNFLEVGNDIFMHILASADIRYRTTQNWDFENKVRDRPASRNPSSTSVHEG |
| Ga0081455_1000019576 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MKRLTQIAMVGIFLLAVVSIAGAQQELQPVYRLGNFLEVGNDVFMHIIATTDIRYTTVQ |
| Ga0081455_100101261 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MKRLTQIAMVGIFLLAVVSIAGAQQELQPVYRLGNFLEVGNDVFMHIIATTDI |
| Ga0081455_100249871 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MRQLTQGALLGMLLLGVVSLAAAQQVSQPVVRLGNFLEVGNDVFMHI |
| Ga0081455_108452581 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MKRLTQIAMVSIFLLAVVSIAGAQQALQPVYRLGNFLEVGNDLFMHIIATTDIRYTTVQNRDFEQNVRD |
| Ga0081455_109534341 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MMRRLTQLVLMAVVVLGVATLAAAQQELRPVVRLGNFIEVGNDVFMHII |
| Ga0075427_100957091 | 3300006194 | Populus Rhizosphere | MMRRLTQLVLIGVVVLGVAALAAAQQEPQPVVRLGNFIEVGNDVFMHIIGAADIRYKTTENYDFEGQVRD |
| Ga0066659_118511821 | 3300006797 | Soil | MRGCLVQVALLGVLGLAVASVATAQQVPQPMVRLGNFFGVGNDVYMHIMATADIRYKTVHNWDFDDKV |
| Ga0075430_1004178941 | 3300006846 | Populus Rhizosphere | MLRRWTQGALLGVLLLGVVSWTAAQQASQPAIRLGNFLEVGNDVFMHII |
| Ga0075431_1009753831 | 3300006847 | Populus Rhizosphere | MLKRLTQVVLVSVIVLAVAALAAAQQTVQPVVRLGNFIEVGNDVFMHIIGTADIRYRTVHNWDFENSVRDRPGSRSPGNTTVHE |
| Ga0075419_112278301 | 3300006969 | Populus Rhizosphere | MTRLTQVALVGVAVLVGASIAAAQQGLQPVVRLGNFIEVSNDVFMHMIAAADIRYMTVENRDFESNVRDRAHAREPGSAPTR |
| Ga0099791_103929321 | 3300007255 | Vadose Zone Soil | MRGRLAQVALIGGLVFAVASLATAQQGPQPMVRLGNWIEVGNDVFMHIMASADIRYKTVHNRDFEDKVRDRTPDRSPGNT |
| Ga0066710_1023481351 | 3300009012 | Grasslands Soil | MGRLTQIALMGVLVLMVAALATAQQVPEPMVRLGNFIEVGNDVFMHIMAAIDTRYRTTENYDFDSKVRDRVSSRFP |
| Ga0099830_104619712 | 3300009088 | Vadose Zone Soil | MKRLTQIALVVVLLLAMMSIATAQQAPQPVVRIGNFIEVGNDVFMHLIAATESRYLTMENRDFEKH |
| Ga0099827_112422731 | 3300009090 | Vadose Zone Soil | MRRLTQGALLGALLLGVVSIAAAQRVPQPVVRLGNFIEVSNDVFMHIIGSADIRYKAISGKAYTC* |
| Ga0111539_114457221 | 3300009094 | Populus Rhizosphere | MRRLTQGAVLGVGLGMASLVAAQQVQQPVVRLGNFLEVSNDLFMHIIGSADIRYKTVENLDFENRVRDRVNSRFPG |
| Ga0114129_132095572 | 3300009147 | Populus Rhizosphere | MRRLTQGAGIGVLVLVIASLATAQQAPEPVVRFGNYMEVANELFMHIIATTEMHYN* |
| Ga0114945_110601081 | 3300009444 | Thermal Springs | MKRLTQLALVSVLLLAVMSIATAQQVPQPVVRLGNFIEVANDLFMHIIATSDIRYAT |
| Ga0105249_100437495 | 3300009553 | Switchgrass Rhizosphere | MMRRLTQLVLIVVVVLGAATLATAQQALQPVARLGNFIEVGNDVFMHIISSADIRYKTVQNFDFEQNVRDRTSTRSPSST |
| Ga0105249_100687991 | 3300009553 | Switchgrass Rhizosphere | MKRLTQIALMGIFLLAVVSIAGAQQALQPVYRLGNFLEVGNDVFMHIIATIDARYVTVQNRDFEQNVRDRPN |
| Ga0114944_12394243 | 3300009691 | Thermal Springs | MRRLTQGALVGVLVLVLVSIATAQQTPQPVVRLGNWIEVGNEVFMHIIATADIRYK |
| Ga0105164_102130792 | 3300009777 | Wastewater | MKRLTQIAMMGVLLLAVVSIAAAQQVPQPMVRLGNFFEVGNDVFMHIQATADIRYHTTDNYDFESQV |
| Ga0105084_10745581 | 3300009811 | Groundwater Sand | MMRRLAQVALMGILELVVAAVATAQQVPQPMVRTGNFIEVGNDIFMHIMASADIRYKTVHNDDFDDKVRD |
| Ga0105062_11379582 | 3300009817 | Groundwater Sand | MRRLTQVALLGVLVLGIASLATAQQVPQPVVRLGNFFEVGNDLFLHIIATGDIRYRTT |
| Ga0105064_10830823 | 3300009821 | Groundwater Sand | MGVLVLGIASLATAQQVPQPVVRLGNFFEVGNDVFMHIIATGDLRYRT |
| Ga0126384_114811292 | 3300010046 | Tropical Forest Soil | MVGTFLLAVVSIAGAQQTLQPVYRLGNFLEVDNDVFMHFIATVDARYVTVQ |
| Ga0126384_119994061 | 3300010046 | Tropical Forest Soil | MKHLTQIAIVGVLLLAVVSIAGAQQALQPVYRLGNFIEVGNDVFMHLIATVEAR |
| Ga0126382_100527855 | 3300010047 | Tropical Forest Soil | MRRLTRGAVLGVVLGMASLAAAQQVQQPVVRLGNFLEVSNDVFMHIIGVADIRYKTVEN |
| Ga0127435_1438471 | 3300010065 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMENRDFEKHVRDRPNSRFPSDTALES |
| Ga0127492_10633451 | 3300010087 | Grasslands Soil | MTRRFVQIALVCVVMLGVASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFESGV |
| Ga0127476_10441281 | 3300010088 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMENRDFEKHVRDRPNSRFPSDT |
| Ga0127440_11235561 | 3300010100 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIIATNDFRYNTTTNFDFERKVR |
| Ga0127481_10939391 | 3300010101 | Grasslands Soil | MTRRLAQITLLGVLVLAVASLAAAQQEPQPVVRLGNYIEVGNDVWMHI |
| Ga0127500_11077491 | 3300010103 | Grasslands Soil | MTRRFVQIALVCIVMLGMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFESGVRDRTSSRS |
| Ga0127472_10868511 | 3300010106 | Grasslands Soil | MEDTTMRRLTQVALLGVLVLAMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTTNWDFDSRVRDRTPSRFPGD |
| Ga0127452_11118782 | 3300010119 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMENRDF |
| Ga0127438_10654211 | 3300010121 | Grasslands Soil | VLVLVLASLAAAQQEPQPVVRLGNFLEVGNDVWMHILATGDIRYRTTENWDFENRVRDRVNA |
| Ga0127484_11403571 | 3300010134 | Grasslands Soil | MTRRLAQIALMGVLMLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIR |
| Ga0127456_10849081 | 3300010140 | Grasslands Soil | VLVLAAASLATAQQEPQPVVRLGNFIEVGNDVWMHILATADIRYRTTENWDFEN |
| Ga0127499_11137721 | 3300010141 | Grasslands Soil | MTRRFVQIALVCIVMLGMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMAQADIRYRTTENYDFD |
| Ga0127483_11310781 | 3300010142 | Grasslands Soil | MQRVTQLGLVGVLVLAVASLATAQQAPQPVVRLGNFMEVGNDVFMHLIAATEMHYNTVENSDFEANVRDRVT |
| Ga0126370_108970031 | 3300010358 | Tropical Forest Soil | MGVLVLVGASLAAAQQVPQPMVRLGDFIEVGNDVFMHIMTSADIRYRTVENYDFDNNVRDRVASRSPSS |
| Ga0126372_118923271 | 3300010360 | Tropical Forest Soil | MKRLLQIALMSVLLLAGMSIATAQQAPQPVVRLGNFIEVGNDVWMHILATTDFRYQAVHNWDFETQVRDRPPDRNPQSTTD |
| Ga0126377_118812661 | 3300010362 | Tropical Forest Soil | MKRLIQIALVGVVLLAGVSIAGAQETPTLQPVMRLGNFIEVGNDVFM |
| Ga0126377_120427771 | 3300010362 | Tropical Forest Soil | MKRLAQIAMVGVFLLAVVPIAGAQQELQPVVRLGNFMEVGNDVFMHIIATIDTRYIT |
| Ga0126383_131823441 | 3300010398 | Tropical Forest Soil | MEDSTMKRLTQVALFGVLVLAAASLATAQQVLEPVNRLGNFIEVGNDVFMHIIATIDFRLRSAQNYDWDSAV |
| Ga0138112_10837061 | 3300010905 | Grasslands Soil | MGVLVLVVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYHTTHNWDFESKVRDRTAG |
| Ga0137463_12979661 | 3300011444 | Soil | MRRLTQVALVGVVLLAVASIAAAQQVPQPVVRLGNFIEVGNDVFMHIIATIDTRITT |
| Ga0137389_112192533 | 3300012096 | Vadose Zone Soil | TEGKTMKRLTQIALVGILLLAVMSIATAQQVPQPGVRLGNFIEVGNDVWMHILATGDFRFQTVNNFDF* |
| Ga0137389_117797481 | 3300012096 | Vadose Zone Soil | MKRLTQIGLVGVLVLAMLSIATAQQAPQPVVRLGNFMEVGNDVFMHIIATSNIYYTTVENRDFEKHVRDRPTSRFPG |
| Ga0137365_103980682 | 3300012201 | Vadose Zone Soil | MTRRFVQIALVCVVMLGVASMAAAQQAPQPMVRLGNFIEVGNDVFMHIMAQADIRYRTTTNWD |
| Ga0137380_105464411 | 3300012206 | Vadose Zone Soil | MTRRFVQIALVCVVMLGVASMAAAQQAPQPMVRLGNFIEVGNDVFMHIMAQADI |
| Ga0137377_116936101 | 3300012211 | Vadose Zone Soil | MEDTTMRRLTQVALLNVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMAQADIRYRTTTN |
| Ga0150985_1158224181 | 3300012212 | Avena Fatua Rhizosphere | MVMLAMASLATAQQAPQPMVRLGNFIEVGNDVFMHIMAAADIRYRTTTNYDFDSGVRDRVSSRSPSSTAV |
| Ga0150985_1187667571 | 3300012212 | Avena Fatua Rhizosphere | MKRLTQIALVGIFLLAVVSIAGAQQALQPVYRLGNFLEVGNDVFMHIIATVDARYVTVQNRDFE |
| Ga0137387_103261712 | 3300012349 | Vadose Zone Soil | MKRLTQITLVGVLLLAMMSIATAQQAPQPVVRLGNFMEVANDVFMHL |
| Ga0137387_107274631 | 3300012349 | Vadose Zone Soil | MRGRLAQVALIGGLVFAVASLATAQQGPQPMVRLGNWIEVGNEVFMHIM |
| Ga0137369_104724091 | 3300012355 | Vadose Zone Soil | MGVLVLGIASLATAQQLPQPMVRLGNFIEVGNDVFMKIMASADIRYHTTENFDF |
| Ga0137385_108026081 | 3300012359 | Vadose Zone Soil | MKRLTQIVLVAVLLLAMMSIATAQQAPQPVVRIGNFVEVGNDVFMHIIAATESRYLT |
| Ga0137385_116193711 | 3300012359 | Vadose Zone Soil | MGVLVLAAASLAGAQQMPQPMVRLGDFIEVGNDVFMHIMASADIRYKTIENDDFEARVRD |
| Ga0137360_101722871 | 3300012361 | Vadose Zone Soil | MKRLTQIALVVVLLLAMMSIATAQQAPQPVVRLGNFIEVGNDVFMHIIAATESRY |
| Ga0137360_111689012 | 3300012361 | Vadose Zone Soil | MEDTTMRRLTQVALLGVLVLAMASLATAQQVPQPVVRVGNFIEVGNDVFMHIMASADIRYHTTENFDFDSRVRDRPGGRFPDDGTQQD |
| Ga0137361_118816451 | 3300012362 | Vadose Zone Soil | MGVLALAVASASTAQQVPQPVVRTGNFIEVGNDVFMHMIASADIRYKTVHNYDFDDKVCDRTPDRSPSSTGS |
| Ga0134022_11943121 | 3300012371 | Grasslands Soil | MTRRLAQIALLGVLVLAVASLAAAQQEPQPVVRLGNYIEVGNDVWMHILATGDFRYRT |
| Ga0134042_12110201 | 3300012373 | Grasslands Soil | MTRRFVQIALVCIVMLGMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMAQADIRYRTTENYD |
| Ga0134026_11387751 | 3300012381 | Grasslands Soil | MRRLTQVALIGVLVLVLASLAAAQQEPQPVVRLGNFIEVGNDVWMHILATGDFRYRTTENWDFENRVRDRVNQRNPS |
| Ga0134026_11776051 | 3300012381 | Grasslands Soil | MTRRLAQIALLGVLVLAVASLAAAQQEPQPVVRLGNYIEVGNDVW |
| Ga0134026_12332582 | 3300012381 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMENRDFEKHVRD |
| Ga0134038_10067781 | 3300012382 | Grasslands Soil | MTRRFVQIALVCIVVLGMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTT |
| Ga0134038_10490462 | 3300012382 | Grasslands Soil | LTQIALIGVLVLAAASLATAQQVPQPMVRLGNFIEVGNDVFMHIMAAADIRYQTV |
| Ga0134036_10496942 | 3300012384 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMENRDFEKHVRDRPNSRFPSDTALESTDFD |
| Ga0134054_11446141 | 3300012390 | Grasslands Soil | MGVLVLAAASLATAQQAPQPVVRLGNFIEVGNDVFMHIIAASDIRYVTVE |
| Ga0134057_12975521 | 3300012396 | Grasslands Soil | MEDTTMGRLTQVALLGVLVLAMASLATAQQVPQPVVRVGNFIEVGNDVFMHIMASADIRY |
| Ga0134048_13986921 | 3300012400 | Grasslands Soil | MTRRLAQIALLGVLVLAVASLATAQQAPQPVVRLGNFMEVGNDVFMHLIAATEMHYNTVENSDFEANVRDR |
| Ga0134050_12132441 | 3300012407 | Grasslands Soil | MQRVTQLGLVGVLVLAVASLATAQQAPQPVVRLGNFMEVGNDVFMHLIATSNIYYTTVQNRDFEANVRDRVLSR |
| Ga0134045_13477382 | 3300012409 | Grasslands Soil | VGVLVLAAVSLATAQQALQPVARIGNFIEVGNDVFMHIIAHSESRYLTMENRDFEKHVRD |
| Ga0150984_1054265342 | 3300012469 | Avena Fatua Rhizosphere | MGVIVLGVASLAAAQQTMQPVVRLGNFTEVANDLFMHIIGSIDTRYITVENRDFEQNVRDRPNSRFPTD |
| Ga0150984_1076544281 | 3300012469 | Avena Fatua Rhizosphere | MKQLTQGALMGVLVLGIMSLVTAQQVPQPMVRLGNFLEVSNDVFLHMMASADIRYHTTHNWDFQDKVRDRPAGRFPDDA |
| Ga0150984_1114959982 | 3300012469 | Avena Fatua Rhizosphere | MTRRLAQIALLGVLVLAVASLAAAQQEPQPVVRLGNYI* |
| Ga0150984_1130678901 | 3300012469 | Avena Fatua Rhizosphere | MGVVVLAVASMAAAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYRTTQNYDFDSNVRDRVNSRFPGDTV |
| Ga0150984_1141800821 | 3300012469 | Avena Fatua Rhizosphere | MKRLTQVALMGVLVLAVAAIATAQQEPQPVVRLGNFIEVGNDVFMRIIATTEMRYATVENRDFEHRVRDRVSSRTTGSTAAMRS |
| Ga0150984_1188789161 | 3300012469 | Avena Fatua Rhizosphere | MEDTTMRRLTQVALLSVLVLAMASLATAQQVPQPMVRLGDFIEVGNDVFMHI |
| Ga0150984_1210189141 | 3300012469 | Avena Fatua Rhizosphere | MGVLVLAASLATAQQEPQPVVRLGNFIEVGNDVWMHILATGDIRYRTTENWDFERRVRDRVNSRNPSDSVPQAG |
| Ga0150984_1217635971 | 3300012469 | Avena Fatua Rhizosphere | MKRRLAQITLMGVLVLAVASIAAAQQEPQPVVRLGNYIEVGNDVWMHILATGDFRYRTVENYDFERRVRDRTP |
| Ga0137373_108449472 | 3300012532 | Vadose Zone Soil | MEDTTMRRLTQVALLGVLVLAMASLATAQQVPQPVVRLGNFIEVGNDVFMKIMASADIRYHTTENFNFDIRVCGQPGGRYLDN |
| Ga0137397_110804281 | 3300012685 | Vadose Zone Soil | MTRRFVQIALVCVVMLGVASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFESGVR |
| Ga0137404_114248192 | 3300012929 | Vadose Zone Soil | MIGRLAQVAMMGVLVLAVASVATAQQGPQPMVRLGNFIEVGNDVFMHIIASADIRYKTVTNYD |
| Ga0126375_108051811 | 3300012948 | Tropical Forest Soil | MKRLTQVALVGVIILAVASLVAAQQALQPVYRLGNFIEVGNDVFMHIIASTDIRYNTV |
| Ga0126375_114254841 | 3300012948 | Tropical Forest Soil | MKRLTQIALVGIFLLAVVSIAGAQQALQPVYRLGNFVEVGNDVFMHFIATSESRFVTTQNRDF |
| Ga0126375_118149211 | 3300012948 | Tropical Forest Soil | MMRRLTQLVLIVLVVLGGATLATAQQAMQPVARLGNFIEVGNDVFMHIIGSADIRYKTVQNFDFEQNVRDR |
| Ga0163162_119211411 | 3300013306 | Switchgrass Rhizosphere | MKRLTQVALVGVVILAVLAVASLATAQQAMQPVYRLGNFIEVGNDVFMHIIASIDA |
| Ga0137403_111921762 | 3300015264 | Vadose Zone Soil | MIGRLAQVAMMGVLVLAVASVATAQQGPQPMVRLGNFIEVGNDVFMHIMASA |
| Ga0163161_103587702 | 3300017792 | Switchgrass Rhizosphere | MRRLTQVALVGVVILAVASLAAAQQALQPVYRLGNFLEVGNDVFMHFIATSDIRYMTVQNRDFEQNVRDRT |
| Ga0163161_103836562 | 3300017792 | Switchgrass Rhizosphere | MLRRWTQGALLGVLLLGVVSLAAAQQVSQPVVRLGNFLEVGNDVFMHIIGSADIRYKTVQNFDFENRVRDRTNSRSPSDTANARERQ |
| Ga0184626_102803962 | 3300018053 | Groundwater Sediment | MIRRLAQIALMGVVVLAVASMAAAQQVPQPVARLGNFIEVGNDVFMHIIASIDTRYITVENRDFE |
| Ga0184623_100814851 | 3300018056 | Groundwater Sediment | MRRLTQVALVGVVLLAVASIAAAQQVPQPVVRLGNFLEVGNDVWMHIIATTDVRYI |
| Ga0184637_102654202 | 3300018063 | Groundwater Sediment | MRRLTQLVLIGVVVLVVASLAAAQQVPQPVVRLGNFIEVGND |
| Ga0184637_104119551 | 3300018063 | Groundwater Sediment | MKRLTQVALVGVVFLVGMSIAAAQQVPQPVVRLGNFIEVGNDLFMHIIATSDIRYKTVHNLDFEDRIR |
| Ga0184637_105909421 | 3300018063 | Groundwater Sediment | MRRLAQVGLMAVLVLVVASLATAQQVPQPVTRLGNFIEVGNDVWMH |
| Ga0184637_107297702 | 3300018063 | Groundwater Sediment | MKRLAQVALMGILVLTASLATAQQVPQPIVRLGDFIEVGNDVFMHIMASSDIRYKTVENDDF |
| Ga0184640_102994411 | 3300018074 | Groundwater Sediment | MKRLAPIALVGVLLLAMMSIATAQQVPQPVVRLGNFLEVGNDVWMHIIASIDARYTTVENRDFEKR |
| Ga0184609_105228631 | 3300018076 | Groundwater Sediment | MRRLTQVALMGVLVLGIASLATAQQVPQPVVRLGNFIEVGNDVFMKIMATADIRYHTTENYDFDSRVRERVSGREPD |
| Ga0184627_100286604 | 3300018079 | Groundwater Sediment | MIKRLAQITLMGVVILAVASMTAAQQVPQPVVRLGNFLEVGNDVWMHIIATTNIYYTTVHNRDFEGQVRDRTLSRFIND |
| Ga0184627_101237102 | 3300018079 | Groundwater Sediment | MKRLTQVALVGVVFLVGMSIAAAQQVPQPVVRLGNYIEVGNDLFMHIIATADMRYKTVHNLDFED |
| Ga0184627_101384012 | 3300018079 | Groundwater Sediment | MKRLAPIALIGVLLLAMMSIATAQQVPQPVVRLGNFLEVGNDVWMHIIATTDIRYT |
| Ga0184627_103967782 | 3300018079 | Groundwater Sediment | MRRLTQLVLIGVVVLVVASLAAAQQVPQPVVRLGNFIEVGNDVFMRIIAAIDARYTTVENRDFEGRVRDRVNSRFPSDTA |
| Ga0184639_104001041 | 3300018082 | Groundwater Sediment | MRRLAQVGLMAVLVLVVASLATAQQVPQPVTRLGNFIEVGNDVWMHLIATINAYYTTVEN |
| Ga0184639_105493302 | 3300018082 | Groundwater Sediment | MTRRLAQIALMGVVVLSVASMAAAQQVPQPVVRLGNWIEVGNDLFMHVIGTADIRYRTVRNYDFEKRVRDRVPARDPGNISAHEGDS |
| Ga0180110_12007221 | 3300019208 | Groundwater Sediment | MRRLTQLVFVGALLLAMVSLAAAQQSLQPVVRLGNFIEVGNDLFMHIIATTDARY |
| Ga0180110_12272911 | 3300019208 | Groundwater Sediment | MKRLTQLVHVAALLLVMVSLAAAQQTLQPVVRLGNFIEVGNDVWMHIIATADMRYNTVENMDFEKR |
| Ga0180106_10243861 | 3300019212 | Groundwater Sediment | MMRRLVQLALVVLLLAMVSPVVAQQTPQPVVRLGNWIEVGNDVFMHIMATADIRYKTSQNPDF |
| Ga0180119_12178501 | 3300019228 | Groundwater Sediment | MRRLAQVALMGALVLGMASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYHAVHNMDFENRVRDRVNG |
| Ga0180116_11678391 | 3300019229 | Groundwater Sediment | MTRRFVQIALVCVVMLAVVSMAAAQQVPQPMVRLGNFIEVGNDVFMHIMAAADIRYRT |
| Ga0180114_12509431 | 3300019232 | Groundwater Sediment | MSWLKQGARAGVLLLAMVSLAAAQQTPQPVVRLGNWIEVGNDVFMHIM |
| Ga0180114_13086061 | 3300019232 | Groundwater Sediment | MRRLTQGALVVVLLLAIVSLAAAQQTPQPVIRLGNYIEVGNDLFMHIIATADMRYKL |
| Ga0184645_13531061 | 3300019233 | Groundwater Sediment | MTRRFVRIVLVCVVVLGMASLAAAQQVPQPMVRFGNFIEVGNDVFMKIIAAADIRYRTTENY |
| Ga0180112_10169811 | 3300019238 | Groundwater Sediment | MRRLTQLVFVGALLLLMVSLAAAQQALQPVYRLGNFIEVGNDVFMHIIATADMRYNTFENLDFEKR |
| Ga0184648_11219701 | 3300019249 | Groundwater Sediment | MRGTESMTRRLMQIVFMGVVVLAVASIAAAQQTMQPVVRLGNFTEVANDVFMHIIATADIRYKTTENFDFENRVRDRVSSRSPSNSVEH |
| Ga0184641_10305131 | 3300019254 | Groundwater Sediment | MQRLTRITLIGVVVLAVASMATAQQTPQPVVRLGNFIEIGNDVFMHIIGTADIRYKTTENLDFESRVRDRVNSRFPTSTTVHEGE |
| Ga0184641_10635481 | 3300019254 | Groundwater Sediment | MTRRFVQIALVCVVVLGVASLATAQQVPQPMVRLGNFIEVGNDVFMKITAAADIRYRTTTDYDFD |
| Ga0184641_11612982 | 3300019254 | Groundwater Sediment | MKRRLAQVALVGVLVLAVASLAAAQQVPQPVVRLGNWIEVGNDVFMHIIGTADIRYRTVHNYDFDNRVRDRVPGRDPGNTATQEGDFD |
| Ga0184641_13138151 | 3300019254 | Groundwater Sediment | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVG |
| Ga0184641_14278332 | 3300019254 | Groundwater Sediment | MRRLTQVALMSVVVLAMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTEN |
| Ga0184641_14315511 | 3300019254 | Groundwater Sediment | MTKRLAQIALMGVLVLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYKTTENR |
| Ga0184643_12595661 | 3300019255 | Groundwater Sediment | LRRLTQIALIGVLVLAAASLATAQQVPQPMVRLGNFIEVGNDVFMHIMAAADIRYQTVTNRDFESR |
| Ga0184643_13534311 | 3300019255 | Groundwater Sediment | MRRLAQVALMGVLVLGVASLAAAQQVPQPVVRLGNFIEVGNDVFM |
| Ga0184646_12546051 | 3300019259 | Groundwater Sediment | MRRLTQVALMGVLVLVVASLATAQQVPQPVVRLGNFIEVGNDVFMKIIASSDIR |
| Ga0184646_13642732 | 3300019259 | Groundwater Sediment | MRRLTQVALMGVLVLGVASLATAQQVPQPVVRLGNSIEVGNDVFMKIMATADI |
| Ga0184646_13754031 | 3300019259 | Groundwater Sediment | MTRRLAQIALMGVLVLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGAADIRYRTTHNWDFESKVRDRPNSRFPGDTVAME |
| Ga0184647_13518281 | 3300019263 | Groundwater Sediment | MRRLTQVALVGVLLLAMGSLAAAQQMPQPVVRLGNFIEVGNDVFMHIMATADIRYKT |
| Ga0184647_13921661 | 3300019263 | Groundwater Sediment | MRRLTQVALVGVLLLAMGSLAAVQQIPQPVVRLGNFIEVGNDVFMHIMATADIRYKTSTNADFEER |
| Ga0184644_13030191 | 3300019269 | Groundwater Sediment | MRRLTQVALVGVLLLAMGSLAAAQQTPQPVVRLGNFIEVG |
| Ga0184644_17032031 | 3300019269 | Groundwater Sediment | MRGLTQVVLLGVLVLGVASLATAQQAPQPMVRLGNFIEVGNDVFMHIMATADIRYKTT |
| Ga0184644_17911911 | 3300019269 | Groundwater Sediment | LRRLTQIALIGVLVLAAASLATAQQAPQPMVRLGNFIEVGNDVFMHIMAAADIRYQTVTNRDFESRVRDRAAAREPNSAATRESEG |
| Ga0184642_12232621 | 3300019279 | Groundwater Sediment | MRRLTQGALLGVLLLGVVSLAAAQQVPQPVVRLGNFIEVGNDVFMHIIGSADIRYKTVENFDFENQVRDRTNTR |
| Ga0184642_12283272 | 3300019279 | Groundwater Sediment | MRRLTQVALLGVLVLGVASLATAQQVPQPMVRLGDFIEVGNDVFMHIMATADIRYKTVENYDFASDVRD |
| Ga0184642_12761011 | 3300019279 | Groundwater Sediment | MRRLAQVALMGVLVLVVASMAAAQQVPQPVVRLGNFIEVGNDVFMH |
| Ga0180113_10313471 | 3300020065 | Groundwater Sediment | MKRLTQLVVVGALLLGMVSLAVAQQALQPVVRLGNFIEVGNDVWMHIIATSDIRYMTVQNLDFDERI |
| Ga0180113_12182841 | 3300020065 | Groundwater Sediment | MRKPLTQVVLVSVIVQVVAALATAQQTTQPVVRFGSFIEVGNDVFMHIIGTSDIRYRTVQNFDFDNNVRDRTANRNPDTTSNHEGDG |
| Ga0184649_12732672 | 3300020068 | Groundwater Sediment | MRRLTQVALVGVLLLAMGSLAAAQQMPQPVVRLGNFIEVG |
| Ga0184649_13986381 | 3300020068 | Groundwater Sediment | MMRRFAQVALMGMLALGIASIATAQQTPQPVVRLGNFIEVGNDVFMHIIGTADIRFRTVDNWDFDSNVRDRAAGRNPNCTSCNDGDAQQWYAEVRL |
| Ga0184649_13997331 | 3300020068 | Groundwater Sediment | MRRLTQTVFMGALVLAVASIATAQQTPQPVVRLGNFLEVGNDVFMHIIGTADIRFRTVDNYDFDSKVRDRASDRNPNSSANNDGDGQQWYAEVRL |
| Ga0222625_14669132 | 3300022195 | Groundwater Sediment | MRGRLAQVALIGGLVFAVASLATAQQAAQPMVRLGNWIEAGNDVFMHIMASADIRYKTVHNWDFEDKVRDRTPDRGPGNTVTQEG |
| Ga0222625_15487601 | 3300022195 | Groundwater Sediment | MRRLTQVALMGVLVLGVASLATAQQVPQPVVRLGNSIEVGNDVFMKIMATADIRYHTTD |
| Ga0222625_16793592 | 3300022195 | Groundwater Sediment | MRRLIQVALLGVLVLAMASLATAQQVPQPVVRLGNSIEVGNDVFM |
| Ga0242660_11749642 | 3300022531 | Soil | MIRRLAQIVLMGVVVLAVASIAAAQQVPQPVVRLGNFIEVGNDVWMHIIASIETHFTTVENRDFEKRVRDRTDSRFPDDTAAQ |
| (restricted) Ga0233424_102920691 | 3300023208 | Freshwater | MKCALRFALVLMLVSASASLVAAQQTPQPVVRLGNWIEVGNDVFMHIMASADIRYKTTENFDFDGNVRDRVS |
| Ga0209827_114423461 | 3300025149 | Thermal Springs | MRRLTQGALVGVLVLVLVSIAAAQQTPQPVVRLGNWIEVGNEVFMH |
| Ga0209824_102871122 | 3300025173 | Wastewater | MKRLTQIAMMGVLLLAVVSIAAAQQVPQPMVRLGNFFEVGNDVFMHIQATADIRYHTTDNYDFES |
| Ga0207712_118786381 | 3300025961 | Switchgrass Rhizosphere | MLRRWTQGALLGVLLLGVVSLAAAQQVSQPVVRLGNFLEVGNDVFMHIIGGADIRYKTVQNFDFENRVRDRTQ |
| Ga0209376_13814091 | 3300026540 | Soil | LRRLTQIALVSVLVLAAVSLATAQRAPEPVARIGNFIEVGNDVFMHIMAQTEMRYLTMENRDFEKHVRDRPNSRFPS |
| Ga0209843_10464861 | 3300027511 | Groundwater Sand | MRRLTQGALLGVLLLGVVSLAAAQQVPQPVVRLGNFTEVANDVFMHIIGVADIRYKTVENFDFENRVRDRVNDRSPSASST |
| Ga0209843_10507923 | 3300027511 | Groundwater Sand | MRRLTQVALMGVLVLGIASLATAQQVPQPVVRLGNFFEVGNDVFMHIIATGDLRYRTTENYDFDSKVRERVSERT |
| Ga0209590_109772621 | 3300027882 | Vadose Zone Soil | MRRLTQGALLGALLLGVVSIAAAQRVPQPVVRLGNFIEVSNDVFMHIIGSA |
| Ga0209488_108829661 | 3300027903 | Vadose Zone Soil | MKRLTQIGLVGVLLLAMMSIATAQQAPQPVVRLGNFIEVGNDVFMHIIAATESR |
| Ga0207428_105452711 | 3300027907 | Populus Rhizosphere | MMRRLTQLVLMGVVVLGVAALAAAQQEPQPVVRLGNFIEVGNDVFMHIIGAADIRYKTTENYDFEGQVRDR |
| Ga0247653_10977121 | 3300030563 | Soil | MLKHLTQAVLVGVIVLAIAALAVAQQTPQPVVRLGNFIEVGNDVFMHIIGAADIRYHTVHNWDFEDNVRDRPTSRNPSNTSVQEGDGDILYAELRLGVE |
| Ga0247644_12096461 | 3300030576 | Soil | MLKHLTQAVLVGVIVLAIAALAVAQQTPQPVVRLGNFIEVGNDVFMHIIGAADIRYHTVQNWDFENNVRDRPASRNPSFTSVH |
| Ga0247612_11755991 | 3300030592 | Soil | MLKHLTQAVLVGVIVLAIAALAVAQQTPQPVVRLGNFIEVGNDVFMHIIGAADIRYHTV |
| Ga0247629_103684711 | 3300030628 | Soil | MLKHLTQAVLVGVIVLAIAALAVAQQTPQPVVRLGNFIEVGNDVFMHIIGAADIRYRTVHNWDFENKVRDRPTSRNPGNTSVHEGDGDILYAELR |
| Ga0308203_10064561 | 3300030829 | Soil | MRRLTQVALMGVMVLVVVSLATAQQVPQPMVRLGNFMEVGNDVFMKIM |
| Ga0308203_10244042 | 3300030829 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATGD |
| Ga0308203_10587761 | 3300030829 | Soil | MRRLTQVGLLGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMKIMATADIRYHTTENFDFDTKVRDRTVSR |
| Ga0308203_10735561 | 3300030829 | Soil | MTRRFVQIALVCVVVLAMASLAAAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENY |
| Ga0308203_10770411 | 3300030829 | Soil | MTRRFVQIALVGVVVLAVASMTAAQQVPQPMVRLGNFIEVGNDVFMHI |
| Ga0308203_10978831 | 3300030829 | Soil | MIRRFVQIALVGVVMLAMASFATAQQVPQPMVRLGDFIEVGNDVFMHIMAA |
| Ga0308205_10036572 | 3300030830 | Soil | MRRLIQVALLGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYH |
| Ga0308205_10044351 | 3300030830 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATADIRYKTSTNPDFEH |
| Ga0308205_10197881 | 3300030830 | Soil | MRRLTQVALMGVLVLGVASLATAQQVPQPVVRLGNSIEVGNDVFMKIMATADIRYHTT |
| Ga0308205_10625591 | 3300030830 | Soil | MTRRFIQIALVGVVVLTMISMAAAQQVPQPMVRLGNFIEVGNDVFMKIIAAADIRYRTT |
| Ga0308205_10664931 | 3300030830 | Soil | MTRRFVQIALVGVVVLAVASMAAAQQVPQPMVRLGNFIEVGNDVFMKIIAAADIRYRTT |
| Ga0308202_10056131 | 3300030902 | Soil | MRRLIQVALLGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMKIIAAADI |
| Ga0308202_10696371 | 3300030902 | Soil | MRRLTQVALMGVLALGMASLAAAQQVPEPMVRLGNFIEVGNDVFMHIMATADIRYKTVENYDFSSHVRDRTYSRNP |
| Ga0308202_11215111 | 3300030902 | Soil | MTRRFVQIALVCVVLLGMASLAAAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFDSSV |
| Ga0308202_11385891 | 3300030902 | Soil | MTRRFVQIVLVCVVMLGVASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFESGIRD |
| Ga0308206_10571361 | 3300030903 | Soil | MRRLTQVALVGVLLLAMGSLATAQQTPQPVVRLGNFIEVGNDVFMH |
| Ga0308206_10631582 | 3300030903 | Soil | MRRLTQGALVVVLLLAIVSLAAAQQTPQPVVRLGNYIEVGNDLF |
| Ga0308206_10696721 | 3300030903 | Soil | MRRLTQVALMGVVTLAVASMAMAQQVPQPMVRLGDFIEVGNDVFMHIMATADI |
| Ga0308206_11530331 | 3300030903 | Soil | MTRRLAQIALMGVLVLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYKTTENRDFESRVRDRTNSRFPGDTVAHDAEGD |
| Ga0308206_11825471 | 3300030903 | Soil | MRRLTQVALMGVLVLGIASLATAQQVPQPMVRLGNFIEVGNDVFMRIMAAADIR |
| Ga0308198_10003773 | 3300030904 | Soil | MRRLTQVALLGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASA |
| Ga0308198_10389662 | 3300030904 | Soil | MRRLIQVALLGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASA |
| Ga0308198_10502851 | 3300030904 | Soil | VRRLTQVALVGVLVLAVASLATAQQEPQPVARIGNFIEVGNDVFMHIMAQSEMRYLTMEN |
| Ga0308198_10581252 | 3300030904 | Soil | MRRLAQVALMGVLVLGVASLAAAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYHTAH |
| Ga0308198_10995031 | 3300030904 | Soil | MTRRLAQIALMGVLVLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGSADIR |
| Ga0308198_11048601 | 3300030904 | Soil | LRRLTQIALVSVLVLAAASLATAQRAPEPVARIGNFIEVGNDVFMHIIAATEMRYQAVENRDFE |
| Ga0308154_1012522 | 3300030986 | Soil | MRRLTQVVLLGVLVLGVASLATAQQAPQPMVRLGNFIEVGNDVFMHIMATADIRYK |
| Ga0308155_10004431 | 3300030987 | Soil | MRRLTQVALLGVLVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMA |
| Ga0308155_10109472 | 3300030987 | Soil | MRRLTQGALVGVLLLAIVSLAAAQQTPQPVVRIGNYIEVG |
| Ga0308155_10329781 | 3300030987 | Soil | MTRRFVQIALVCVVVLGVVSLAAAQQVPQPMVRLGDFIEVGNDVFMHIMATADIRYRTTENYDFDS |
| Ga0308155_10347632 | 3300030987 | Soil | MRGLTQVVLLGVLVLGVASLATAQQAPQPMVRLGNFIEVGNDVFMHIMATADIRYK |
| Ga0308183_10668431 | 3300030988 | Soil | MRRLTQVVLLGVLVLGVASLATAQQAPQPMVRLGNFIEVGNDVFMHIMATADIRYKTTENYDFASDVRDRVGDRNP |
| Ga0308183_11681621 | 3300030988 | Soil | MTRRFVQIALVGVVVLVVASMTAAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFESGIRDRTSSRS |
| Ga0308178_10519852 | 3300030990 | Soil | MRRLTQVALLGVLVLGVASLATAQQVPQPMVRLGNFIEVGNDVFMHIMATADIRYKTVENFDFASDV |
| Ga0308178_10644801 | 3300030990 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATADIRYKTSTNPDFEHRVRDR |
| Ga0308178_10678512 | 3300030990 | Soil | MRRLTQVALMSVVVLAMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMATADIRYRTTENYDFDAQVRD |
| Ga0308178_11470811 | 3300030990 | Soil | MTRRFVQIALVCVVVLAMASLAAAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFD |
| Ga0308190_10505812 | 3300030993 | Soil | LRRLTQIALIGVLVLAAASLATGQQAPQPMVRLGNFIEVGNDVFMHIIAAADIRYMTVTNRDFESRVRDRAHARE |
| Ga0308190_10968281 | 3300030993 | Soil | LRRLTQIALVSVLVLAAASLATAQRAPEPVARIGNFIEVGNDVFMHIIAATEMRYQAVEN |
| Ga0308190_11314001 | 3300030993 | Soil | MRRLTQVALVGVLWLAMGSLAAAQQTPQPVVRLGNFIEVGNDVFMHIMA |
| Ga0308190_11651501 | 3300030993 | Soil | MRGHLVQVALIGGLVLAVASLTTAQQGPQPMVRLGNWIEVGNDVFMHIMASADIRYKTVHNWDFEDKVRDRT |
| Ga0308189_102122902 | 3300031058 | Soil | MRRLTQVALIGVLVLAAASLATAQQAPQPMVRLGNFIEVGNDVFMHIMAAADIRYKTTENYDFA |
| Ga0308185_10168681 | 3300031081 | Soil | LRRLTQIALIGVLVLAAASLATAQQAPQPVVRLGNFIEVGNDVFMHIIAA |
| Ga0308204_101563061 | 3300031092 | Soil | MRRLTQVALLGVLVLAMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMASADIRYRTTENYDFDSQIR |
| Ga0308204_103011331 | 3300031092 | Soil | MRQLTQVALMGFVVLAVASMATAQQVPQPMVRLGNFMEVGNDVFMKIMATADIR |
| Ga0308197_101070251 | 3300031093 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATGDIRYKTSTNPDFEH |
| Ga0308197_102348422 | 3300031093 | Soil | MRRLTQVVLIGVLVLAAASLATAQQAPQPMVRLGNFIEVGNDVFMHIMASADIRYKTTENYDFASDVRDRVGDR |
| Ga0308197_103614452 | 3300031093 | Soil | MRGRLAQVALIGSLVLAVASMATAQQAAQPMVRLGNWIEVGNDVFMHIMASADIRYKTVHNRDFEDKVRDR |
| Ga0308197_104277601 | 3300031093 | Soil | MTRRLAQIALMGVLVLGVASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYRTTQNW |
| Ga0308197_104591891 | 3300031093 | Soil | MTRRLAQIALMGVLVLGVASLAAAQQVPQPVVRLGNFIEVGNDVFMHIMGSADIRYKTTENR |
| Ga0308199_11232102 | 3300031094 | Soil | MRRLTQVALLGVLVLVVASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYHT |
| Ga0308199_11598311 | 3300031094 | Soil | MRRLTQVALLGVLVLAMASLATSQQVPQPMVRLGNFIEVGNDVFMHIMATADIRYKTTDNYDFSRRVRDRE |
| Ga0308199_11764681 | 3300031094 | Soil | LRRLTQIALIGVLVLAAASLATAQQAPQPMVRLGNFIEVGNDVFMHIMASADI |
| Ga0308184_10482741 | 3300031095 | Soil | MTRRFVQRALVCVVMLAVVSMATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTTDYDFDKKVR |
| Ga0308193_10006441 | 3300031096 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIIATSDIRYVTTENRDFEKHVR |
| Ga0308191_10272761 | 3300031098 | Soil | MRRLTQVALLGVLVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMATADIRYHTTENFDFDTKVRER |
| Ga0308191_10324601 | 3300031098 | Soil | MRRLTQVALLGVLVLAMASPATAQQVPQPMVRLGNFIEVGNDVFMHIMATAD |
| Ga0308191_10418511 | 3300031098 | Soil | MTRRFVQIALVCVVMLGVASLATAQQVPQPMVRLGNFIEVGNDVFMHIMA |
| Ga0308181_10143692 | 3300031099 | Soil | LRRLTQIALIGVLVLAAVSLATAQQVPQPMVRLGNFIEVGNDVFMHIIAAADIRYLTVTNRDFESRVRDRAHAREPNGAA |
| Ga0308181_11690551 | 3300031099 | Soil | MMKRLTQLVLMGVIGLGVASLAAAQETPQPVVRLGNFIEVGNDVFMHIIGTADIRYKTTENWDFEKRVRDRTSNRSPSSTVEHEGEGDLSFA |
| Ga0308187_100433352 | 3300031114 | Soil | MRRLAQVALLGVLVLGMASLATAQQVPQPVVRLGNFIEVGNDVFMHIIGTADIRYHTTHNWDFESKTR |
| Ga0308187_100449652 | 3300031114 | Soil | MTRRFIQIALVGVVVLTMISMAAAQQVPQPMVRLGNFIEVGNDVFMKIIAAADIRYRTTENYDFD |
| Ga0308187_101334391 | 3300031114 | Soil | MRRLTQVALLGVLVLAMASLATSQQVPQPMVRLGNLIEVGNDVFMKIMASADIRYHTTEN |
| Ga0308187_102371042 | 3300031114 | Soil | MRRLIQVVLLGVLVLGMASLATAQQVPQPMVRLGNLIEVGNDVFMKIMASADIRYHTTEN |
| Ga0308187_103061651 | 3300031114 | Soil | MRRLTQGTLVVVLLLAIVSLAAAQQTPQPVVRLGNYIEVGNDVFMHIIATIDTRYFTVENRDFE |
| Ga0308187_103732161 | 3300031114 | Soil | MRGHLGRVVLLGGLVLAVASLATAQQGPQPMVRLGNWIEVGNDVLMHIMASADIRYKTV |
| Ga0308187_104480451 | 3300031114 | Soil | MRGRLAQVALIGSLVLAVASMATAQQAAQPMVRLGNWIEVGNDVFMHIMASADIRYKTVHNRDFEDKVRDRTPDRSPGNTA |
| Ga0308187_104814551 | 3300031114 | Soil | MKCARQLVLAGVMVLVTVSMVTAQQVPQPVVRLGNALEVGNDVFMHIIATADIRYHTVQNYDFEGRIRD |
| Ga0308151_10410321 | 3300031124 | Soil | MTRRFVQIALVGIVVLAVASLATAQQVPQPMVRLGNFIEVGNDVFMHILASADIRYRTTQNRDFETGVRDRTNSRYP |
| Ga0308151_10413311 | 3300031124 | Soil | MTRRFVQIALVCIVVLGMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFDSSVRDRVSS |
| Ga0308151_10423961 | 3300031124 | Soil | MTRRFVQIALVCVVMLGMASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADI |
| Ga0308151_10468511 | 3300031124 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMH |
| Ga0308151_10498101 | 3300031124 | Soil | MMRRLAQLTLMGALVLAMVSLAAAQQTPQPVVRLGNFIEVGNDVFMHVM |
| Ga0308182_10031971 | 3300031125 | Soil | MRRLTQVALLGVLVLGVASLATAQQAPQPMVRLGNFIEVGNDVFMHIMATADI |
| Ga0308182_10076141 | 3300031125 | Soil | LRRLTQIALIGVLVLAAVSLATAQQAPQPVVRLGNFIEVGNDVFMHIIAA |
| Ga0308182_10202701 | 3300031125 | Soil | MRRLTQRALVVVLWLAIVSLAIAQQAPQPVVRLGNFIEVGNDVFMHI |
| Ga0308194_100024962 | 3300031421 | Soil | MRRLTQVALLGVLVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYHTTTNW |
| Ga0308194_101129422 | 3300031421 | Soil | LRRLTQIALIGVLVLAAASLATAQQAPQPVVRLGNFIEVGNDVFMHIIAAADIRYQTVTNRDFESRVRDRAHAR |
| Ga0308194_103249052 | 3300031421 | Soil | MTRRLAQIALMGVLVLGVLLLVMASIAAAQQVSQPVVRLGNFIEVGNGVFIV |
| Ga0308186_10002842 | 3300031422 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATADIRYK |
| Ga0308186_10292581 | 3300031422 | Soil | MTRRFVQIALVGIVVLAVASLATAQQVPQPMVRLGNFIEVGNDVFMKIMATADIRYHTTENFDFDTKVRERVNSR |
| Ga0308186_10344171 | 3300031422 | Soil | MTRRFVQIALVCVVLLGMASLAAAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFDSSVRDRVSSR |
| Ga0308179_10158672 | 3300031424 | Soil | MRRLTQVALVGVLLLAMGSLAVAQQTPQPVVRLGNFIEVGNDVFMHIMATAD |
| Ga0308179_10320011 | 3300031424 | Soil | MRRLTQVAFLGVLVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMATADIRYHTTENFDFDTKVRERVNSRGPGDVTPQESS |
| Ga0308179_10501111 | 3300031424 | Soil | MTRRFVQIVLVCVVMLGVASLATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDF |
| Ga0310890_106899062 | 3300032075 | Soil | MLRRWTQGALLGVLLLGVVSLAAAQQASQPAIRLGNFLEVGNDVFMHIIGSADIRYKTVENYDFENRVRDRTNDRSPSSNASQEGDSDL |
| Ga0370544_15151_1_231 | 3300034447 | Soil | MTRRFVQIALMGVLVLGVASLATAQQMPEPMVRLGNFIEVGNDVFMHIMAQGDIRYRTTENYDFDSRVRDRVSSRSP |
| Ga0370545_000182_1_216 | 3300034643 | Soil | MEETIMRRLIQVALGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMKIMASGESRYRTTENYDFDTKVR |
| Ga0370545_002807_3_230 | 3300034643 | Soil | MGVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMKIMAAADIRYHTTENFDFDTKVRDRVNSRGPDDGTPQESA |
| Ga0370545_101939_382_624 | 3300034643 | Soil | MMRRLAQLTLMGALVLAMVSLAVAQQTPQPVVRLGNWIEVGNDVFMHIMAAADIRYHTTENFDFDTKVRDRVNSRGPDDGT |
| Ga0314781_070082_3_143 | 3300034660 | Soil | MLRRWTQGALLGVLLLGMVSLAAAQQVSQPAIRLGNFLEVGNDVFMH |
| Ga0314782_002744_2012_2191 | 3300034661 | Soil | MRRLIQLVFVGALLLGMVSLAAAQQTLQPVYRLGNFIEVGNDVFMHIIATADIRYNTVEN |
| Ga0314783_081807_3_185 | 3300034662 | Soil | MRRLTQVALVGVVILAVASLAAAQQALQPVYRLGNFLEVGNDVFMHIIATSDIRYQTVQN |
| Ga0314792_018413_1120_1284 | 3300034667 | Soil | MRRLTQVALVGIAILAVASIAAAQQELQPVVRLGNFIEVGNDVFMHFIAATDIRY |
| Ga0314795_055092_2_193 | 3300034670 | Soil | MRRLTQVALVGVVILAVASLAAAQQALQPVYRLGNFLEVGNDVFMHIIATSDIRYQTVQNRDFE |
| Ga0314800_075100_1_240 | 3300034675 | Soil | MTRLTQVALVGVAVLVGASIAAAQQGLQPVVRLGNFIEVSNDVFMHILATADIRYMTVENRDFESNVRDRAHAREPGSAP |
| Ga0314800_078049_2_217 | 3300034675 | Soil | MVILGVAALAVAQQEPQPVVRLGNFIEVGNDVFMHIIGAADIRYKTTENYDFEGQVRDRTGSRSPSSTPEHE |
| Ga0314801_154500_1_213 | 3300034676 | Soil | MIRRFVQIALVGVVMLAMASFATAQQVPQPMVRLGDFIEVGNDVFMHIMAAADIRYRTTENYDFDAQVRDR |
| Ga0314802_019663_2_154 | 3300034677 | Soil | MGVILLAVASMAMAQQVPQPMVRLGDFIEVGNDVFMHIMATADIRYRTTEN |
| Ga0314803_109338_1_189 | 3300034678 | Soil | MGVVVLGVAALAAAQQEPQPVVRLGNFIEVGNDVFMHIIGAADIRYKTTENYDFEGQVRDRTG |
| Ga0370541_001846_1445_1627 | 3300034680 | Soil | MTRLTQIALTGVLVLAAASLAIAQQAPQSMVRLGNFIEVGNDVFMHIMATADIRYKTVEN |
| Ga0370541_052138_2_193 | 3300034680 | Soil | MGVLVLVLAMASLATAQQVPQPMVRLGNFIEVGNDVFMHIMASADIRYHTTTNWDFDSKVRDRP |
| Ga0370546_004390_3_182 | 3300034681 | Soil | MRRLTQIALIGILVLAAASLATGQQAPQPMVRLGNFIEVGNDVFMHIIAAADIRYMTVTN |
| ⦗Top⦘ |