| Basic Information | |
|---|---|
| Family ID | F070460 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 123 |
| Average Sequence Length | 47 residues |
| Representative Sequence | ADLIVDGKSDPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Number of Associated Samples | 114 |
| Number of Associated Scaffolds | 123 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Bacteria |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 99.19 % |
| % of genes from short scaffolds (< 2000 bps) | 93.50 % |
| Associated GOLD sequencing projects | 103 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.38 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Bacteria (89.431 % of family members) |
| NCBI Taxonomy ID | 2 |
| Taxonomy | All Organisms → cellular organisms → Bacteria |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (8.943 % of family members) |
| Environment Ontology (ENVO) | Unclassified (34.146 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (47.967 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 28.77% β-sheet: 5.48% Coil/Unstructured: 65.75% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.38 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 123 Family Scaffolds |
|---|---|---|
| PF01656 | CbiA | 13.01 |
| PF13602 | ADH_zinc_N_2 | 8.94 |
| PF02653 | BPD_transp_2 | 6.50 |
| PF13676 | TIR_2 | 4.07 |
| PF00501 | AMP-binding | 3.25 |
| PF00107 | ADH_zinc_N | 3.25 |
| PF07685 | GATase_3 | 2.44 |
| PF08240 | ADH_N | 2.44 |
| PF00528 | BPD_transp_1 | 2.44 |
| PF00005 | ABC_tran | 2.44 |
| PF13416 | SBP_bac_8 | 1.63 |
| PF07726 | AAA_3 | 1.63 |
| PF02026 | RyR | 1.63 |
| PF02518 | HATPase_c | 0.81 |
| PF02728 | Cu_amine_oxidN3 | 0.81 |
| PF00701 | DHDPS | 0.81 |
| PF02727 | Cu_amine_oxidN2 | 0.81 |
| PF00266 | Aminotran_5 | 0.81 |
| PF00578 | AhpC-TSA | 0.81 |
| PF00903 | Glyoxalase | 0.81 |
| PF03167 | UDG | 0.81 |
| PF09520 | RE_TdeIII | 0.81 |
| PF00496 | SBP_bac_5 | 0.81 |
| PF13458 | Peripla_BP_6 | 0.81 |
| PF13191 | AAA_16 | 0.81 |
| PF00072 | Response_reg | 0.81 |
| COG ID | Name | Functional Category | % Frequency in 123 Family Scaffolds |
|---|---|---|---|
| COG0329 | 4-hydroxy-tetrahydrodipicolinate synthase/N-acetylneuraminate lyase | Cell wall/membrane/envelope biogenesis [M] | 1.63 |
| COG3733 | Cu2+-containing amine oxidase | Secondary metabolites biosynthesis, transport and catabolism [Q] | 1.63 |
| COG0692 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.81 |
| COG1573 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.81 |
| COG3663 | G:T/U-mismatch repair DNA glycosylase | Replication, recombination and repair [L] | 0.81 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| All Organisms | root | All Organisms | 91.06 % |
| Unclassified | root | N/A | 8.94 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 2088090014|GPIPI_17095955 | All Organisms → cellular organisms → Bacteria | 2977 | Open in IMG/M |
| 2162886012|MBSR1b_contig_1346690 | All Organisms → cellular organisms → Bacteria | 882 | Open in IMG/M |
| 3300000953|JGI11615J12901_10681374 | All Organisms → cellular organisms → Bacteria | 794 | Open in IMG/M |
| 3300000956|JGI10216J12902_110885336 | All Organisms → cellular organisms → Bacteria | 992 | Open in IMG/M |
| 3300000956|JGI10216J12902_115070282 | All Organisms → cellular organisms → Bacteria | 576 | Open in IMG/M |
| 3300001990|JGI24737J22298_10172178 | All Organisms → cellular organisms → Bacteria | 639 | Open in IMG/M |
| 3300003911|JGI25405J52794_10155127 | All Organisms → cellular organisms → Bacteria | 523 | Open in IMG/M |
| 3300004022|Ga0055432_10196941 | All Organisms → cellular organisms → Bacteria | 578 | Open in IMG/M |
| 3300004157|Ga0062590_101828902 | All Organisms → cellular organisms → Bacteria | 624 | Open in IMG/M |
| 3300004479|Ga0062595_102097907 | All Organisms → cellular organisms → Bacteria | 550 | Open in IMG/M |
| 3300004480|Ga0062592_100808121 | All Organisms → cellular organisms → Bacteria | 832 | Open in IMG/M |
| 3300005290|Ga0065712_10324182 | All Organisms → cellular organisms → Bacteria | 824 | Open in IMG/M |
| 3300005293|Ga0065715_10955631 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 558 | Open in IMG/M |
| 3300005332|Ga0066388_106911483 | All Organisms → cellular organisms → Bacteria | 571 | Open in IMG/M |
| 3300005345|Ga0070692_10310378 | All Organisms → cellular organisms → Bacteria | 966 | Open in IMG/M |
| 3300005353|Ga0070669_100073865 | All Organisms → cellular organisms → Bacteria | 2527 | Open in IMG/M |
| 3300005367|Ga0070667_100245024 | Not Available | 1601 | Open in IMG/M |
| 3300005445|Ga0070708_100268825 | All Organisms → cellular organisms → Bacteria | 1603 | Open in IMG/M |
| 3300005459|Ga0068867_101499431 | All Organisms → cellular organisms → Bacteria | 628 | Open in IMG/M |
| 3300005467|Ga0070706_101269353 | All Organisms → cellular organisms → Bacteria | 676 | Open in IMG/M |
| 3300005467|Ga0070706_101772374 | All Organisms → cellular organisms → Bacteria | 561 | Open in IMG/M |
| 3300005546|Ga0070696_101359266 | All Organisms → cellular organisms → Bacteria | 604 | Open in IMG/M |
| 3300005577|Ga0068857_102321006 | All Organisms → cellular organisms → Bacteria | 527 | Open in IMG/M |
| 3300005586|Ga0066691_10547275 | All Organisms → cellular organisms → Bacteria | 690 | Open in IMG/M |
| 3300005616|Ga0068852_100247300 | All Organisms → cellular organisms → Bacteria | 1707 | Open in IMG/M |
| 3300005719|Ga0068861_100250460 | All Organisms → cellular organisms → Bacteria | 1511 | Open in IMG/M |
| 3300005881|Ga0075294_1000292 | All Organisms → cellular organisms → Bacteria | 1983 | Open in IMG/M |
| 3300006173|Ga0070716_101477423 | All Organisms → cellular organisms → Bacteria | 555 | Open in IMG/M |
| 3300006354|Ga0075021_10010800 | All Organisms → cellular organisms → Bacteria | 4970 | Open in IMG/M |
| 3300006804|Ga0079221_11666850 | All Organisms → cellular organisms → Bacteria | 518 | Open in IMG/M |
| 3300006806|Ga0079220_11067786 | Not Available | 650 | Open in IMG/M |
| 3300006881|Ga0068865_100294376 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1297 | Open in IMG/M |
| 3300006904|Ga0075424_102723969 | Not Available | 516 | Open in IMG/M |
| 3300007076|Ga0075435_100769865 | All Organisms → cellular organisms → Bacteria | 838 | Open in IMG/M |
| 3300007265|Ga0099794_10194584 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 1038 | Open in IMG/M |
| 3300009012|Ga0066710_103126139 | All Organisms → cellular organisms → Bacteria | 639 | Open in IMG/M |
| 3300009038|Ga0099829_11149408 | All Organisms → cellular organisms → Bacteria | 643 | Open in IMG/M |
| 3300009094|Ga0111539_11329258 | All Organisms → cellular organisms → Bacteria | 834 | Open in IMG/M |
| 3300009162|Ga0075423_10684824 | Not Available | 1082 | Open in IMG/M |
| 3300009545|Ga0105237_10686558 | All Organisms → cellular organisms → Eukaryota → Viridiplantae | 1030 | Open in IMG/M |
| 3300009804|Ga0105063_1058925 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 568 | Open in IMG/M |
| 3300009836|Ga0105068_1105077 | All Organisms → cellular organisms → Bacteria | 553 | Open in IMG/M |
| 3300010029|Ga0105074_1053892 | Not Available | 714 | Open in IMG/M |
| 3300010321|Ga0134067_10146000 | All Organisms → cellular organisms → Bacteria | 841 | Open in IMG/M |
| 3300010359|Ga0126376_10114747 | All Organisms → cellular organisms → Bacteria | 2086 | Open in IMG/M |
| 3300010359|Ga0126376_11188717 | Not Available | 777 | Open in IMG/M |
| 3300010360|Ga0126372_12566246 | All Organisms → cellular organisms → Bacteria | 561 | Open in IMG/M |
| 3300010360|Ga0126372_12880769 | All Organisms → cellular organisms → Bacteria | 533 | Open in IMG/M |
| 3300010375|Ga0105239_12011400 | All Organisms → cellular organisms → Bacteria | 671 | Open in IMG/M |
| 3300010376|Ga0126381_102991012 | All Organisms → cellular organisms → Bacteria | 671 | Open in IMG/M |
| 3300010398|Ga0126383_11703476 | All Organisms → cellular organisms → Bacteria | 719 | Open in IMG/M |
| 3300010398|Ga0126383_12858835 | All Organisms → cellular organisms → Bacteria | 564 | Open in IMG/M |
| 3300011119|Ga0105246_11836882 | All Organisms → cellular organisms → Bacteria | 580 | Open in IMG/M |
| 3300012189|Ga0137388_10211699 | All Organisms → cellular organisms → Bacteria | 1751 | Open in IMG/M |
| 3300012199|Ga0137383_11205876 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 544 | Open in IMG/M |
| 3300012231|Ga0137465_1066734 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 1070 | Open in IMG/M |
| 3300012354|Ga0137366_11133284 | Not Available | 536 | Open in IMG/M |
| 3300012363|Ga0137390_10486457 | Not Available | 1209 | Open in IMG/M |
| 3300012891|Ga0157305_10156016 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 619 | Open in IMG/M |
| 3300012904|Ga0157282_10378166 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 524 | Open in IMG/M |
| 3300012927|Ga0137416_10645442 | All Organisms → cellular organisms → Bacteria | 926 | Open in IMG/M |
| 3300012944|Ga0137410_10270446 | All Organisms → cellular organisms → Bacteria | 1338 | Open in IMG/M |
| 3300012948|Ga0126375_10818619 | All Organisms → cellular organisms → Bacteria | 739 | Open in IMG/M |
| 3300012984|Ga0164309_11019506 | All Organisms → cellular organisms → Bacteria | 683 | Open in IMG/M |
| 3300012985|Ga0164308_11565607 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 607 | Open in IMG/M |
| 3300013308|Ga0157375_13019289 | All Organisms → cellular organisms → Bacteria | 562 | Open in IMG/M |
| 3300014262|Ga0075301_1167287 | All Organisms → cellular organisms → Bacteria | 521 | Open in IMG/M |
| 3300014968|Ga0157379_11402881 | All Organisms → cellular organisms → Bacteria | 677 | Open in IMG/M |
| 3300015359|Ga0134085_10596250 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 513 | Open in IMG/M |
| 3300015371|Ga0132258_10582040 | All Organisms → cellular organisms → Bacteria | 2809 | Open in IMG/M |
| 3300015374|Ga0132255_100119441 | All Organisms → cellular organisms → Bacteria | 3620 | Open in IMG/M |
| 3300017789|Ga0136617_10292094 | All Organisms → cellular organisms → Bacteria | 1344 | Open in IMG/M |
| 3300017789|Ga0136617_10466548 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1009 | Open in IMG/M |
| 3300017927|Ga0187824_10040376 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1423 | Open in IMG/M |
| 3300018054|Ga0184621_10101227 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 1019 | Open in IMG/M |
| 3300018076|Ga0184609_10065846 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1577 | Open in IMG/M |
| 3300018076|Ga0184609_10450959 | All Organisms → cellular organisms → Bacteria | 591 | Open in IMG/M |
| 3300018422|Ga0190265_10461363 | All Organisms → cellular organisms → Bacteria | 1378 | Open in IMG/M |
| 3300018468|Ga0066662_12561192 | All Organisms → cellular organisms → Bacteria | 538 | Open in IMG/M |
| 3300018469|Ga0190270_11884496 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 654 | Open in IMG/M |
| 3300019883|Ga0193725_1037364 | All Organisms → cellular organisms → Bacteria | 1277 | Open in IMG/M |
| 3300019886|Ga0193727_1119921 | All Organisms → cellular organisms → Bacteria | 753 | Open in IMG/M |
| 3300019997|Ga0193711_1046229 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 525 | Open in IMG/M |
| 3300020018|Ga0193721_1083786 | All Organisms → cellular organisms → Bacteria | 826 | Open in IMG/M |
| 3300020081|Ga0206354_10082791 | All Organisms → cellular organisms → Bacteria | 800 | Open in IMG/M |
| 3300020580|Ga0210403_11164087 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 596 | Open in IMG/M |
| 3300022694|Ga0222623_10083064 | All Organisms → cellular organisms → Bacteria | 1243 | Open in IMG/M |
| 3300025535|Ga0207423_1083303 | All Organisms → cellular organisms → Bacteria | 579 | Open in IMG/M |
| 3300025907|Ga0207645_10328742 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 1021 | Open in IMG/M |
| 3300025910|Ga0207684_11415471 | All Organisms → cellular organisms → Bacteria | 569 | Open in IMG/M |
| 3300025913|Ga0207695_11749106 | All Organisms → cellular organisms → Bacteria | 503 | Open in IMG/M |
| 3300025914|Ga0207671_10455213 | All Organisms → cellular organisms → Eukaryota → Viridiplantae | 1019 | Open in IMG/M |
| 3300025915|Ga0207693_10351837 | All Organisms → cellular organisms → Bacteria | 1153 | Open in IMG/M |
| 3300025917|Ga0207660_10776815 | All Organisms → cellular organisms → Bacteria | 782 | Open in IMG/M |
| 3300025927|Ga0207687_10693388 | All Organisms → cellular organisms → Bacteria | 864 | Open in IMG/M |
| 3300025945|Ga0207679_11438553 | All Organisms → cellular organisms → Bacteria | 632 | Open in IMG/M |
| 3300025961|Ga0207712_10911886 | All Organisms → cellular organisms → Bacteria | 777 | Open in IMG/M |
| 3300025986|Ga0207658_11425422 | All Organisms → cellular organisms → Bacteria | 633 | Open in IMG/M |
| 3300026035|Ga0207703_12038207 | All Organisms → cellular organisms → Bacteria | 550 | Open in IMG/M |
| 3300026089|Ga0207648_10290394 | All Organisms → cellular organisms → Bacteria | 1464 | Open in IMG/M |
| 3300026089|Ga0207648_10957316 | All Organisms → cellular organisms → Bacteria | 801 | Open in IMG/M |
| 3300026285|Ga0209438_1049390 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 1387 | Open in IMG/M |
| 3300026371|Ga0257179_1012639 | All Organisms → cellular organisms → Bacteria | 904 | Open in IMG/M |
| 3300026514|Ga0257168_1062538 | All Organisms → cellular organisms → Bacteria | 820 | Open in IMG/M |
| 3300026548|Ga0209161_10353862 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 657 | Open in IMG/M |
| 3300027277|Ga0209846_1050922 | All Organisms → cellular organisms → Bacteria | 635 | Open in IMG/M |
| 3300027424|Ga0209984_1045530 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 637 | Open in IMG/M |
| 3300027671|Ga0209588_1138295 | All Organisms → cellular organisms → Bacteria | 775 | Open in IMG/M |
| 3300027717|Ga0209998_10177221 | Not Available | 554 | Open in IMG/M |
| 3300027775|Ga0209177_10290638 | Not Available | 619 | Open in IMG/M |
| 3300027775|Ga0209177_10409368 | All Organisms → cellular organisms → Bacteria | 545 | Open in IMG/M |
| 3300027894|Ga0209068_10008008 | All Organisms → cellular organisms → Bacteria | 4991 | Open in IMG/M |
| 3300027907|Ga0207428_11038563 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Amorphaceae → Acuticoccus → Acuticoccus yangtzensis | 575 | Open in IMG/M |
| (restricted) 3300028043|Ga0233417_10049410 | All Organisms → cellular organisms → Bacteria | 1687 | Open in IMG/M |
| 3300028381|Ga0268264_10727200 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 987 | Open in IMG/M |
| 3300028536|Ga0137415_10609110 | All Organisms → cellular organisms → Bacteria | 903 | Open in IMG/M |
| 3300028587|Ga0247828_10737620 | All Organisms → cellular organisms → Bacteria | 618 | Open in IMG/M |
| 3300028814|Ga0307302_10597450 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium 13_1_40CM_68_21 | 549 | Open in IMG/M |
| 3300029636|Ga0222749_10224769 | All Organisms → cellular organisms → Bacteria | 947 | Open in IMG/M |
| 3300031820|Ga0307473_11344178 | All Organisms → cellular organisms → Bacteria | 536 | Open in IMG/M |
| 3300031823|Ga0307478_11610301 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae | 536 | Open in IMG/M |
| 3300032075|Ga0310890_11242660 | All Organisms → cellular organisms → Bacteria | 608 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.94% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.13% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.50% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.50% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.88% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 4.06% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.06% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.25% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.25% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 3.25% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 3.25% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 2.44% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.44% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.44% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 2.44% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 2.44% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.44% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 2.44% |
| Polar Desert Sand | Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand | 1.63% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.63% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.63% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.63% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.63% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.63% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 1.63% |
| Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.63% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.63% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.81% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.81% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.81% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.81% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.81% |
| Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.81% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.81% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.81% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.81% |
| Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.81% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.81% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.81% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.81% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.81% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2088090014 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 2162886012 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1 | Host-Associated | Open in IMG/M |
| 3300000953 | Soil microbial communities from Great Prairies - Kansas Corn soil | Environmental | Open in IMG/M |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental | Open in IMG/M |
| 3300001990 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3 | Host-Associated | Open in IMG/M |
| 3300003911 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated | Open in IMG/M |
| 3300004022 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 | Environmental | Open in IMG/M |
| 3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental | Open in IMG/M |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
| 3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental | Open in IMG/M |
| 3300005290 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Rhizosphere Soil Replicate 1: eDNA_1 | Host-Associated | Open in IMG/M |
| 3300005293 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1 | Host-Associated | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental | Open in IMG/M |
| 3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated | Open in IMG/M |
| 3300005367 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG | Host-Associated | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental | Open in IMG/M |
| 3300005577 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 | Host-Associated | Open in IMG/M |
| 3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M |
| 3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated | Open in IMG/M |
| 3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated | Open in IMG/M |
| 3300005881 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_202 | Environmental | Open in IMG/M |
| 3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental | Open in IMG/M |
| 3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental | Open in IMG/M |
| 3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated | Open in IMG/M |
| 3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300009545 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG | Host-Associated | Open in IMG/M |
| 3300009804 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 | Environmental | Open in IMG/M |
| 3300009836 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_10_20 | Environmental | Open in IMG/M |
| 3300010029 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated | Open in IMG/M |
| 3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated | Open in IMG/M |
| 3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
| 3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012231 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT828_2 | Environmental | Open in IMG/M |
| 3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M |
| 3300012891 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S148-409B-2 | Environmental | Open in IMG/M |
| 3300012904 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S029-104C-1 | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental | Open in IMG/M |
| 3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental | Open in IMG/M |
| 3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated | Open in IMG/M |
| 3300014262 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1 | Environmental | Open in IMG/M |
| 3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
| 3300015374 | Col-0 rhizosphere combined assembly | Host-Associated | Open in IMG/M |
| 3300017789 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ322 (21.06) | Environmental | Open in IMG/M |
| 3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental | Open in IMG/M |
| 3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental | Open in IMG/M |
| 3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental | Open in IMG/M |
| 3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental | Open in IMG/M |
| 3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2 | Environmental | Open in IMG/M |
| 3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2 | Environmental | Open in IMG/M |
| 3300019997 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m2 | Environmental | Open in IMG/M |
| 3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2s2 | Environmental | Open in IMG/M |
| 3300020081 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-3 (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
| 3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental | Open in IMG/M |
| 3300022694 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex | Environmental | Open in IMG/M |
| 3300025535 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes) | Environmental | Open in IMG/M |
| 3300025907 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025914 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025945 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025961 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025986 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026089 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental | Open in IMG/M |
| 3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental | Open in IMG/M |
| 3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
| 3300027277 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30 (SPAdes) | Environmental | Open in IMG/M |
| 3300027424 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S PM (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027717 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental | Open in IMG/M |
| 3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental | Open in IMG/M |
| 3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027954 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes) | Environmental | Open in IMG/M |
| 3300028043 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MG | Environmental | Open in IMG/M |
| 3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028587 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3 | Environmental | Open in IMG/M |
| 3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental | Open in IMG/M |
| 3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| 3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental | Open in IMG/M |
| 3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| GPIPI_02023130 | 2088090014 | Soil | GLSPAAGRVLADLVVDGKSDPDLAPLSVERFRGRHEDAAALEAACVAHYARRYLR |
| MBSR1b_0107.00006900 | 2162886012 | Miscanthus Rhizosphere | AGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH |
| JGI11615J12901_106813742 | 3300000953 | Soil | ALAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| JGI10216J12902_1108853363 | 3300000956 | Soil | LSPATGRALADLIVDGKSDPDLAPISVERFGSAPRDPAALESACVAQYARRYTH* |
| JGI10216J12902_1150702821 | 3300000956 | Soil | DPDLAPLSVERFRGRWQGAAELEAACVGQYARKYTR* |
| JGI24737J22298_101721781 | 3300001990 | Corn Rhizosphere | GRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| JGI25405J52794_101551272 | 3300003911 | Tabebuia Heterophylla Rhizosphere | AGRALADLIVDGKSDPDLAPLSVERFGGAHRDPAALESACVAQYARRYTH* |
| Ga0055432_101969411 | 3300004022 | Natural And Restored Wetlands | ADLILDGRSDPDLGPLSVERFGCRFDAPAELRATCVAQYARKYTH* |
| Ga0062590_1018289021 | 3300004157 | Soil | LIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH* |
| Ga0062595_1020979071 | 3300004479 | Soil | AAGRALADLIVDGKSDPDLGPLSVERFRGRWEDAAELEAACVGQYARKYTH* |
| Ga0062592_1008081213 | 3300004480 | Soil | LSLSPSTGRALADLIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH* |
| Ga0065712_103241821 | 3300005290 | Miscanthus Rhizosphere | AGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0065715_109556311 | 3300005293 | Miscanthus Rhizosphere | ADLIVDGKSDPDLGPLSVGRFAGAARDPGALEAACVAQYARRYTH* |
| Ga0066388_1069114832 | 3300005332 | Tropical Forest Soil | LADLILDGRSDPDLAPLSVERFRGRFAHSAELTSACTSQYARKYTH* |
| Ga0070692_103103783 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | DLIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH* |
| Ga0070669_1000738653 | 3300005353 | Switchgrass Rhizosphere | LSLSPAAGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0070667_1002450241 | 3300005367 | Switchgrass Rhizosphere | LADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0070708_1002688253 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | SLSPAAGRALADLIVDGKSDPDLTPISVERFGESYRNPGTLESACVAQYARRYTH* |
| Ga0068867_1014994311 | 3300005459 | Miscanthus Rhizosphere | GKSDPDLGPLSVERFRGRWEDAAGLEAACVGQYARKYTH* |
| Ga0070706_1012693531 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | LIVDGKSEPDLAPLSVQRFAGAARDGAALEAACVAQYARRYTH* |
| Ga0070706_1017723741 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | SPAAGRALADLIVDGKSDPDLTPISVERFGESYRNPGTLESACVAQYARRYTH* |
| Ga0070696_1013592661 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | LSLSPAAGRALADLIVDGKSDPDLGPISVERFGAAPRDPAALEAACVAQYARRYTH* |
| Ga0068857_1023210061 | 3300005577 | Corn Rhizosphere | LIVDGKSDPDLAPLSVRRFAGAARDGAALEAACVAQYARRYTH* |
| Ga0066691_105472753 | 3300005586 | Soil | DGKSDPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH* |
| Ga0068852_1002473001 | 3300005616 | Corn Rhizosphere | SLSPAAGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALQAACVAQYARRYTH* |
| Ga0068861_1002504601 | 3300005719 | Switchgrass Rhizosphere | LSLSPAAGRALADLIVDGKSDPDLAPLSVERFRGRWEDAAELEAACVGQYARKYTH* |
| Ga0075294_10002921 | 3300005881 | Rice Paddy Soil | TGRALADLIADGKSDPDLAPISMERFADGSRDPAALEAACVAQYARRYTH* |
| Ga0070716_1014774232 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | GLSLSPAAGRALANLIVDGKSDPDLAPLSVERFAGVARDGAALEAACVAQYARRYTH* |
| Ga0075021_100108006 | 3300006354 | Watersheds | PDLAPISVERFGAAPRDPAALEAACIAQYARRYTH* |
| Ga0079221_116668502 | 3300006804 | Agricultural Soil | DGKSDPDLAPLSVERFAGAARDGAALEAACVAQYARRYTH* |
| Ga0079220_110677862 | 3300006806 | Agricultural Soil | DLIVDGKSDPDLGPLSVERFGAAPRDPAALEAACVAQYARRYTH* |
| Ga0068865_1002943761 | 3300006881 | Miscanthus Rhizosphere | PAAGRALADLVVDGRSDPDLTPLSVERFRGRLDDPSALEAACVAQYARRYTR* |
| Ga0075424_1027239692 | 3300006904 | Populus Rhizosphere | DLILDGRSEPDLGPWSVERFRGRFEDKAALEAACVAHYARKYIK* |
| Ga0075435_1007698652 | 3300007076 | Populus Rhizosphere | VGGLSLSPAAGRALADLIVDGKSEPDLAPLSVQRFAGAARDGAALEAACVAQYARRYTH* |
| Ga0099794_101945841 | 3300007265 | Vadose Zone Soil | DLIVDGKSDPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH* |
| Ga0066710_1031261392 | 3300009012 | Grasslands Soil | PSAGRALADLILDGKSDPDLGPLSVERFRGRYAEPRELESACIGQYARKYTH |
| Ga0099829_111494081 | 3300009038 | Vadose Zone Soil | DPDLGPLSVERFRGRYENPRELEAACVGEYARKYLH* |
| Ga0111539_113292581 | 3300009094 | Populus Rhizosphere | RALADLILDGKSDPDLAPLSVERFQGRYPDAAALEAVCVEHYARKYIK* |
| Ga0075423_106848242 | 3300009162 | Populus Rhizosphere | RALADLILDGRSEPDLGPWSVERFRGRFEDKAALEAACVAHYARKYIK* |
| Ga0105237_106865581 | 3300009545 | Corn Rhizosphere | DGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0105063_10589252 | 3300009804 | Groundwater Sand | VDGKSDPDLAPISVERFGEGYREPAALESACVAQYARRYMH* |
| Ga0105068_11050772 | 3300009836 | Groundwater Sand | DGKSDPDLAPISVERFGEGYREPAALESACVAQYARRYMH* |
| Ga0105074_10538921 | 3300010029 | Groundwater Sand | DGKSDPDLGPLSVERFRGRYTAAAELEAACVGQYARRYTR* |
| Ga0134067_101460002 | 3300010321 | Grasslands Soil | GGLSISPALGRALADLILDGRTEPDLRPYYVERFGEAYRDGAELAAACRYAYARKYVK* |
| Ga0126376_101147471 | 3300010359 | Tropical Forest Soil | AVGRTLADLILDGKSDPDLGPLSVERFRNRWDDTAGLTEACVRQYARRYTR* |
| Ga0126376_111887171 | 3300010359 | Tropical Forest Soil | SDPDLSPLSVDRFRGRLEDGAALEAACVAQYARRYVR* |
| Ga0126372_125662461 | 3300010360 | Tropical Forest Soil | DLIVDGKSDPDLSPLSVERFGERYRDQRELESICVAQYARRYTH* |
| Ga0126372_128807691 | 3300010360 | Tropical Forest Soil | PLARRGRTLADLILDGKSDPDLGPLSVERFRNRWDDTAGLTEACVRQYARRYTR* |
| Ga0105239_120114001 | 3300010375 | Corn Rhizosphere | DLIVDGKSDPDLGPLSVERFRGRWEDAAGLEAACVGQYARKYTH* |
| Ga0126381_1029910121 | 3300010376 | Tropical Forest Soil | GRALADLIVDGKSEPDLAPLSVERFRGRYEDTGALEAACVGQYARRYTR* |
| Ga0126383_117034762 | 3300010398 | Tropical Forest Soil | DGKSDPDLGPLSVERFKGRYEGAAELTAACLSQYARKYTH* |
| Ga0126383_128588351 | 3300010398 | Tropical Forest Soil | PAAGRALADLIVDGKSEPDLAPLSVERFRGHYEDTGALEAACVGQYARRYTR* |
| Ga0105246_118368821 | 3300011119 | Miscanthus Rhizosphere | IVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0137388_102116993 | 3300012189 | Vadose Zone Soil | ADLIVDGKSEPDLAPISVERFGAATRDPAALDAACVAQYARRYTH* |
| Ga0137383_112058761 | 3300012199 | Vadose Zone Soil | AGRALADLIVDGKSDPDPAPISVERIGPAPRDPPALEAACVAQYARRYTH* |
| Ga0137465_10667342 | 3300012231 | Soil | SPAAGRALADLIVDGRSDPDLAPLGVERFRGRLEDAAALEAACVAQYARRYMR* |
| Ga0137366_111332842 | 3300012354 | Vadose Zone Soil | GKSDPDLTPLSVERFRGRFAGARELESACVAQYARKYIH* |
| Ga0137390_104864571 | 3300012363 | Vadose Zone Soil | TGRALADLIVDGKSDPDLAPISVERFADGYRDPAQLEAACVAQYARRYTH* |
| Ga0157305_101560161 | 3300012891 | Soil | LADLIVDGKSDPDLAPISVERFGGAHRDPAALESACVAQYARRYTH* |
| Ga0157282_103781661 | 3300012904 | Soil | PAAGRALADLIVDGKSDPDLGPLSVERFRGRWEDAAGLEAACVGQYARKYTH* |
| Ga0137416_106454423 | 3300012927 | Vadose Zone Soil | DPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH* |
| Ga0137410_102704463 | 3300012944 | Vadose Zone Soil | LSPAAGRALADLIVDGKSDPDLAPISVERFADGYRDPAALEAACVAQYARRYTH* |
| Ga0126375_108186191 | 3300012948 | Tropical Forest Soil | SDPDLSPLSVERFAGRHETPGDLEAACVRQYARKYTH* |
| Ga0164309_110195063 | 3300012984 | Soil | DLGPLSVGRLAGAARDPGALEAACVAQYARRYTH* |
| Ga0164308_115656071 | 3300012985 | Soil | DLIVDGKSDPDLGPLSVERFRGRWEDAADLEAACVGQYARRYTR* |
| Ga0157375_130192892 | 3300013308 | Miscanthus Rhizosphere | GLSLSPAAGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH* |
| Ga0075301_11672871 | 3300014262 | Natural And Restored Wetlands | DGKSDPDLAPISVERFGTGDRDAATLESACVAQYARRYTH* |
| Ga0157379_114028813 | 3300014968 | Switchgrass Rhizosphere | PDLAPLSVERFRGRWEDAAGLEAACVGQYARRYTR* |
| Ga0134085_105962502 | 3300015359 | Grasslands Soil | RALADLILDGKSDPDLTPLSVERFRGRFADARELESACVGQYARKYMH* |
| Ga0132258_105820401 | 3300015371 | Arabidopsis Rhizosphere | KSDPDLGPLSVERFRGRWEDAADLEAACVGQYARRYTR* |
| Ga0132255_1001194415 | 3300015374 | Arabidopsis Rhizosphere | ALADLILDGKSDPDLAPLSVERFAGRHETQADLEAACVRQYARKYTH* |
| Ga0136617_102920941 | 3300017789 | Polar Desert Sand | GGLSLSPAAGRALADLIVDGRSVPDLAPLSVERFRGRYEDPAELEAACVRAYARKYTK |
| Ga0136617_104665483 | 3300017789 | Polar Desert Sand | RALADLIVDGKCEPDLAPLSVERFGAYADDPAALTAACVDHYARKYMK |
| Ga0187824_100403762 | 3300017927 | Freshwater Sediment | PDLAPVSVERFAGVGRDPAALEAACVAQYARRYTH |
| Ga0184621_101012271 | 3300018054 | Groundwater Sediment | VDGRSDPDLAPLGVERFRGRLEDAAALEAACVAQYARRYTR |
| Ga0184609_100658463 | 3300018076 | Groundwater Sediment | LADLVVDGRSDPDLAPLGVERFRGRLEDAAALEAACVAQYARRYTR |
| Ga0184609_104509591 | 3300018076 | Groundwater Sediment | GLSPAAGRVLADLILDSKSDPDLTPLSVERFRGRYENARELEAACVGEYARKYRH |
| Ga0190265_104613631 | 3300018422 | Soil | AGRALADLVLDGKSDPDLTPISVARFAEGDRDPAALEAACVAQYARRYTH |
| Ga0066662_125611922 | 3300018468 | Grasslands Soil | DPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Ga0190270_118844961 | 3300018469 | Soil | DLVVDGRSDPDLAPLGVERFRGRLEDAAALEAACVAQYARRYTR |
| Ga0193725_10373643 | 3300019883 | Soil | RALADLIVDGKSEPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH |
| Ga0193727_11199213 | 3300019886 | Soil | LIVDGKSDPDLAPISVERFADGYRDPAQLEAACVAQYARRYTH |
| Ga0193711_10462292 | 3300019997 | Soil | PAAGRALADLIVDGKSDPDLAPISVERFADSYRDAAALEAACVAQYARRYTH |
| Ga0193721_10837863 | 3300020018 | Soil | PDLAPISVERFADSYRDPVALEAACVAQYARRYTH |
| Ga0206354_100827912 | 3300020081 | Corn, Switchgrass And Miscanthus Rhizosphere | ADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH |
| Ga0210403_111640872 | 3300020580 | Soil | ADLIVDGKSDPDLTPISVERFAAAPRDAAALEAACVAQYARRYTH |
| Ga0222623_100830643 | 3300022694 | Groundwater Sediment | GKSDPDLAPISVERFADSYRDPVALEAACVAQYARRYTH |
| Ga0207423_10833032 | 3300025535 | Natural And Restored Wetlands | ADLILDGRSDPDLGPLSVERFGCRFDAPAELRATCVAQYARKYTH |
| Ga0207645_103287421 | 3300025907 | Miscanthus Rhizosphere | VGGLGLSPAAGRALADLVVDGRSDPDLTPLSVERFRGRLEDPSALEAACVAQYARRYTR |
| Ga0207684_114154711 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | LIVDGKSEPDLAPLSVQRFAGAARDGAALEAACVAQYARRYTH |
| Ga0207695_117491061 | 3300025913 | Corn Rhizosphere | VGGLSLSPAAGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH |
| Ga0207671_104552131 | 3300025914 | Corn Rhizosphere | DPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH |
| Ga0207693_103518371 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | AAGRALADLIVDGKSDPDLGPLSVGRFAGAARDPGALEAACVAQYARRYTH |
| Ga0207660_107768151 | 3300025917 | Corn Rhizosphere | GRALADLIVDGKSDPDLTPISVERFADGYRDPAALEAACVAQYARRYTH |
| Ga0207687_106933883 | 3300025927 | Miscanthus Rhizosphere | LSLSPSTGRALADLIVDGKSDPDLAPLSVERFTDGYRDPAALKAACVAQYARRYTH |
| Ga0207679_114385532 | 3300025945 | Corn Rhizosphere | DGKSEPDLAPLSVQRFAGAARDGAALEAACVAQYARRYTH |
| Ga0207712_109118861 | 3300025961 | Switchgrass Rhizosphere | LIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH |
| Ga0207658_114254222 | 3300025986 | Switchgrass Rhizosphere | PAAGRALADLIVDGKSDPDLAPLSVERLAAASRDGAALEAACVAQYARRYTH |
| Ga0207703_120382071 | 3300026035 | Switchgrass Rhizosphere | PDLAPLSVERFRGRWEDAAGLEAACVGQYARRYTR |
| Ga0207648_102903941 | 3300026089 | Miscanthus Rhizosphere | RALADLIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH |
| Ga0207648_109573163 | 3300026089 | Miscanthus Rhizosphere | IVDGKSDPDLGPLSVERFRGRWEDAAGLEAACVGQYARKYTH |
| Ga0209438_10493901 | 3300026285 | Grasslands Soil | GRSDPDLAPLGVERFRGRLEDAAALEAACVAQYARRYTR |
| Ga0257179_10126391 | 3300026371 | Soil | AGRALADLIVDGKSDPDLGPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Ga0257168_10625383 | 3300026514 | Soil | IVDGKSDPDLGPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Ga0209161_103538622 | 3300026548 | Soil | ALADLILDGKSDPDLTPLSVERFRGRFADARELESACVGQYARKYMH |
| Ga0209846_10509222 | 3300027277 | Groundwater Sand | PDLTPLSVERFQDRYDDPTELEAACVGRYARKYLK |
| Ga0209984_10455301 | 3300027424 | Arabidopsis Thaliana Rhizosphere | AAGRALADLIVDGKSDPDLAPISVERFGGAHRDPAALESACVAQYARRYTH |
| Ga0209588_11382951 | 3300027671 | Vadose Zone Soil | ADLIVDGKSDPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Ga0209998_101772211 | 3300027717 | Arabidopsis Thaliana Rhizosphere | FSILAAAALADLIVDGASDPDLAPLSVERFRGRLEDRAALEAACVAQYARRYTR |
| Ga0209177_102906381 | 3300027775 | Agricultural Soil | DGKSDPDLGPLSVERFAGAARDGGALEAACVAQYARRYTH |
| Ga0209177_104093682 | 3300027775 | Agricultural Soil | SLSPAAGRALADLIVDGKSDPDLAPLSVERFAGAARDGAALEAACVAQYARRYTH |
| Ga0209068_100080081 | 3300027894 | Watersheds | IVDGKSDPDLAPISVERFGAAPRDPAALEAACIAQYARRYTH |
| Ga0207428_110385631 | 3300027907 | Populus Rhizosphere | LTLSPAAGRALADLILDGKSDPDLAPLSVERFQGRYPDAAALEAVCVEHYARKYIK |
| Ga0209859_10708172 | 3300027954 | Groundwater Sand | GRSEPDLTPLSVERFQDRYDDPTELEAACVGRYARKYLK |
| (restricted) Ga0233417_100494103 | 3300028043 | Sediment | SPAAGRALADLIVDGKSDPDLTPLAVERFADGYRDPAALEAACVAQYARRYTH |
| Ga0268264_107272001 | 3300028381 | Switchgrass Rhizosphere | GLSPAAGRALADLVVDGRSDPDLTPLSVERFRGRLEDPSALEAACVAQYARRYTR |
| Ga0137415_106091101 | 3300028536 | Vadose Zone Soil | PAAGRALADLIVDGKSDPDLAPISVERFGAAPRDPAALEAACVAQYARRYTH |
| Ga0247828_107376201 | 3300028587 | Soil | LSPSTGRALADLIVDGKSDPDLAPLSVERFTDGYRDPAALEAACVAQYARRYTH |
| Ga0307302_105974501 | 3300028814 | Soil | ADLVVDGRSDPDLAPLSVERFRGRLEDTAALEAACVAQYARRYTR |
| Ga0222749_102247691 | 3300029636 | Soil | PDLTPISVERFAAAPRDAAALEAACVAQYARRYTH |
| Ga0307473_113441782 | 3300031820 | Hardwood Forest Soil | VGGLSLSPAAGRALADLIVDGKSDPDLAPLSVERFAGAARDGAALEAACVAQYARRYTH |
| Ga0307478_116103011 | 3300031823 | Hardwood Forest Soil | AAGRTLADLIVDGKSDPDLTPISVERFAAAPRDAAALEAACVAQYARRYTH |
| Ga0310890_112426602 | 3300032075 | Soil | LIVDGKSDPDLAPLAVERFGAAPRDPAALEAACVAQYARRYTH |
| ⦗Top⦘ |