| Basic Information | |
|---|---|
| Family ID | F105979 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 100 |
| Average Sequence Length | 45 residues |
| Representative Sequence | MNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQW |
| Number of Associated Samples | 91 |
| Number of Associated Scaffolds | 100 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Bacteria |
| % of genes with valid RBS motifs | 97.00 % |
| % of genes near scaffold ends (potentially truncated) | 98.00 % |
| % of genes from short scaffolds (< 2000 bps) | 90.00 % |
| Associated GOLD sequencing projects | 89 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.35 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Bacteria (96.000 % of family members) |
| NCBI Taxonomy ID | 2 |
| Taxonomy | All Organisms → cellular organisms → Bacteria |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (11.000 % of family members) |
| Environment Ontology (ENVO) | Unclassified (32.000 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (35.000 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 32.43% β-sheet: 0.00% Coil/Unstructured: 67.57% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.35 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 100 Family Scaffolds |
|---|---|---|
| PF07729 | FCD | 32.00 |
| PF00392 | GntR | 11.00 |
| PF00501 | AMP-binding | 2.00 |
| PF09413 | DUF2007 | 2.00 |
| PF10095 | DUF2333 | 2.00 |
| PF11583 | AurF | 2.00 |
| PF00561 | Abhydrolase_1 | 1.00 |
| PF00300 | His_Phos_1 | 1.00 |
| PF07040 | DUF1326 | 1.00 |
| PF00216 | Bac_DNA_binding | 1.00 |
| PF13193 | AMP-binding_C | 1.00 |
| PF07298 | NnrU | 1.00 |
| PF00691 | OmpA | 1.00 |
| PF13738 | Pyr_redox_3 | 1.00 |
| PF03405 | FA_desaturase_2 | 1.00 |
| PF13517 | FG-GAP_3 | 1.00 |
| PF09335 | SNARE_assoc | 1.00 |
| PF14561 | TPR_20 | 1.00 |
| PF01966 | HD | 1.00 |
| PF06628 | Catalase-rel | 1.00 |
| PF00578 | AhpC-TSA | 1.00 |
| PF00107 | ADH_zinc_N | 1.00 |
| PF00730 | HhH-GPD | 1.00 |
| PF00106 | adh_short | 1.00 |
| COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
|---|---|---|---|
| COG1802 | DNA-binding transcriptional regulator, GntR family | Transcription [K] | 32.00 |
| COG2186 | DNA-binding transcriptional regulator, FadR family | Transcription [K] | 32.00 |
| COG0122 | 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 1.00 |
| COG0177 | Endonuclease III | Replication, recombination and repair [L] | 1.00 |
| COG0398 | Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 family | Function unknown [S] | 1.00 |
| COG0586 | Membrane integrity protein DedA, putative transporter, DedA/Tvp38 family | Cell wall/membrane/envelope biogenesis [M] | 1.00 |
| COG0753 | Catalase | Inorganic ion transport and metabolism [P] | 1.00 |
| COG0776 | Bacterial nucleoid DNA-binding protein IHF-alpha | Replication, recombination and repair [L] | 1.00 |
| COG1059 | Thermostable 8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 1.00 |
| COG1194 | Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs | Replication, recombination and repair [L] | 1.00 |
| COG1238 | Uncharacterized membrane protein YqaA, VTT domain | Function unknown [S] | 1.00 |
| COG2231 | 3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily | Replication, recombination and repair [L] | 1.00 |
| COG4094 | Uncharacterized membrane protein | Function unknown [S] | 1.00 |
| COG5588 | Uncharacterized conserved protein, DUF1326 domain | Function unknown [S] | 1.00 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| All Organisms | root | All Organisms | 96.00 % |
| Unclassified | root | N/A | 4.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 2035918004|FACENC_F56XM5W01DQ64T | All Organisms → cellular organisms → Bacteria → Proteobacteria | 524 | Open in IMG/M |
| 3300000559|F14TC_100329335 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1682 | Open in IMG/M |
| 3300004019|Ga0055439_10067761 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1004 | Open in IMG/M |
| 3300004027|Ga0055459_10055515 | All Organisms → cellular organisms → Bacteria | 955 | Open in IMG/M |
| 3300004463|Ga0063356_100738936 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1361 | Open in IMG/M |
| 3300005183|Ga0068993_10303702 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 577 | Open in IMG/M |
| 3300005332|Ga0066388_100522947 | All Organisms → cellular organisms → Bacteria | 1819 | Open in IMG/M |
| 3300005332|Ga0066388_106952323 | All Organisms → cellular organisms → Bacteria | 569 | Open in IMG/M |
| 3300005347|Ga0070668_100849169 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 814 | Open in IMG/M |
| 3300005364|Ga0070673_100902460 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 819 | Open in IMG/M |
| 3300005441|Ga0070700_100447962 | All Organisms → cellular organisms → Bacteria | 982 | Open in IMG/M |
| 3300005446|Ga0066686_10613396 | All Organisms → cellular organisms → Bacteria | 740 | Open in IMG/M |
| 3300005545|Ga0070695_101121662 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 644 | Open in IMG/M |
| 3300005548|Ga0070665_100204354 | All Organisms → cellular organisms → Bacteria | 1976 | Open in IMG/M |
| 3300005553|Ga0066695_10810994 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 539 | Open in IMG/M |
| 3300005559|Ga0066700_10349576 | All Organisms → cellular organisms → Bacteria | 1044 | Open in IMG/M |
| 3300005576|Ga0066708_10948175 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 536 | Open in IMG/M |
| 3300005617|Ga0068859_101393040 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 773 | Open in IMG/M |
| 3300005713|Ga0066905_100365953 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1157 | Open in IMG/M |
| 3300005713|Ga0066905_101179612 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 683 | Open in IMG/M |
| 3300005764|Ga0066903_100555582 | All Organisms → cellular organisms → Bacteria | 1971 | Open in IMG/M |
| 3300005764|Ga0066903_108161186 | All Organisms → cellular organisms → Bacteria | 536 | Open in IMG/M |
| 3300005843|Ga0068860_100827420 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 940 | Open in IMG/M |
| 3300005937|Ga0081455_10852433 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 569 | Open in IMG/M |
| 3300006046|Ga0066652_101497480 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 626 | Open in IMG/M |
| 3300006163|Ga0070715_10485261 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 704 | Open in IMG/M |
| 3300006465|Ga0082250_11248006 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 501 | Open in IMG/M |
| 3300006865|Ga0073934_10041203 | All Organisms → cellular organisms → Bacteria | 4181 | Open in IMG/M |
| 3300009090|Ga0099827_11127699 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 681 | Open in IMG/M |
| 3300009094|Ga0111539_13037603 | Not Available | 542 | Open in IMG/M |
| 3300009137|Ga0066709_101811866 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 856 | Open in IMG/M |
| 3300009137|Ga0066709_104357768 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 516 | Open in IMG/M |
| 3300009156|Ga0111538_11752728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 782 | Open in IMG/M |
| 3300009162|Ga0075423_11294528 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 779 | Open in IMG/M |
| 3300010047|Ga0126382_11330186 | Not Available | 651 | Open in IMG/M |
| 3300010047|Ga0126382_12351765 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 516 | Open in IMG/M |
| 3300010303|Ga0134082_10252780 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 730 | Open in IMG/M |
| 3300010360|Ga0126372_12200641 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 600 | Open in IMG/M |
| 3300010362|Ga0126377_12317881 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 613 | Open in IMG/M |
| 3300010366|Ga0126379_13210533 | All Organisms → cellular organisms → Bacteria | 547 | Open in IMG/M |
| 3300010376|Ga0126381_104474448 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 540 | Open in IMG/M |
| 3300010400|Ga0134122_12168607 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 598 | Open in IMG/M |
| 3300011271|Ga0137393_11193993 | All Organisms → cellular organisms → Bacteria | 646 | Open in IMG/M |
| 3300012205|Ga0137362_11142828 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 661 | Open in IMG/M |
| 3300012205|Ga0137362_11425035 | All Organisms → cellular organisms → Bacteria | 579 | Open in IMG/M |
| 3300012349|Ga0137387_11181408 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 541 | Open in IMG/M |
| 3300012351|Ga0137386_10977737 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 603 | Open in IMG/M |
| 3300012362|Ga0137361_10168650 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1967 | Open in IMG/M |
| 3300012582|Ga0137358_10022062 | All Organisms → cellular organisms → Bacteria | 4110 | Open in IMG/M |
| 3300012685|Ga0137397_10775900 | All Organisms → cellular organisms → Bacteria | 711 | Open in IMG/M |
| 3300012923|Ga0137359_10456404 | All Organisms → cellular organisms → Bacteria | 1130 | Open in IMG/M |
| 3300012975|Ga0134110_10262350 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 738 | Open in IMG/M |
| 3300012977|Ga0134087_10031231 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2016 | Open in IMG/M |
| 3300014155|Ga0181524_10146972 | Not Available | 1226 | Open in IMG/M |
| 3300014638|Ga0181536_10065266 | All Organisms → cellular organisms → Bacteria | 2262 | Open in IMG/M |
| 3300014885|Ga0180063_1192771 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 655 | Open in IMG/M |
| 3300015259|Ga0180085_1226195 | All Organisms → cellular organisms → Bacteria | 554 | Open in IMG/M |
| 3300015264|Ga0137403_10987978 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 689 | Open in IMG/M |
| 3300018058|Ga0187766_11246812 | Not Available | 540 | Open in IMG/M |
| 3300018468|Ga0066662_12098854 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 592 | Open in IMG/M |
| 3300019487|Ga0187893_10353882 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1016 | Open in IMG/M |
| 3300020220|Ga0194119_10899827 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 515 | Open in IMG/M |
| 3300021859|Ga0210334_10941793 | All Organisms → cellular organisms → Bacteria | 4405 | Open in IMG/M |
| (restricted) 3300024054|Ga0233425_10242805 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 841 | Open in IMG/M |
| 3300025550|Ga0210098_1098809 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 503 | Open in IMG/M |
| 3300025900|Ga0207710_10343214 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 760 | Open in IMG/M |
| 3300025910|Ga0207684_11655064 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 517 | Open in IMG/M |
| 3300025933|Ga0207706_10970237 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Rhizobium | 715 | Open in IMG/M |
| 3300025936|Ga0207670_11071298 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 680 | Open in IMG/M |
| 3300025938|Ga0207704_11327472 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 615 | Open in IMG/M |
| 3300025972|Ga0207668_11005436 | All Organisms → cellular organisms → Bacteria | 745 | Open in IMG/M |
| 3300025986|Ga0207658_11458796 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 626 | Open in IMG/M |
| 3300026035|Ga0207703_10086952 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2620 | Open in IMG/M |
| 3300026088|Ga0207641_12401648 | All Organisms → cellular organisms → Bacteria | 526 | Open in IMG/M |
| 3300026529|Ga0209806_1298801 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 541 | Open in IMG/M |
| 3300026548|Ga0209161_10070034 | All Organisms → cellular organisms → Bacteria | 2207 | Open in IMG/M |
| 3300027770|Ga0209086_10027302 | All Organisms → cellular organisms → Bacteria | 3464 | Open in IMG/M |
| 3300027900|Ga0209253_10340312 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1154 | Open in IMG/M |
| 3300027905|Ga0209415_10647236 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium | 770 | Open in IMG/M |
| (restricted) 3300027977|Ga0247834_1307756 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 544 | Open in IMG/M |
| 3300028379|Ga0268266_10535160 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1121 | Open in IMG/M |
| 3300028380|Ga0268265_11116010 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 783 | Open in IMG/M |
| 3300031242|Ga0265329_10109930 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 880 | Open in IMG/M |
| 3300031344|Ga0265316_10946893 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 600 | Open in IMG/M |
| 3300031681|Ga0318572_10424188 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 791 | Open in IMG/M |
| 3300031708|Ga0310686_109077153 | All Organisms → cellular organisms → Bacteria | 691 | Open in IMG/M |
| 3300031769|Ga0318526_10070173 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1368 | Open in IMG/M |
| 3300031805|Ga0318497_10188674 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1137 | Open in IMG/M |
| 3300031942|Ga0310916_11303014 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 598 | Open in IMG/M |
| 3300031949|Ga0214473_10747215 | All Organisms → cellular organisms → Bacteria | 1061 | Open in IMG/M |
| 3300031949|Ga0214473_11310853 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 742 | Open in IMG/M |
| 3300032180|Ga0307471_101889837 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 747 | Open in IMG/M |
| 3300032180|Ga0307471_103194771 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 581 | Open in IMG/M |
| 3300032180|Ga0307471_103348651 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 568 | Open in IMG/M |
| 3300032205|Ga0307472_100951754 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 800 | Open in IMG/M |
| 3300032261|Ga0306920_103904888 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 543 | Open in IMG/M |
| 3300032829|Ga0335070_10098049 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3075 | Open in IMG/M |
| 3300032955|Ga0335076_10728836 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 873 | Open in IMG/M |
| 3300033486|Ga0316624_10063422 | All Organisms → cellular organisms → Bacteria | 2433 | Open in IMG/M |
| 3300034165|Ga0364942_0270909 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 555 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 11.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.00% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.00% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 6.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 6.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 5.00% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 4.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.00% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.00% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.00% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.00% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.00% |
| Freshwater | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater | 2.00% |
| Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 2.00% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 2.00% |
| Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 2.00% |
| Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment | 1.00% |
| Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 1.00% |
| Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 1.00% |
| Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Sediment | 1.00% |
| Estuarine | Environmental → Aquatic → Marine → Intertidal Zone → Estuary → Estuarine | 1.00% |
| Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.00% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00% |
| Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.00% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.00% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.00% |
| Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.00% |
| Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00% |
| Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 1.00% |
| Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.00% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.00% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.00% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2035918004 | Soil microbial communities from sample at FACE Site 2 North Carolina CO2- | Environmental | Open in IMG/M |
| 3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental | Open in IMG/M |
| 3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental | Open in IMG/M |
| 3300004027 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_CattailNLC_D2 | Environmental | Open in IMG/M |
| 3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated | Open in IMG/M |
| 3300005183 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated | Open in IMG/M |
| 3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated | Open in IMG/M |
| 3300005441 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental | Open in IMG/M |
| 3300005548 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG | Host-Associated | Open in IMG/M |
| 3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental | Open in IMG/M |
| 3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated | Open in IMG/M |
| 3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental | Open in IMG/M |
| 3300006465 | Deep-sea sediment bacterial and archaeal communities from Fram Strait - Hausgarten IX | Environmental | Open in IMG/M |
| 3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
| 3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental | Open in IMG/M |
| 3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental | Open in IMG/M |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300014155 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_60_metaG | Environmental | Open in IMG/M |
| 3300014638 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_60_metaG | Environmental | Open in IMG/M |
| 3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental | Open in IMG/M |
| 3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental | Open in IMG/M |
| 3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300018058 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MG | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental | Open in IMG/M |
| 3300020220 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015018 Mahale Deep Cast 100m | Environmental | Open in IMG/M |
| 3300021859 | Metatranscriptome of estuarine sediment microbial communities from the Columbia River estuary, Oregon, United States ? S.306 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300024054 (restricted) | Freshwater microbial communities from Lake Towuti, South Sulawesi, Indonesia - Watercolumn_Towuti2014_140_MG | Environmental | Open in IMG/M |
| 3300025550 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Joice_ThreeSqA_D1 (SPAdes) | Environmental | Open in IMG/M |
| 3300025900 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025936 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025972 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025986 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026035 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
| 3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
| 3300027770 | Freshwater microbial communities from Lake Montjoie, Canada to study carbon cycling - M_130207_XF_MetaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027900 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - BRP12 BR (SPAdes) | Environmental | Open in IMG/M |
| 3300027905 | Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes) | Environmental | Open in IMG/M |
| 3300027977 (restricted) | Freshwater microbial communities from meromictic Lake La Cruz, Castile-La Mancha, Spain - LaCruzMarch2015_12m | Environmental | Open in IMG/M |
| 3300028379 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300031242 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-16-27 metaG | Host-Associated | Open in IMG/M |
| 3300031344 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-5-22 metaG | Host-Associated | Open in IMG/M |
| 3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental | Open in IMG/M |
| 3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental | Open in IMG/M |
| 3300031769 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f24 | Environmental | Open in IMG/M |
| 3300031805 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f23 | Environmental | Open in IMG/M |
| 3300031942 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176 | Environmental | Open in IMG/M |
| 3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental | Open in IMG/M |
| 3300032829 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3 | Environmental | Open in IMG/M |
| 3300032955 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5 | Environmental | Open in IMG/M |
| 3300033486 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_A | Environmental | Open in IMG/M |
| 3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| FACENCA_6872110 | 2035918004 | Soil | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWVGARDIEWDRPI |
| F14TC_1003293353 | 3300000559 | Soil | MSDFTLNTTTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERD |
| Ga0055439_100677611 | 3300004019 | Natural And Restored Wetlands | MNDFTLHSSTQTPGLATAMQVIYQWNYDSEVDELRRLYVKGTEAQWIAER |
| Ga0055459_100555151 | 3300004027 | Natural And Restored Wetlands | MSDFKIETTTQEADLPTAMQVIYQWNYDPEVEELRNLYVKAAEAQWIG |
| Ga0063356_1007389361 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MRDFGLTTDTQEPELDTAMKVIYQWSYDPEVDELRRLYVKAAEA |
| Ga0068993_103037021 | 3300005183 | Natural And Restored Wetlands | MKNGAFQLDTPTQEPGLATAMEVIYQWNYDSEVEELRRLYVKAAE |
| Ga0066388_1005229473 | 3300005332 | Tropical Forest Soil | MNEFTVQTPTQETGLATAMEVVYQWNYDAEVEELRNLYVK |
| Ga0066388_1069523232 | 3300005332 | Tropical Forest Soil | MSEFMLDTPTQEPDLESAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAERDID |
| Ga0070668_1008491692 | 3300005347 | Switchgrass Rhizosphere | MSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELRNLYVKAAE |
| Ga0070673_1009024602 | 3300005364 | Switchgrass Rhizosphere | MTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRLYAK |
| Ga0070700_1004479622 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEFMLKTATQEPDLATAMQVVYQWNYDPEVEELRNLYVKA |
| Ga0066686_106133961 | 3300005446 | Soil | MNDFTLTTPTQEPGLATSMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERDLDW |
| Ga0070695_1011216621 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MSDFTLQTPTQEPDLESAMRVVYQWNYDPEVEQLRSLYVKAAEAQWISNRD |
| Ga0070665_1002043543 | 3300005548 | Switchgrass Rhizosphere | MSEFTLKTPSQEPGYQTAMEVVFQWNYDTELEELRN |
| Ga0066695_108109942 | 3300005553 | Soil | MNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQWI |
| Ga0066700_103495761 | 3300005559 | Soil | MNDFTLTTSTQEPGLATAMAVVYQWNYDAEVDELRRLYVKAAEAQWI |
| Ga0066708_109481752 | 3300005576 | Soil | MSEFTIDTPTQEPGLETAMKVVYQWNYDPEVEELRRLYVKAAEAQWISERDVDWNRPIDH |
| Ga0068859_1013930401 | 3300005617 | Switchgrass Rhizosphere | MSEFTLTSATQEPELPTAMKVVYQWNYEPEVEELR |
| Ga0066905_1003659533 | 3300005713 | Tropical Forest Soil | MNEFTLTTPTQEPDLGTAMKVVYQWNYGSEVEELR |
| Ga0066905_1011796122 | 3300005713 | Tropical Forest Soil | MNDFELTTPTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAE |
| Ga0066903_1005555821 | 3300005764 | Tropical Forest Soil | MSDFTLATTTQEPNLDTAMKVVYQWNYDPEVEELRRLY |
| Ga0066903_1081611861 | 3300005764 | Tropical Forest Soil | MSDFTVTTSTQEPELDTAMKVVYQWNYEPEVEELRRLYVKAAEAQW |
| Ga0068860_1008274201 | 3300005843 | Switchgrass Rhizosphere | MSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELR |
| Ga0081455_108524331 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MKSDGFNLQTPTQEPGLATAMEVIYQWNYDSEVEELRR |
| Ga0066652_1014974802 | 3300006046 | Soil | MSEFTLSSPSQEPGYQTAMEVVFQWNYDPEVEELRNLYVKAAEAQW |
| Ga0070715_104852612 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYV |
| Ga0082250_112480062 | 3300006465 | Sediment | MKLQTETQEPDMETAMKVVYQWNYGSELEELRRLYVKGAELQWVA |
| Ga0073934_100412036 | 3300006865 | Hot Spring Sediment | MNEFRLRTATQEPDLDTAMKIIYQWNYDPEVEELRRLYIKAAEAQWIAERD |
| Ga0099827_111276992 | 3300009090 | Vadose Zone Soil | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWIGARDI |
| Ga0111539_130376032 | 3300009094 | Populus Rhizosphere | MTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELR |
| Ga0066709_1018118661 | 3300009137 | Grasslands Soil | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVK |
| Ga0066709_1043577681 | 3300009137 | Grasslands Soil | MTVQTSDFTIQTPTQEPGLGTAMEVVYQWNYDPEVEELRSLY |
| Ga0111538_117527282 | 3300009156 | Populus Rhizosphere | MSDFTLRTPTQEPDLATAMQVVYQWKYDPEVEELR |
| Ga0075423_112945281 | 3300009162 | Populus Rhizosphere | MSEFTVQTPTQETGIDTAMQVVYQWNYDPEVEELRSLYVKAA |
| Ga0126382_113301861 | 3300010047 | Tropical Forest Soil | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNL |
| Ga0126382_123517652 | 3300010047 | Tropical Forest Soil | METLTLQTPTQDPGLATAMEVVYQWNYDPEVEELRNLYVKAAEAQ |
| Ga0134082_102527801 | 3300010303 | Grasslands Soil | MNDFKLTTPTQEPGLASAMQVVYQWNYDPEVEELRRLY |
| Ga0126372_122006412 | 3300010360 | Tropical Forest Soil | MNDFMLETPTQEPDLASAMKVVYQWNYGSEVEELRRLYVKAAEAQWVAE |
| Ga0126377_123178811 | 3300010362 | Tropical Forest Soil | MNEFTLQTPTQEAGLATAMEVVYQWNYDAEVEELRNLYVKAAEAQWVGARDI |
| Ga0126379_132105332 | 3300010366 | Tropical Forest Soil | MSDFRLQTDTQEPGLDTAMKVVYQWNYDPEVEELRRLYVKAAEAQWVSE |
| Ga0126381_1044744481 | 3300010376 | Tropical Forest Soil | MSDFTLRTPTQEPGYQTAMEVVFQWNYDPEVEELRNLYVKAAE |
| Ga0134122_121686071 | 3300010400 | Terrestrial Soil | MSEFTLNTPTQEPELDTAMKVVFQWNYDPEVDELRRLYVKAAEAQWISSRDLDW |
| Ga0137393_111939931 | 3300011271 | Vadose Zone Soil | MSTLPIRTETQEPDLETAMKVVYQWNYDSEVDELRR |
| Ga0137362_111428282 | 3300012205 | Vadose Zone Soil | MSDAFTLKTSTQEPDLPTAMQIVYQWNYDPEVEELRNLYVK |
| Ga0137362_114250352 | 3300012205 | Vadose Zone Soil | MSDFTTVTATQEPALDTAMKVVYQWNYDPEVEELR |
| Ga0137387_111814081 | 3300012349 | Vadose Zone Soil | MNDFTLTTPTQEPGLATSMQVVYQWNYDPEVDELRRLYVKAAEA* |
| Ga0137386_109777371 | 3300012351 | Vadose Zone Soil | MNDFTLTTPTQEPGLATSMQVVYQWNYDPEVEELRRLYVKAAEAQWVAER |
| Ga0137361_101686503 | 3300012362 | Vadose Zone Soil | MPKERSMSDDFTLKTPTQEPDLPTAMQVIYQWNYDPEIEELRNLYVKAAEAQWIGAKDL |
| Ga0137358_100220621 | 3300012582 | Vadose Zone Soil | MSDDFTLKTPTQEPDLPTAMQVIYQWNYDPEIEELRNLYVKAAEAQWIGAKDLDWNRE |
| Ga0137397_107759002 | 3300012685 | Vadose Zone Soil | MEEPPMKEFTLQTATQEPGLGTAMEVVYQWNYDVEVDELRSLY |
| Ga0137359_104564041 | 3300012923 | Vadose Zone Soil | MDDFKLTTATQEPPLETAMKVVYQWNYDPEVEELRRLY |
| Ga0134110_102623501 | 3300012975 | Grasslands Soil | MSDFTIETPTQEPGLETAMKVVYQWNYDPEVEELRRLYVKAAEAQWISEATSI |
| Ga0134087_100312311 | 3300012977 | Grasslands Soil | MNDFTLTTPTQEPGLATATQVVYQWNYDPEVEELRRLYVKAAEAQW |
| Ga0181524_101469722 | 3300014155 | Bog | MSQFDINSATQEPDLATAMKVVYQWNYGSEVEELR |
| Ga0181536_100652661 | 3300014638 | Bog | MSQFDINTATQEPDLATAMKVVYQWNYGSEVEELRHLYVKALEAQ |
| Ga0180063_11927711 | 3300014885 | Soil | MDDFSLNTPTQEPGLETAMKVVYQWDYDPQVEELRRLYVKAAEAQWIADRDI |
| Ga0180085_12261952 | 3300015259 | Soil | MSEFTLKTTTQEPDLATAMQVVYQWNYDAEVEELRNLYVKAAEAQWIGEKHLDW |
| Ga0137403_109879782 | 3300015264 | Vadose Zone Soil | MNDFTLTTPTQEPGLASAMQVVYQWNYDPEVDELRRLYVKAAEAQWV |
| Ga0187766_112468121 | 3300018058 | Tropical Peatland | MSEFTVTTATQEAGLDTAMQVVYQWNYEPQVDELRRLYVKATEAQ |
| Ga0066662_120988541 | 3300018468 | Grasslands Soil | MNDFTLTTSTQEPGLATAMAVVYPWNYDAEVDELRRL |
| Ga0187893_103538822 | 3300019487 | Microbial Mat On Rocks | MSDFSLATATQEPGLDTAMKVVYQWDYEPQVEELRRLYVKAA |
| Ga0194119_108998271 | 3300020220 | Freshwater Lake | MSDFHIQTETQEPPLDTAMQVIYQWSYDPEVDELRNL |
| Ga0210334_109417933 | 3300021859 | Estuarine | MSEFKLRTETQEPDLDTAMKVVYQWNYDPEVEELRRLYHKATE |
| (restricted) Ga0233425_102428051 | 3300024054 | Freshwater | MSDFTLETPTQEPGLATAMQVVYQWSYEPEVDELRNLYVKGAEAQWVATRDIDWDRDID |
| Ga0210098_10988091 | 3300025550 | Natural And Restored Wetlands | MSDFKIETTTQEADLPTAMQVIYQWNYDPEVEELRNLY |
| Ga0207710_103432142 | 3300025900 | Switchgrass Rhizosphere | MSDFTVKTSTQEPDLSTAMQIVYQWNYEPEVDELRNLYVKAAEAQWV |
| Ga0207684_116550642 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MKNGAFSLDSPTQEPGLATAMEVIYQWNYDSEVEELRRLYVKAAEA |
| Ga0207706_109702371 | 3300025933 | Corn Rhizosphere | MIDFTLNTTTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAERDL |
| Ga0207670_110712981 | 3300025936 | Switchgrass Rhizosphere | MSEFMLKTATQEPDLATAMQVVYQWNYDPEVEELRNLYVKAAEGQWIGA |
| Ga0207704_113274722 | 3300025938 | Miscanthus Rhizosphere | MNEFTLQTPTQEPGYETAMEVVFQWNYDPEVEELRNLYVKAAEAQW |
| Ga0207668_110054361 | 3300025972 | Switchgrass Rhizosphere | MTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRLYAKATEAQWI |
| Ga0207658_114587962 | 3300025986 | Switchgrass Rhizosphere | MRDFGLTTDTQEPELDTAMKVIYQWSYDPEVDELRRLYVKAAEAQWVHRVATRAPVP |
| Ga0207703_100869525 | 3300026035 | Switchgrass Rhizosphere | MSDFTLNTMTQEPGLGTAMQVVYQWNYDPEVEELRRLYVKAAEAQWVAER |
| Ga0207641_124016482 | 3300026088 | Switchgrass Rhizosphere | MTTEFTIETPTQEPGYGTAMEVVFQWNYEPEVEELRRL |
| Ga0209806_12988011 | 3300026529 | Soil | MNDFTLTTPTQEPGLASAMQVVYQWNYDPEVEELRRLYVKAAEAQWVSERDLDWSRP |
| Ga0209161_100700343 | 3300026548 | Soil | MNDFTLTTPTQEPGLASAMQVVYQWNYDPEVDELRRLYVKAAEAQ |
| Ga0209086_100273021 | 3300027770 | Freshwater Lake | MSEFQLTTGTEEPGLDTAMKVVYQWNYEPDVEELRSLYHKAT |
| Ga0209253_103403121 | 3300027900 | Freshwater Lake Sediment | MSDFSLTTSTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEAQWVATR |
| Ga0209415_106472362 | 3300027905 | Peatlands Soil | MSEFTITPATQDPGFETAMKVVYQWNYGSEVEELRRLYVKA |
| (restricted) Ga0247834_13077562 | 3300027977 | Freshwater | MSDFQLTTTTQEPGFDTAMQVIYQWKYDPDVEELRDLYHKATQLQWV |
| Ga0268266_105351603 | 3300028379 | Switchgrass Rhizosphere | MSEFTLKTPSQEPGYQTAMEVVFQWNYDTELEELRNLYVKAAEAQWI |
| Ga0268265_111160101 | 3300028380 | Switchgrass Rhizosphere | MDDFTLNTATQEAGLDTAMKVVYQWNYEPEVEELRRLYMKAT |
| Ga0265329_101099302 | 3300031242 | Rhizosphere | MSDFTITTGTQEPPLDTAMQVIYQWNYDPEVEELRNLYVKAAEAQWV |
| Ga0265316_109468931 | 3300031344 | Rhizosphere | MDDFTLETATQEPGIATAMQVIYQWNYEPQVEELRRLYGKAT |
| Ga0318572_104241882 | 3300031681 | Soil | MSEFRLDTPTQEPDLESAMTVVYQWNYGSEVEELRRLYVKAAEAQWVAERDI |
| Ga0310686_1090771532 | 3300031708 | Soil | MNQFDITSATQEPGLSTAMQVVYQWNYGSEVEELR |
| Ga0318526_100701731 | 3300031769 | Soil | MSEFMLDTPTQEPDLESAMKVVYQWNYGSEVDELRRLYVK |
| Ga0318497_101886741 | 3300031805 | Soil | MSEFMLDTPTQEPDLESAMKVVYQWNYGSEVDELR |
| Ga0310916_113030142 | 3300031942 | Soil | MNEFTLTTRTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKA |
| Ga0214473_107472152 | 3300031949 | Soil | MREFTLETPTQDPPLETAMRVVYQWSYEPEVEELRRLYLKAVEAQWIAARDIDWE |
| Ga0214473_113108531 | 3300031949 | Soil | MRDFTLTTDTQEPALDAAMKVIYQWNYDPEVEELRRLYVKAADAPWVSE |
| Ga0307471_1018898371 | 3300032180 | Hardwood Forest Soil | MSEFTLTTPSQEPGYQTAMEVVFQWNYDPELEELRNLYVKAAEAQW |
| Ga0307471_1031947711 | 3300032180 | Hardwood Forest Soil | MKPFSLHTTTQEPGLGTAMEVVYQWNYDAEVDELRNLYVKAAEA |
| Ga0307471_1033486512 | 3300032180 | Hardwood Forest Soil | MRDFSLSTDTQEPELDTAMKVIYQWSYDPEVEELRRLYVKAAEAQWVSERDLDWNRSIDH |
| Ga0307472_1009517542 | 3300032205 | Hardwood Forest Soil | MNDFTLASPTQEPGLATAMHVVYQWNYDPEVDELRRLYVKAAEAQWIADRD |
| Ga0306920_1039048881 | 3300032261 | Soil | MNEFTLTTPTQEPDLGTAMKVVYQWNYGSEVEELRRLYVKAAEA |
| Ga0335070_100980491 | 3300032829 | Soil | MSAFTVKTPTQEPDLATAMQVVYQWNYDPEVEELRNLYVKAAEAQWISNRDLD |
| Ga0335076_107288362 | 3300032955 | Soil | MSDFNVRTTTQEPDLDTAMKVIYQWNYEPEVEELRRLYVKAAD |
| Ga0316624_100634221 | 3300033486 | Soil | MSDFKLTTPTQEPDLDTAMKVIYQWNYDPEVEELRRLYVKATEAQW |
| Ga0364942_0270909_3_155 | 3300034165 | Sediment | MSEFTVKTSTQEPDLPTAMQVVYQWNYDTDVEELRNLYVKAAEAQWIGAKH |
| ⦗Top⦘ |