| Basic Information | |
|---|---|
| Family ID | F045958 |
| Family Type | Metagenome |
| Number of Sequences | 152 |
| Average Sequence Length | 78 residues |
| Representative Sequence | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQD |
| Number of Associated Samples | 103 |
| Number of Associated Scaffolds | 152 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Bacteria |
| % of genes with valid RBS motifs | 67.76 % |
| % of genes near scaffold ends (potentially truncated) | 27.63 % |
| % of genes from short scaffolds (< 2000 bps) | 75.66 % |
| Associated GOLD sequencing projects | 89 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.77 |
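
The percentages in the Quality Assessment table can be read back as approximate gene counts. The sketch below assumes each percentage is computed over the 152 family sequences listed under Basic Information; the page does not state the denominator, so treat that as an assumption. The helper name `approx_count` is illustrative only.

```python
# Family size taken from the "Number of Sequences" row.
FAMILY_SIZE = 152

# Percentages as reported in the Quality Assessment table.
reported = {
    "valid RBS motif": 67.76,
    "near scaffold end": 27.63,
    "short scaffold (< 2000 bp)": 75.66,
}

def approx_count(percent, total=FAMILY_SIZE):
    """Return the nearest whole number of genes for a reported percentage.

    Assumes the percentage was computed over `total` family members.
    """
    return round(total * percent / 100)

counts = {label: approx_count(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE}")
```

Under that assumption, roughly three quarters of the family members come from scaffolds shorter than 2000 bp, which is worth keeping in mind when interpreting the gene-neighbourhood frequencies further down.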
| Most Common Taxonomy | |
|---|---|
| Group | Bacteria (94.737 % of family members) |
| NCBI Taxonomy ID | 2 |
| Taxonomy | All Organisms → cellular organisms → Bacteria |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (23.026 % of family members) |
| Environment Ontology (ENVO) | Unclassified (35.526 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (52.632 % of family members) |
| Predicted Topology & Secondary Structure | |
|---|---|
| Classification | Globular |
| Signal Peptide | No |
| Secondary Structure distribution | α-helix: 22.33%, β-sheet: 17.48%, Coil/Unstructured: 60.19% |
| SCOP family | SCOP domain | Representative PDB | TM-score |
|---|---|---|---|
| d.110.2.1: GAF domain | d1mc0a1 | 1mc0 | 0.60616 |
| d.110.2.0: automated matches | d6p58a_ | 6p58 | 0.54669 |
| d.110.2.0: automated matches | d7ckva1 | 7ckv | 0.54014 |
| d.110.2.1: GAF domain | d1mc0a2 | 1mc0 | 0.53618 |
| d.145.1.4: CorC/HlyC domain-like | d2p3ha1 | 2p3h | 0.52935 |
| Pfam ID | Name | % Frequency in 152 Family Scaffolds |
|---|---|---|
| PF01061 | ABC2_membrane | 6.58 |
| PF00753 | Lactamase_B | 5.92 |
| PF02515 | CoA_transf_3 | 5.26 |
| PF13436 | Gly-zipper_OmpA | 3.29 |
| PF00691 | OmpA | 3.29 |
| PF12698 | ABC2_membrane_3 | 3.29 |
| PF05685 | Uma2 | 3.29 |
| PF13488 | Gly-zipper_Omp | 2.63 |
| PF12705 | PDDEXK_1 | 2.63 |
| PF02668 | TauD | 2.63 |
| PF04519 | Bactofilin | 2.63 |
| PF12695 | Abhydrolase_5 | 1.97 |
| PF13744 | HTH_37 | 1.97 |
| PF13361 | UvrD_C | 1.97 |
| PF00296 | Bac_luciferase | 1.97 |
| PF07690 | MFS_1 | 1.97 |
| PF12697 | Abhydrolase_6 | 1.32 |
| PF13533 | Biotin_lipoyl_2 | 1.32 |
| PF03631 | Virul_fac_BrkB | 1.32 |
| PF00529 | CusB_dom_1 | 0.66 |
| PF01425 | Amidase | 0.66 |
| PF10518 | TAT_signal | 0.66 |
| PF00326 | Peptidase_S9 | 0.66 |
| PF00575 | S1 | 0.66 |
| PF01244 | Peptidase_M19 | 0.66 |
| PF13711 | DUF4160 | 0.66 |
| PF02082 | Rrf2 | 0.66 |
| PF13441 | Gly-zipper_YMGG | 0.66 |
| PF09350 | DJC28_CD | 0.66 |
| PF00990 | GGDEF | 0.66 |
| PF13304 | AAA_21 | 0.66 |
| PF12146 | Hydrolase_4 | 0.66 |
| PF00486 | Trans_reg_C | 0.66 |
| PF16576 | HlyD_D23 | 0.66 |
| PF01734 | Patatin | 0.66 |
| COG ID | Name | Functional Category | % Frequency in 152 Family Scaffolds |
|---|---|---|---|
| COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 5.26 |
| COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 3.29 |
| COG1664 | Cytoskeletal protein CcmA, bactofilin family | Cytoskeleton [Z] | 2.63 |
| COG2175 | Taurine dioxygenase, alpha-ketoglutarate-dependent | Secondary metabolites biosynthesis, transport and catabolism [Q] | 2.63 |
| COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 1.97 |
| COG1295 | Uncharacterized membrane protein, BrkB/YihY/UPF0761 family (not an RNase) | Function unknown [S] | 1.32 |
| COG4667 | Predicted phospholipase, patatin/cPLA2 family | Lipid transport and metabolism [I] | 0.66 |
| COG0154 | Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 0.66 |
| COG3621 | Patatin-like phospholipase/acyl hydrolase, includes sporulation protein CotR | General function prediction only [R] | 0.66 |
| COG2524 | Predicted transcriptional regulator, contains C-terminal CBS domains | Transcription [K] | 0.66 |
| COG2378 | Predicted DNA-binding transcriptional regulator YobV, contains HTH and WYL domains | Transcription [K] | 0.66 |
| COG2355 | Zn-dependent dipeptidase, microsomal dipeptidase homolog | Posttranslational modification, protein turnover, chaperones [O] | 0.66 |
| COG2188 | DNA-binding transcriptional regulator, GntR family | Transcription [K] | 0.66 |
| COG2186 | DNA-binding transcriptional regulator, FadR family | Transcription [K] | 0.66 |
| COG1959 | DNA-binding transcriptional regulator, IscR family | Transcription [K] | 0.66 |
| COG1752 | Predicted acylesterase/phospholipase RssA, contains patatin domain | General function prediction only [R] | 0.66 |
| COG1725 | DNA-binding transcriptional regulator YhcF, GntR family | Transcription [K] | 0.66 |
| COG1414 | DNA-binding transcriptional regulator, IclR family | Transcription [K] | 0.66 |
| COG0640 | DNA-binding transcriptional regulator, ArsR family | Transcription [K] | 0.66 |
| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| All Organisms | root | All Organisms | 94.74 % |
| Unclassified | root | N/A | 5.26 % |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300000364|INPhiseqgaiiFebDRAFT_101591877 | All Organisms → cellular organisms → Bacteria | 4072 | Open in IMG/M |
| 3300000955|JGI1027J12803_108243037 | All Organisms → cellular organisms → Bacteria | 601 | Open in IMG/M |
| 3300002558|JGI25385J37094_10029760 | All Organisms → cellular organisms → Bacteria | 1929 | Open in IMG/M |
| 3300002562|JGI25382J37095_10000502 | All Organisms → cellular organisms → Bacteria | 9898 | Open in IMG/M |
| 3300002908|JGI25382J43887_10042861 | All Organisms → cellular organisms → Bacteria | 2460 | Open in IMG/M |
| 3300003324|soilH2_10003947 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 8958 | Open in IMG/M |
| 3300005166|Ga0066674_10046809 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1944 | Open in IMG/M |
| 3300005167|Ga0066672_10218023 | All Organisms → cellular organisms → Bacteria | 1220 | Open in IMG/M |
| 3300005174|Ga0066680_10092221 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1842 | Open in IMG/M |
| 3300005174|Ga0066680_10172343 | All Organisms → cellular organisms → Bacteria | 1361 | Open in IMG/M |
| 3300005176|Ga0066679_10226307 | All Organisms → cellular organisms → Bacteria | 1199 | Open in IMG/M |
| 3300005176|Ga0066679_10299923 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1044 | Open in IMG/M |
| 3300005176|Ga0066679_10904834 | All Organisms → cellular organisms → Bacteria | 555 | Open in IMG/M |
| 3300005177|Ga0066690_10137280 | All Organisms → cellular organisms → Bacteria | 1599 | Open in IMG/M |
| 3300005181|Ga0066678_10544908 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 770 | Open in IMG/M |
| 3300005184|Ga0066671_10140458 | All Organisms → cellular organisms → Bacteria | 1395 | Open in IMG/M |
| 3300005186|Ga0066676_10005666 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5782 | Open in IMG/M |
| 3300005187|Ga0066675_10010503 | All Organisms → cellular organisms → Bacteria | 4842 | Open in IMG/M |
| 3300005187|Ga0066675_11260100 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 547 | Open in IMG/M |
| 3300005332|Ga0066388_100089638 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3492 | Open in IMG/M |
| 3300005332|Ga0066388_100482217 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1878 | Open in IMG/M |
| 3300005332|Ga0066388_100626223 | All Organisms → cellular organisms → Bacteria | 1695 | Open in IMG/M |
| 3300005332|Ga0066388_100823309 | All Organisms → cellular organisms → Bacteria | 1518 | Open in IMG/M |
| 3300005332|Ga0066388_107069103 | All Organisms → cellular organisms → Bacteria | 564 | Open in IMG/M |
| 3300005406|Ga0070703_10525579 | All Organisms → cellular organisms → Bacteria | 536 | Open in IMG/M |
| 3300005445|Ga0070708_100016878 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6071 | Open in IMG/M |
| 3300005445|Ga0070708_100098962 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2667 | Open in IMG/M |
| 3300005445|Ga0070708_100115321 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2473 | Open in IMG/M |
| 3300005446|Ga0066686_10036361 | All Organisms → cellular organisms → Bacteria | 2921 | Open in IMG/M |
| 3300005447|Ga0066689_10001830 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 7712 | Open in IMG/M |
| 3300005450|Ga0066682_10017859 | All Organisms → cellular organisms → Bacteria | 3945 | Open in IMG/M |
| 3300005454|Ga0066687_10392629 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 801 | Open in IMG/M |
| 3300005467|Ga0070706_101870070 | All Organisms → cellular organisms → Bacteria | 545 | Open in IMG/M |
| 3300005468|Ga0070707_100796734 | All Organisms → cellular organisms → Bacteria | 909 | Open in IMG/M |
| 3300005518|Ga0070699_100303432 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1433 | Open in IMG/M |
| 3300005518|Ga0070699_101846738 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 553 | Open in IMG/M |
| 3300005536|Ga0070697_100161532 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1893 | Open in IMG/M |
| 3300005536|Ga0070697_100814907 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 826 | Open in IMG/M |
| 3300005536|Ga0070697_101222324 | All Organisms → cellular organisms → Bacteria | 670 | Open in IMG/M |
| 3300005540|Ga0066697_10070460 | All Organisms → cellular organisms → Bacteria | 2006 | Open in IMG/M |
| 3300005546|Ga0070696_100530765 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 940 | Open in IMG/M |
| 3300005546|Ga0070696_101155495 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 653 | Open in IMG/M |
| 3300005549|Ga0070704_100716150 | All Organisms → cellular organisms → Bacteria | 889 | Open in IMG/M |
| 3300005554|Ga0066661_10460017 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 776 | Open in IMG/M |
| 3300005557|Ga0066704_10389198 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 928 | Open in IMG/M |
| 3300005559|Ga0066700_10157977 | All Organisms → cellular organisms → Bacteria | 1541 | Open in IMG/M |
| 3300005559|Ga0066700_10268674 | All Organisms → cellular organisms → Bacteria | 1196 | Open in IMG/M |
| 3300005559|Ga0066700_10582757 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 780 | Open in IMG/M |
| 3300006049|Ga0075417_10013196 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3129 | Open in IMG/M |
| 3300006049|Ga0075417_10177300 | All Organisms → cellular organisms → Bacteria | 1001 | Open in IMG/M |
| 3300006049|Ga0075417_10222493 | All Organisms → cellular organisms → Bacteria | 899 | Open in IMG/M |
| 3300006173|Ga0070716_100526073 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 877 | Open in IMG/M |
| 3300006804|Ga0079221_10877511 | All Organisms → cellular organisms → Bacteria | 655 | Open in IMG/M |
| 3300006844|Ga0075428_100872278 | All Organisms → cellular organisms → Bacteria | 955 | Open in IMG/M |
| 3300006845|Ga0075421_101363608 | All Organisms → cellular organisms → Bacteria | 782 | Open in IMG/M |
| 3300006852|Ga0075433_10218209 | All Organisms → cellular organisms → Bacteria | 1694 | Open in IMG/M |
| 3300006854|Ga0075425_101451745 | All Organisms → cellular organisms → Bacteria | 775 | Open in IMG/M |
| 3300006871|Ga0075434_101062681 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 823 | Open in IMG/M |
| 3300006871|Ga0075434_102073331 | All Organisms → cellular organisms → Bacteria | 573 | Open in IMG/M |
| 3300006904|Ga0075424_101488157 | All Organisms → cellular organisms → Bacteria | 719 | Open in IMG/M |
| 3300006904|Ga0075424_102270427 | All Organisms → cellular organisms → Bacteria | 570 | Open in IMG/M |
| 3300006954|Ga0079219_10006638 | All Organisms → cellular organisms → Bacteria | 3601 | Open in IMG/M |
| 3300006969|Ga0075419_10015319 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 4723 | Open in IMG/M |
| 3300007076|Ga0075435_101840867 | All Organisms → cellular organisms → Bacteria | 531 | Open in IMG/M |
| 3300007265|Ga0099794_10061459 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium | 1825 | Open in IMG/M |
| 3300009012|Ga0066710_101412016 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1078 | Open in IMG/M |
| 3300009012|Ga0066710_104283744 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 533 | Open in IMG/M |
| 3300009038|Ga0099829_10902998 | Not Available | 733 | Open in IMG/M |
| 3300009089|Ga0099828_10056531 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Methylococcales → Methylococcaceae → Methyloterricola → Methyloterricola oryzae | 3277 | Open in IMG/M |
| 3300009090|Ga0099827_10687186 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 884 | Open in IMG/M |
| 3300009090|Ga0099827_10749189 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 845 | Open in IMG/M |
| 3300009100|Ga0075418_10033895 | All Organisms → cellular organisms → Bacteria | 5550 | Open in IMG/M |
| 3300009137|Ga0066709_102170469 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 766 | Open in IMG/M |
| 3300009137|Ga0066709_102646892 | All Organisms → cellular organisms → Bacteria | 670 | Open in IMG/M |
| 3300009137|Ga0066709_103179771 | Not Available | 599 | Open in IMG/M |
| 3300009147|Ga0114129_10017055 | All Organisms → cellular organisms → Bacteria | 10339 | Open in IMG/M |
| 3300009147|Ga0114129_11094726 | Not Available | 998 | Open in IMG/M |
| 3300009147|Ga0114129_11971710 | All Organisms → cellular organisms → Bacteria | 707 | Open in IMG/M |
| 3300009162|Ga0075423_12344325 | All Organisms → cellular organisms → Bacteria | 581 | Open in IMG/M |
| 3300009162|Ga0075423_12361525 | All Organisms → cellular organisms → Bacteria | 579 | Open in IMG/M |
| 3300009777|Ga0105164_10014662 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4484 | Open in IMG/M |
| 3300010046|Ga0126384_10005032 | All Organisms → cellular organisms → Bacteria | 8063 | Open in IMG/M |
| 3300010046|Ga0126384_10155419 | All Organisms → cellular organisms → Bacteria | 1766 | Open in IMG/M |
| 3300010046|Ga0126384_10434612 | All Organisms → cellular organisms → Bacteria | 1117 | Open in IMG/M |
| 3300010048|Ga0126373_10355286 | All Organisms → cellular organisms → Bacteria | 1478 | Open in IMG/M |
| 3300010301|Ga0134070_10001545 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 6903 | Open in IMG/M |
| 3300010304|Ga0134088_10016499 | All Organisms → cellular organisms → Bacteria | 3224 | Open in IMG/M |
| 3300010320|Ga0134109_10505155 | All Organisms → cellular organisms → Bacteria | 500 | Open in IMG/M |
| 3300010359|Ga0126376_10110860 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2116 | Open in IMG/M |
| 3300010359|Ga0126376_10705491 | All Organisms → cellular organisms → Bacteria | 971 | Open in IMG/M |
| 3300010359|Ga0126376_10983257 | All Organisms → cellular organisms → Bacteria | 842 | Open in IMG/M |
| 3300010359|Ga0126376_13203018 | All Organisms → cellular organisms → Bacteria | 506 | Open in IMG/M |
| 3300010360|Ga0126372_10292782 | All Organisms → cellular organisms → Bacteria | 1425 | Open in IMG/M |
| 3300010376|Ga0126381_101248644 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1074 | Open in IMG/M |
| 3300010398|Ga0126383_10996962 | All Organisms → cellular organisms → Bacteria | 926 | Open in IMG/M |
| 3300012096|Ga0137389_10758812 | Not Available | 833 | Open in IMG/M |
| 3300012199|Ga0137383_10512261 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 879 | Open in IMG/M |
| 3300012205|Ga0137362_10170175 | All Organisms → cellular organisms → Bacteria | 1869 | Open in IMG/M |
| 3300012208|Ga0137376_10701193 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 873 | Open in IMG/M |
| 3300012285|Ga0137370_10157146 | All Organisms → cellular organisms → Bacteria | 1317 | Open in IMG/M |
| 3300012362|Ga0137361_10004456 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 9279 | Open in IMG/M |
| 3300012362|Ga0137361_10769225 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 877 | Open in IMG/M |
| 3300012362|Ga0137361_11802456 | All Organisms → cellular organisms → Bacteria | 530 | Open in IMG/M |
| 3300012685|Ga0137397_10202145 | All Organisms → cellular organisms → Bacteria | 1478 | Open in IMG/M |
| 3300012922|Ga0137394_10164453 | All Organisms → cellular organisms → Bacteria | 1891 | Open in IMG/M |
| 3300012922|Ga0137394_10267921 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1461 | Open in IMG/M |
| 3300012922|Ga0137394_11094682 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 660 | Open in IMG/M |
| 3300012922|Ga0137394_11473437 | All Organisms → cellular organisms → Bacteria | 540 | Open in IMG/M |
| 3300012923|Ga0137359_10450882 | All Organisms → cellular organisms → Bacteria | 1138 | Open in IMG/M |
| 3300012948|Ga0126375_10577871 | Not Available | 854 | Open in IMG/M |
| 3300012971|Ga0126369_11738523 | All Organisms → cellular organisms → Bacteria | 713 | Open in IMG/M |
| 3300017959|Ga0187779_10596704 | All Organisms → cellular organisms → Bacteria | 739 | Open in IMG/M |
| 3300017966|Ga0187776_10558745 | Not Available | 791 | Open in IMG/M |
| 3300018431|Ga0066655_10039906 | All Organisms → cellular organisms → Bacteria | 2355 | Open in IMG/M |
| 3300018433|Ga0066667_10063934 | All Organisms → cellular organisms → Bacteria | 2292 | Open in IMG/M |
| 3300018433|Ga0066667_11031034 | All Organisms → cellular organisms → Bacteria | 710 | Open in IMG/M |
| 3300018433|Ga0066667_11132360 | All Organisms → cellular organisms → Bacteria | 678 | Open in IMG/M |
| 3300018468|Ga0066662_10496430 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1110 | Open in IMG/M |
| 3300018482|Ga0066669_10000603 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 11926 | Open in IMG/M |
| 3300018482|Ga0066669_10135140 | All Organisms → cellular organisms → Bacteria | 1785 | Open in IMG/M |
| 3300018482|Ga0066669_11020865 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 747 | Open in IMG/M |
| 3300019883|Ga0193725_1000121 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 26296 | Open in IMG/M |
| 3300024284|Ga0247671_1062759 | All Organisms → cellular organisms → Bacteria | 599 | Open in IMG/M |
| 3300025173|Ga0209824_10012899 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3574 | Open in IMG/M |
| 3300025910|Ga0207684_10177258 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1838 | Open in IMG/M |
| 3300026298|Ga0209236_1035065 | All Organisms → cellular organisms → Bacteria | 2640 | Open in IMG/M |
| 3300026313|Ga0209761_1066557 | All Organisms → cellular organisms → Bacteria | 1932 | Open in IMG/M |
| 3300026318|Ga0209471_1176960 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 847 | Open in IMG/M |
| 3300026318|Ga0209471_1228565 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 672 | Open in IMG/M |
| 3300026324|Ga0209470_1127388 | All Organisms → cellular organisms → Bacteria | 1127 | Open in IMG/M |
| 3300026332|Ga0209803_1002182 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 12358 | Open in IMG/M |
| 3300026524|Ga0209690_1045778 | All Organisms → cellular organisms → Bacteria | 1989 | Open in IMG/M |
| 3300026532|Ga0209160_1098556 | All Organisms → cellular organisms → Bacteria | 1486 | Open in IMG/M |
| 3300026536|Ga0209058_1030419 | All Organisms → cellular organisms → Bacteria | 3342 | Open in IMG/M |
| 3300027748|Ga0209689_1037671 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2811 | Open in IMG/M |
| 3300027748|Ga0209689_1228505 | All Organisms → cellular organisms → Bacteria | 790 | Open in IMG/M |
| 3300027748|Ga0209689_1236881 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 766 | Open in IMG/M |
| 3300027873|Ga0209814_10014169 | All Organisms → cellular organisms → Bacteria | 3176 | Open in IMG/M |
| 3300027875|Ga0209283_10240142 | Not Available | 1202 | Open in IMG/M |
| 3300027882|Ga0209590_10307041 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1019 | Open in IMG/M |
| 3300027909|Ga0209382_10369446 | All Organisms → cellular organisms → Bacteria | 1603 | Open in IMG/M |
| (restricted) 3300031248|Ga0255312_1090443 | All Organisms → cellular organisms → Bacteria | 744 | Open in IMG/M |
| 3300031720|Ga0307469_10615303 | Not Available | 973 | Open in IMG/M |
| 3300031720|Ga0307469_10800339 | All Organisms → cellular organisms → Bacteria | 865 | Open in IMG/M |
| 3300031720|Ga0307469_11871790 | All Organisms → cellular organisms → Bacteria | 581 | Open in IMG/M |
| 3300031720|Ga0307469_12520860 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 503 | Open in IMG/M |
| 3300031740|Ga0307468_100371774 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1077 | Open in IMG/M |
| 3300031820|Ga0307473_10164466 | All Organisms → cellular organisms → Bacteria | 1279 | Open in IMG/M |
| 3300032180|Ga0307471_100046943 | All Organisms → cellular organisms → Bacteria | 3524 | Open in IMG/M |
| 3300032180|Ga0307471_102929839 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 606 | Open in IMG/M |
| 3300032180|Ga0307471_103558115 | All Organisms → cellular organisms → Bacteria | 551 | Open in IMG/M |
| 3300032205|Ga0307472_100328815 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1244 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
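
The scaffold identifiers in the table above appear to combine a numeric prefix with a scaffold name, separated by `|`, and the prefixes match the Taxon OIDs listed in the sample table. A minimal sketch for splitting them, assuming that layout holds for every row; `parse_scaffold` and the pattern are hypothetical helpers, not part of any IMG/M tooling:

```python
import re

# Assumed layout: optional "(restricted) " flag, numeric Taxon OID, "|", scaffold name.
SCAFFOLD_RE = re.compile(
    r"^(?P<restricted>\(restricted\)\s+)?(?P<taxon_oid>\d+)\|(?P<name>\S+)$"
)

def parse_scaffold(identifier):
    """Split a scaffold identifier into Taxon OID, scaffold name and restricted flag."""
    match = SCAFFOLD_RE.match(identifier.strip())
    if match is None:
        raise ValueError(f"unrecognised scaffold identifier: {identifier!r}")
    return {
        "taxon_oid": match["taxon_oid"],
        "scaffold": match["name"],
        "restricted": match["restricted"] is not None,
    }

# Two identifiers copied from the scaffold table.
open_row = parse_scaffold("3300005174|Ga0066680_10092221")
restricted_row = parse_scaffold("(restricted) 3300031248|Ga0255312_1090443")
print(open_row)
print(restricted_row)
```

Grouping rows by the extracted `taxon_oid` would give the number of family scaffolds per sample, which could then be joined to the sample names by Taxon OID.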
| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 23.03% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 13.82% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 13.82% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 11.84% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 10.53% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 8.55% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.58% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.29% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.97% |
| Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 1.32% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.32% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.32% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.66% |
| Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.66% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.66% |
| Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.66% |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300003324 | Sugarcane bulk soil Sample H2 | Environmental | Open in IMG/M |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental | Open in IMG/M |
| 3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental | Open in IMG/M |
| 3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental | Open in IMG/M |
| 3300005184 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental | Open in IMG/M |
| 3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental | Open in IMG/M |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental | Open in IMG/M |
| 3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M |
| 3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental | Open in IMG/M |
| 3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental | Open in IMG/M |
| 3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental | Open in IMG/M |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated | Open in IMG/M |
| 3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated | Open in IMG/M |
| 3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated | Open in IMG/M |
| 3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental | Open in IMG/M |
| 3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300009777 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water | Environmental | Open in IMG/M |
| 3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental | Open in IMG/M |
| 3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental | Open in IMG/M |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental | Open in IMG/M |
| 3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
| 3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
| 3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental | Open in IMG/M |
| 3300017959 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MG | Environmental | Open in IMG/M |
| 3300017966 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_20_MG | Environmental | Open in IMG/M |
| 3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental | Open in IMG/M |
| 3300024284 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK12 | Environmental | Open in IMG/M |
| 3300025173 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes) | Environmental | Open in IMG/M |
| 3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental | Open in IMG/M |
| 3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental | Open in IMG/M |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental | Open in IMG/M |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental | Open in IMG/M |
| 3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
|---|---|---|---|
| INPhiseqgaiiFebDRAFT_1015918778 | 3300000364 | Soil | VPFARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSITVRELADENTLEFRQDESP* |
| JGI1027J12803_1082430372 | 3300000955 | Soil | AVRPRTRRVATPVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSITVRELADENT |
| JGI25385J37094_100297602 | 3300002558 | Grasslands Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| JGI25382J37095_1000050210 | 3300002562 | Grasslands Soil | MPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGD* |
| JGI25382J43887_100428612 | 3300002908 | Grasslands Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| soilH2_100039478 | 3300003324 | Sugarcane Root And Bulk Soil | MPVARIPFWRLRAQGVVEEAVRGGSRRRVTDREWPLPDAVRDKMRGMLEPLGFDLARVVTVREPAGEEALEFHQD* |
| Ga0066674_100468092 | 3300005166 | Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| Ga0066672_102180232 | 3300005167 | Soil | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDG* |
| Ga0066680_100922213 | 3300005174 | Soil | MALARIPFWRLRAHGVVEEAVRGGSRRRLLGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA* |
| Ga0066680_101723432 | 3300005174 | Soil | MPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDG |
| Ga0066679_102263071 | 3300005176 | Soil | ARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDVQEGNNGPNHG* |
| Ga0066679_102999232 | 3300005176 | Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0066679_109048342 | 3300005176 | Soil | MPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDVQGEQ* |
| Ga0066690_101372801 | 3300005177 | Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDVQGEQ* |
| Ga0066678_105449082 | 3300005181 | Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEF |
| Ga0066671_101404583 | 3300005184 | Soil | MPVTRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEGVRERMRELLAPLGFDLGRPIFVREPEGEDALEFRQDA* |
| Ga0066676_100056665 | 3300005186 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV* |
| Ga0066675_100105033 | 3300005187 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV* |
| Ga0066675_112601002 | 3300005187 | Soil | MPITRIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0066388_1000896383 | 3300005332 | Tropical Forest Soil | VRPWRGSVATCRRYSERMPVTRIPFWRLRAQGVVEEAVRGGSRRRATDHDWPLPDAVREKMRGVLEPLGFDLARAVTVREPSGEDALEFQQD* |
| Ga0066388_1004822174 | 3300005332 | Tropical Forest Soil | VSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGENALEFRQE* |
| Ga0066388_1006262233 | 3300005332 | Tropical Forest Soil | MPVTRIPFWQLRRHGVVEEAVRGGHRRRIVGRDWLLPDAVRDRVREMLEPLGFDVGRPILVREPDGEDALEFRQDDT |
| Ga0066388_1008233091 | 3300005332 | Tropical Forest Soil | VSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGENALEFRQD* |
| Ga0066388_1070691031 | 3300005332 | Tropical Forest Soil | VSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQD* |
| Ga0070703_105255792 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | VSVATPVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARSIIVRELEDENALEFRQD* |
| Ga0070708_1000168783 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRALLEPLGFELDRPILVREPEGEDALEFRQDGV* |
| Ga0070708_1000989623 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERVRPLLEPLGFDLTRPISVREPANQDALEFTQPE* |
| Ga0070708_1001153211 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VSVTPAVPITRIPFWELRRHGVVEEAVRGGSRRRVVGRDWPLPDSVRDRMRDLLEPLGFDVARAIMVRELEDQNALEFRQD* |
| Ga0066686_100363613 | 3300005446 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVLERVRELLEPLGFDLGRPIFVREPEGEDERT* |
| Ga0066689_100018302 | 3300005447 | Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGADGD* |
| Ga0066682_100178593 | 3300005450 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDVIDG* |
| Ga0066687_103926291 | 3300005454 | Soil | MPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGLELGRPILVREPEGEDALEFRQDVQGEQ* |
| Ga0070706_1018700702 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VSVTPAVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALE |
| Ga0070707_1007967342 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MPVALIPFWRLRQHAVSEEAIRGGSRRRALDRTWPLPDAVKERMRDLLEPLGFDLDRPISVNEPANQDALEFTQPE* |
| Ga0070699_1003034322 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | VPVALIPFWRLRQHAASEEAVRGGSRRRVLDRTWPLPDAVKERLRPLLEPLGFDVDQPVSVSEPVGQDALEFTQP* |
| Ga0070699_1018467381 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEPVRERLRALLEPLGFDLQRPVSVREPEGEDALEFSQS |
| Ga0070697_1001615321 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | ARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRTLLEPLGFELDRPILVREPEGEDALEFRQDGV* |
| Ga0070697_1008149072 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MTALARIPFWRLRAHGVVEEAVRGGSRRRQIGHEWLLPAGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA* |
| Ga0070697_1012223242 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | VSVTPAVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALEFRQD* |
| Ga0066697_100704604 | 3300005540 | Soil | PFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0070696_1005307652 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | VPVTRIPFWRLRAHGVVEEAVRGGTRRRLVGRDWTLPDAVHERVRGLLEPLGFDLGRPVSVREPEGEDALEFRQD* |
| Ga0070696_1011554951 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEPVRERLRALLEPLGFDLQRPVSVREPEGEDALEFSQS* |
| Ga0070704_1007161502 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | VPVALIPFWRLRQHAASEEAVRGGSRRRVLDRTWPLPDAVKERLRPLLEPLGFDVDQPVSVSEPANQDALEFTQP* |
| Ga0066661_104600171 | 3300005554 | Soil | MPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPGVV* |
| Ga0066704_103891982 | 3300005557 | Soil | VTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGN* |
| Ga0066700_101579772 | 3300005559 | Soil | MPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ* |
| Ga0066700_102686742 | 3300005559 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEEPSAVSNSRH* |
| Ga0066700_105827572 | 3300005559 | Soil | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQD |
| Ga0075417_100131963 | 3300006049 | Populus Rhizosphere | VPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD* |
| Ga0075417_101773004 | 3300006049 | Populus Rhizosphere | VSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD* |
| Ga0075417_102224933 | 3300006049 | Populus Rhizosphere | GGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0070716_1005260731 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0079221_108775111 | 3300006804 | Agricultural Soil | MARARGRGAACRSRTSRVTRVPIARIPFWQLRRHGVVEEAVRGGTRRRIVGRDWPLPDAVRETMRGLLEPLGFDLGRAISVREPDGEEALE |
| Ga0075428_1008722783 | 3300006844 | Populus Rhizosphere | AAGPRPRGLAAPVPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD* |
| Ga0075421_1013636083 | 3300006845 | Populus Rhizosphere | LATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD* |
| Ga0075433_102182093 | 3300006852 | Populus Rhizosphere | VSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD* |
| Ga0075425_1014517453 | 3300006854 | Populus Rhizosphere | WQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD* |
| Ga0075434_1010626812 | 3300006871 | Populus Rhizosphere | MPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRGS* |
| Ga0075434_1020733313 | 3300006871 | Populus Rhizosphere | WQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0075424_1014881571 | 3300006904 | Populus Rhizosphere | VPLARIPFWQLRRHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQDESG* |
| Ga0075424_1022704273 | 3300006904 | Populus Rhizosphere | LRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0079219_100066387 | 3300006954 | Agricultural Soil | PIARIPFWQLRRHGVVEEAVRGGTRRRIVGRDWPLPDAVRETMRGLLEPLGFDLGRAISVREPDGEEALEFRQDAGGH* |
| Ga0075419_100153194 | 3300006969 | Populus Rhizosphere | VSVATSVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD* |
| Ga0075435_1018408671 | 3300007076 | Populus Rhizosphere | SRIPFWQLRRHGVVEEAVRGGARRRIVGRDWPLPDAVRERMRGLLEPVGFDFARPILVREPDGEDALEFRQD* |
| Ga0099794_100614593 | 3300007265 | Vadose Zone Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEG |
| Ga0066710_1014120162 | 3300009012 | Grasslands Soil | MPVTRIPFWQLRRHGVAEEAVRGGSRRRILGRDWPLPEAVHERVRTLLEPLGFDLERPVSVREPEDEDALEFRQDDGQN |
| Ga0066710_1042837441 | 3300009012 | Grasslands Soil | MPITRIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV |
| Ga0099829_109029982 | 3300009038 | Vadose Zone Soil | MPIARIPFWELRRHGVAEEAVRGGVRTRILDREWPLPDATRERLRELLEPLGFDLARPVSVREPAGEDALEFRQEEPPA* |
| Ga0099828_100565314 | 3300009089 | Vadose Zone Soil | VPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE* |
| Ga0099827_106871862 | 3300009090 | Vadose Zone Soil | VPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWSLPEPVRERMRPLLEPLGFDLDRPISVREPANQDALEFIQPE* |
| Ga0099827_107491893 | 3300009090 | Vadose Zone Soil | MTALARIPFWRLRAHGVFEEAVRGGSRRRQIGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQ |
| Ga0075418_1003389510 | 3300009100 | Populus Rhizosphere | VPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPASVREPLRALLEPLGFDVGRSISVREPEDENALEFRQD* |
| Ga0066709_1021704692 | 3300009137 | Grasslands Soil | MALARIPFWRLRAHGVVEEAVRGGSRRRLVGREWPLPAGVRERMRGLLEPFGFDLARPVAVREPEGEDALEFSQDA* |
| Ga0066709_1026468922 | 3300009137 | Grasslands Soil | MPVARIPFWQLRRQGVVEEAIRGGNRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ* |
| Ga0066709_1031797711 | 3300009137 | Grasslands Soil | MALARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGEDALE |
| Ga0114129_100170557 | 3300009147 | Populus Rhizosphere | VSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0114129_110947262 | 3300009147 | Populus Rhizosphere | MPVTRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVRERMRELLAPLGFDLGRPIFVREPEGEDALEFRQDA* |
| Ga0114129_119717103 | 3300009147 | Populus Rhizosphere | VSLATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD* |
| Ga0075423_123443253 | 3300009162 | Populus Rhizosphere | FWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSINVRELADDNTLEFRQD* |
| Ga0075423_123615253 | 3300009162 | Populus Rhizosphere | FWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0105164_100146623 | 3300009777 | Wastewater | VPTALIPFWRLRQHAVTEEAVRGGSRRRALDQSWPLPATVTERLRPLLEPLGFDVDRPVSVREPAGQDALEFTQDQ* |
| Ga0126384_1000503211 | 3300010046 | Tropical Forest Soil | MPITRIPFWQLRRHGVVEEAVRGGHRRRIVGRDWPLPDAVRERVRELLEPLGFDVGRPILVREPDGEDALEFRQD* |
| Ga0126384_101554194 | 3300010046 | Tropical Forest Soil | VSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTLEFRQD* |
| Ga0126384_104346123 | 3300010046 | Tropical Forest Soil | VVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRDLLEPLGFDVARSITVRELGDDNALEFQQD* |
| Ga0126373_103552862 | 3300010048 | Tropical Forest Soil | VVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVARSITVRELGDDNALEFQQD* |
| Ga0134070_100015457 | 3300010301 | Grasslands Soil | MPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| Ga0134088_100164993 | 3300010304 | Grasslands Soil | MPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| Ga0134109_105051551 | 3300010320 | Grasslands Soil | MPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDVGGDGD* |
| Ga0126376_101108604 | 3300010359 | Tropical Forest Soil | VPSARIPFWQLRSHGVVEEAVRGGTRRRVVGRDWALPEAVREPLRALLEPLGFDVERSISVRELADENTLEFRQD* |
| Ga0126376_107054912 | 3300010359 | Tropical Forest Soil | VPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD* |
| Ga0126376_109832574 | 3300010359 | Tropical Forest Soil | VSVATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPESVREPLRALLEPLGFDVGRSISVREPEGGNA |
| Ga0126376_132030181 | 3300010359 | Tropical Forest Soil | VSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRSFLEPLGFDFDRSISVRELADDNTLEFRQD* |
| Ga0126372_102927822 | 3300010360 | Tropical Forest Soil | VVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVARSIMVRELGDDNALEFRQD* |
| Ga0126381_1012486441 | 3300010376 | Tropical Forest Soil | VSVATAVPFARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADDNTL |
| Ga0126383_109969622 | 3300010398 | Tropical Forest Soil | VSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVSRSISVRELADENTLEFRQD* |
| Ga0137389_107588122 | 3300012096 | Vadose Zone Soil | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERVRPLLEPLGFDLDKPISVSEPTNEDALEFTQPE* |
| Ga0137383_105122611 | 3300012199 | Vadose Zone Soil | MPTMLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDG |
| Ga0137362_101701752 | 3300012205 | Vadose Zone Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0137376_107011932 | 3300012208 | Vadose Zone Soil | MPVARVPFWQLRRHGVVEEAVRGGSRRRLVGREWPLPEAVREGLRALLEPLGFDLGRPISVREPDGEDALEFLQADVPEPPQAP* |
| Ga0137370_101571463 | 3300012285 | Vadose Zone Soil | MPVARVPFWQLRRHGVVEEAVRGGSRRRLVGREWPLPEAVREGLRALLEPLGFDLGRPISVREPDGEDALEFLQADVPEPPEAP* |
| Ga0137361_100044568 | 3300012362 | Vadose Zone Soil | MPTMPVTRVPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD* |
| Ga0137361_107692251 | 3300012362 | Vadose Zone Soil | MTALARIPFWRLRAHGVVEEAVRGGSRRRQIGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQ |
| Ga0137361_118024562 | 3300012362 | Vadose Zone Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0137397_102021453 | 3300012685 | Vadose Zone Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGGT* |
| Ga0137394_101644533 | 3300012922 | Vadose Zone Soil | PFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRELLEPLGFELDRPILVREPEGEDALEFRQDGGT* |
| Ga0137394_102679212 | 3300012922 | Vadose Zone Soil | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRPLLEPLGFDLARPVSVSEPAHQDALEFTQAE* |
| Ga0137394_110946822 | 3300012922 | Vadose Zone Soil | MPVIRVPFWQLRRHGVVEEAVRGGTRRRIVGRDWLLPEAVRERMRELLEPLGFDLARPVSVREPEGEDALEFRQDDSPA* |
| Ga0137394_114734372 | 3300012922 | Vadose Zone Soil | MPVARFPFWQLRRHGVVEEAIRGGSRRRIVGRDWMLPEAVRERMRELLEPLGFELGRPIVVREPEGEDALEFRQDGDPAVV* |
| Ga0137359_104508823 | 3300012923 | Vadose Zone Soil | MPITRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDGDPAVV* |
| Ga0126375_105778712 | 3300012948 | Tropical Forest Soil | MPVTRIPFWRLRAQGVVEEAVRGGTRRRVTGHDWPLPNAVRDRLRAMLEPLGFDLGRPVSVREPEGEDTLEFSQD* |
| Ga0126369_117385231 | 3300012971 | Tropical Forest Soil | VSVATAVPSARIPFWQLRRHGVVEEAVRGGTRRRVVGREWALPEAVREPLRALLEPLGFDVDRSISVRELADENTLEFRQD* |
| Ga0187779_105967042 | 3300017959 | Tropical Peatland | MPVARIPFWRLRAQGVVEEAVRGGTRRRLTGQSWPLPETVRAGLRAVLEPLGFDLARPVSVGEPADEDALEFSQD |
| Ga0187776_105587451 | 3300017966 | Tropical Peatland | VPVARLAFWELRRHNVAEEAVRGGVRRRIHDRAWPLPEAVRERLRVVLEPLGFDLARPVSVREPEGADALEFSQDEPRA |
| Ga0066655_100399062 | 3300018431 | Grasslands Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRRRIGGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD |
| Ga0066667_100639343 | 3300018433 | Grasslands Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD |
| Ga0066667_110310341 | 3300018433 | Grasslands Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV |
| Ga0066667_111323602 | 3300018433 | Grasslands Soil | MALARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGE |
| Ga0066662_104964301 | 3300018468 | Grasslands Soil | AERIITAMALARIPFWRLRAHGVVEEAVRGGSRRRQLGHEWPLPDGVRERMRGLLEPLGFDLARPVAVREPEGEDALEFSQDA |
| Ga0066669_100006035 | 3300018482 | Grasslands Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV |
| Ga0066669_101351402 | 3300018482 | Grasslands Soil | MLVTRIPFWRLRQNGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD |
| Ga0066669_110208652 | 3300018482 | Grasslands Soil | MARARIPFWRLRAHGVVEEAVRGGSRRRLIGHEWPLPEGVREGLRGLLEPLGFDLGRTVAVREPEGEDALEFSQDA |
| Ga0193725_10001216 | 3300019883 | Soil | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERLRPLLEPLGFDLARPVSVSEPAGQDALRFDQQDAAG |
| Ga0247671_10627591 | 3300024284 | Soil | MPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRG |
| Ga0209824_100128993 | 3300025173 | Wastewater | VPTALIPFWRLRQHAVTEEAVRGGSRRRALDQSWPLPATVTERLRPLLEPLGFDVDRPVSVREPAGQDALEFTQDQ |
| Ga0207684_101772582 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVKERMRDLLEPLGFDLNRPISVSEPANQDALEFTQPE |
| Ga0209236_10350652 | 3300026298 | Grasslands Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD |
| Ga0209761_10665572 | 3300026313 | Grasslands Soil | MLVTRIPFWRLRQNGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGD |
| Ga0209471_11769602 | 3300026318 | Soil | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRGLLEPLGFELDRPILVREPEGEDALEFRQDG |
| Ga0209471_12285652 | 3300026318 | Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQ |
| Ga0209470_11273881 | 3300026324 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDVIDG |
| Ga0209803_10021822 | 3300026332 | Soil | MLVTRIPFWRLRQHGVVEEAVRGGSRCRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGADGD |
| Ga0209690_10457783 | 3300026524 | Soil | MRRSSFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPEAVLERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV |
| Ga0209160_10985563 | 3300026532 | Soil | MPPMPVTRIPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLEPLGFELRRPILVSEPEDENALEFRQDDGGDGN |
| Ga0209058_10304192 | 3300026536 | Soil | VTSRIPFWQLRRHGVVEEAVRGGSRRRIVGRDWPLPDAVFERVRELLEPLGFDLGRPIFVREPEGEDALEFRQDV |
| Ga0209689_10376714 | 3300027748 | Soil | IPFWRLRQHGVVEEAVRGGSRRRIVGRDWPLPEPVRERMRGLLAPLGFDLMRPILVAEPEDENALEFRQDDGGDGD |
| Ga0209689_12285052 | 3300027748 | Soil | MPVARIPFWQLRRQGVVEEAIRGGSRRRIVGRDWVLPEAMRDRMRELLEPLGFELGRPILVCEPEGEDALEFRQDVQGEQ |
| Ga0209689_12368812 | 3300027748 | Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWALPEAVRDRMRELLEPLGFELGRPILVREPEGEDALEFRQDGDP |
| Ga0209814_100141694 | 3300027873 | Populus Rhizosphere | VPATRIPFWQLRRHGVAEEAVRGGSRRRIVGRDWPLPDGVRERLRGLLEPIGFDLARPIVVREPHGEDALEFQQD |
| Ga0209283_102401422 | 3300027875 | Vadose Zone Soil | VPIALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE |
| Ga0209590_103070411 | 3300027882 | Vadose Zone Soil | WPRPARSELASGSDRVPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRNLLEPLGFDLDRPISVREPANQDALEFIQPE |
| Ga0209382_103694464 | 3300027909 | Populus Rhizosphere | VPLATPVPVTRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDAVREPLRGLLEPLGFDVTRSIIVRELGDDNALEFRQD |
| (restricted) Ga0255312_10904433 | 3300031248 | Sandy Soil | MPRALIPFWRLRQHAASEEAVRGGSRRRAVDRSWPLPEAVTERLRPLLEPLGFDLARPITVSEPAGQDALQFDQPDPD |
| Ga0307469_106153032 | 3300031720 | Hardwood Forest Soil | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPDAVRERMRDVLEPLGFDLDRPISVSEPANKDALEFTQPE |
| Ga0307469_108003392 | 3300031720 | Hardwood Forest Soil | MPVTRVPFWRLRAQGVVEEAVRGGTRRRVRGHDWPLPEAVRDRMREVLEPLGFDLARPVSVREPEGEDALEFSQDEPGPGGECRGS |
| Ga0307469_118717901 | 3300031720 | Hardwood Forest Soil | VPVALIPFWRLRQHAVSEEAVRGGSRRRALDRTWPLPEPVRERLRPLLEPLGFDLARPISVSEPPGQDALQFDQP |
| Ga0307469_125208602 | 3300031720 | Hardwood Forest Soil | ARIPYWRLRAQAVAEEAVRGGTRRRLTEREWLLPEAVRDRLRDVLEPLGFDLARPVSVREAEGEDALEFSQDEGHKE |
| Ga0307468_1003717742 | 3300031740 | Hardwood Forest Soil | VPLARIPYWRLRAQAVAEEAVRGGTRRRLTEREWLLPEAVRDRLRDVLEPLGFDLARPVSVREAEGEDALEFSQDEGNKE |
| Ga0307473_101644662 | 3300031820 | Hardwood Forest Soil | MPVARIPFWQLRRHGVVEEAIRGGSRRRIVGRDWVLPEAVRERMRDLLEPLGFELGRPILVREPEGEDALEFRQDGDPAVV |
| Ga0307471_1000469434 | 3300032180 | Hardwood Forest Soil | MPVARIPFWQLRRHGVVEEAVRGGSRRRIVGRDWVLPEAVRERMRTLLEPLGFELDRPILVREPEGEDALEFRQDGV |
| Ga0307471_1029298392 | 3300032180 | Hardwood Forest Soil | MPTTRIPFWELRRHGVVEEAVRGGSRRRIVGRDWPLPEAVRERMGALLAPLGFDLGRPIVVREPDGEDALEFSQE |
| Ga0307471_1035581151 | 3300032180 | Hardwood Forest Soil | AVPITRIPFWELRRHGVVEEAVRGGTRRRVVGRDWPLPDSVRERMRDLLEPLGFDVARPIMVRELEDQNALEFRQD |
| Ga0307472_1003288152 | 3300032205 | Hardwood Forest Soil | MPVTRIPFWRLRSQGVAEEAVRGGSRRRLTGRDWPLPDEVRDRMRGVLEPLGFDLGRPVSVREPEGEDAREFSQD |