Basic Information | |
---|---|
Family ID | F097229 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 104 |
Average Sequence Length | 87 residues |
Representative Sequence | MTFKVQRKMCSTCIYRPDSPLDLAKLEADVADKHIGFRGHRICHHSDDVCCRGFWEAHKDEFQLGQVAQRLNLVEFVNVDNLKP |
Number of Associated Samples | 98 |
Number of Associated Scaffolds | 104 |
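The representative sequence can be checked against the reported family average with a few lines of Python. This is a minimal sketch that only uses the sequence string from the table above; the variable names are illustrative and are not part of the database.

```python
from collections import Counter

# Representative sequence copied from the Basic Information table.
representative = (
    "MTFKVQRKMCSTCIYRPDSPLDLAKLEADVADKHIGFRGHRICHHSDDVCCRGFWEAHKDEFQLGQVAQRLNLVEFVNVDNLKP"
)
FAMILY_AVERAGE_LENGTH = 87  # "Average Sequence Length" from the same table

length = len(representative)
composition = Counter(representative)  # residue -> number of occurrences

print(f"length: {length} residues (family average: {FAMILY_AVERAGE_LENGTH})")
print(f"cysteine residues: {composition['C']}")
```

The representative comes out a few residues shorter than the family average, which is expected when the average is taken over all 104 members.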
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Bacteria |
% of genes with valid RBS motifs | 73.08 % |
% of genes near scaffold ends (potentially truncated) | 20.19 % |
% of genes from short scaffolds (< 2000 bps) | 64.42 % |
Associated GOLD sequencing projects | 87 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.85 |
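The percentages in this table can be converted back to approximate gene counts. The sketch below assumes each percentage was computed over all 104 family sequences and rounded to two decimals; that denominator is an assumption, and the function and variable names are illustrative.

```python
FAMILY_SIZE = 104  # "Number of Sequences" from the Basic Information table

# Percentages as reported in the Quality Assessment table.
reported_percent = {
    "valid RBS motif": 73.08,
    "near scaffold end": 20.19,
    "short scaffold (< 2000 bp)": 64.42,
}

def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Recover the nearest whole-number gene count (assumes the denominator is `total`)."""
    return round(percent / 100 * total)

gene_counts = {name: percent_to_count(p) for name, p in reported_percent.items()}

for name, count in gene_counts.items():
    # Recompute the percentage from the count to check that it round-trips.
    print(f"{name}: {count} of {FAMILY_SIZE} ({count / FAMILY_SIZE * 100:.2f} %)")
```

Under that assumption the recovered counts reproduce the reported percentages when rounded to two decimals.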
Most Common Taxonomy | |
---|---|
Group | Bacteria (87.500 % of family members) |
NCBI Taxonomy ID | 2 |
Taxonomy | All Organisms → cellular organisms → Bacteria |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil (9.615 % of family members) |
Environment Ontology (ENVO) | Unclassified (29.808 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (24.038 % of family members) |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 20.54%, β-sheet: 10.71%, Coil/Unstructured: 68.75% |
Structure Viewer | |
---|---|
Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100 | pTM-score: 0.85 |
PDB ID | Structure Name | Biol. Assembly | TM-score |
---|---|---|---|
6sbe | STRUCTURE OF TYPE II TERPENE CYCLASE MSTE_D109N FROM SCYTONEMA IN COMPLEX WITH GERANYLGERANYL DIHYDROXYBENZOATE (SUBSTRATE) | 1 | 0.50297 |
6sbd | STRUCTURE OF TYPE II TERPENE CYCLASE MSTE_D109A FROM SCYTONEMA IN COMPLEX WITH MEROSTEROLIC ACID A (PRODUCT) | 1 | 0.5018 |
6sbb | STRUCTURE OF TYPE II TERPENE CYCLASE MSTE FROM SCYTONEMA (APO) | 1 | 0.50131 |
6sbg | STRUCTURE OF TYPE II TERPENE CYCLASE MSTE_R337A FROM SCYTONEMA IN COMPLEX WITH GERANYLGERANYL DIHYDROXYBENZOATE (SUBSTRATE) | 1 | 0.50111 |
6sbf | STRUCTURE OF TYPE II TERPENE CYCLASE MSTE_Y157F FROM SCYTONEMA (APO) | 1 | 0.50054 |
Pfam ID | Name | % Frequency in 104 Family Scaffolds |
---|---|---|
PF04545 | Sigma70_r4 | 2.88 |
PF02195 | ParBc | 1.92 |
PF00589 | Phage_integrase | 1.92 |
PF13456 | RVT_3 | 1.92 |
PF01068 | DNA_ligase_A_M | 1.92 |
PF13671 | AAA_33 | 0.96 |
PF12083 | DUF3560 | 0.96 |
PF04448 | DUF551 | 0.96 |
PF10520 | Lipid_desat | 0.96 |
PF07589 | PEP-CTERM | 0.96 |
PF12684 | DUF3799 | 0.96 |
PF13560 | HTH_31 | 0.96 |
PF01022 | HTH_5 | 0.96 |
PF01726 | LexA_DNA_bind | 0.96 |
PF12762 | DDE_Tnp_IS1595 | 0.96 |
PF12957 | DUF3846 | 0.96 |
PF05876 | GpA_ATPase | 0.96 |
PF02075 | RuvC | 0.96 |
PF13245 | AAA_19 | 0.96 |
PF05772 | NinB | 0.96 |
PF14359 | DUF4406 | 0.96 |
PF13328 | HD_4 | 0.96 |
PF08719 | NADAR | 0.96 |
PF12843 | QSregVF_b | 0.96 |
PF02511 | Thy1 | 0.96 |
PF13730 | HTH_36 | 0.96 |
PF00149 | Metallophos | 0.96 |
PF04851 | ResIII | 0.96 |
PF01724 | DUF29 | 0.96 |
PF00078 | RVT_1 | 0.96 |
PF12323 | HTH_OrfB_IS605 | 0.96 |
PF12224 | Amidoligase_2 | 0.96 |
PF12705 | PDDEXK_1 | 0.96 |
PF07460 | NUMOD3 | 0.96 |
PF00271 | Helicase_C | 0.96 |
PF14579 | HHH_6 | 0.96 |
PF13361 | UvrD_C | 0.96 |
PF10049 | DUF2283 | 0.96 |
PF04266 | ASCH | 0.96 |
COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds |
---|---|---|---|
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 1.92 |
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 1.92 |
COG0817 | Holliday junction resolvasome RuvABC endonuclease subunit RuvC | Replication, recombination and repair [L] | 0.96 |
COG1351 | Thymidylate synthase ThyX, FAD-dependent family | Nucleotide transport and metabolism [F] | 0.96 |
COG2411 | Predicted RNA-binding protein, contains PUA-like ASCH domain | General function prediction only [R] | 0.96 |
COG3097 | Uncharacterized conserved protein YqfB, UPF0267 family | Function unknown [S] | 0.96 |
COG3236 | N-glycosidase YbiA/RibX (riboflavin biosynthesis, damage control), NADAR superfamily | Defense mechanisms [V] | 0.96 |
COG4405 | Predicted RNA-binding protein YhfF, contains PUA-like ASCH domain | General function prediction only [R] | 0.96 |
COG5525 | Phage terminase, large subunit GpA | Mobilome: prophages, transposons [X] | 0.96 |
Name | Rank | Taxonomy | Distribution |
---|---|---|---|
All Organisms | root | All Organisms | 96.15 % |
Unclassified | root | N/A | 3.85 % |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300000579|AP72_2010_repI_A01DRAFT_1032567 | All Organisms → cellular organisms → Bacteria | 753 | Open in IMG/M |
3300002161|JGI24766J26685_10040823 | All Organisms → Viruses → Predicted Viral | 1070 | Open in IMG/M |
3300002460|C687J35021_10000503 | Not Available | 30594 | Open in IMG/M |
3300005096|Ga0072503_161052 | All Organisms → cellular organisms → Bacteria | 5478 | Open in IMG/M |
3300005336|Ga0070680_101508281 | All Organisms → cellular organisms → Bacteria | 582 | Open in IMG/M |
3300005526|Ga0073909_10056198 | All Organisms → cellular organisms → Bacteria | 1444 | Open in IMG/M |
3300005530|Ga0070679_100361788 | All Organisms → cellular organisms → Bacteria | 1398 | Open in IMG/M |
3300005664|Ga0073685_1186478 | All Organisms → cellular organisms → Bacteria | 516 | Open in IMG/M |
3300005764|Ga0066903_106296579 | All Organisms → cellular organisms → Bacteria | 620 | Open in IMG/M |
3300005841|Ga0068863_100172119 | All Organisms → cellular organisms → Bacteria | 2077 | Open in IMG/M |
3300005843|Ga0068860_100845146 | All Organisms → cellular organisms → Bacteria | 930 | Open in IMG/M |
3300005987|Ga0075158_10344010 | All Organisms → cellular organisms → Bacteria | 841 | Open in IMG/M |
3300006638|Ga0075522_10069016 | All Organisms → cellular organisms → Bacteria | 1985 | Open in IMG/M |
3300006639|Ga0079301_1000042 | All Organisms → cellular organisms → Bacteria | 60267 | Open in IMG/M |
3300006940|Ga0079099_1150361 | All Organisms → Viruses → Predicted Viral | 1289 | Open in IMG/M |
3300007363|Ga0075458_10003032 | All Organisms → cellular organisms → Bacteria | 5484 | Open in IMG/M |
3300008266|Ga0114363_1024242 | All Organisms → cellular organisms → Bacteria | 5323 | Open in IMG/M |
3300008470|Ga0115371_10701260 | All Organisms → cellular organisms → Bacteria | 631 | Open in IMG/M |
3300009083|Ga0105047_10101711 | All Organisms → cellular organisms → Bacteria | 3620 | Open in IMG/M |
3300009146|Ga0105091_10026768 | All Organisms → cellular organisms → Bacteria | 2498 | Open in IMG/M |
3300009149|Ga0114918_10005558 | All Organisms → cellular organisms → Bacteria | 10810 | Open in IMG/M |
3300009176|Ga0105242_10000110 | All Organisms → cellular organisms → Bacteria | 59335 | Open in IMG/M |
3300009177|Ga0105248_10000102 | All Organisms → cellular organisms → Bacteria | 94720 | Open in IMG/M |
3300009500|Ga0116229_10180210 | All Organisms → cellular organisms → Bacteria | 1830 | Open in IMG/M |
3300009551|Ga0105238_11306041 | All Organisms → cellular organisms → Bacteria | 751 | Open in IMG/M |
3300009701|Ga0116228_10443117 | All Organisms → cellular organisms → Bacteria | 893 | Open in IMG/M |
3300009709|Ga0116227_10314993 | All Organisms → cellular organisms → Bacteria | 1197 | Open in IMG/M |
3300009787|Ga0116226_10352082 | All Organisms → cellular organisms → Bacteria | 1510 | Open in IMG/M |
3300009983|Ga0105041_122389 | All Organisms → cellular organisms → Bacteria | 509 | Open in IMG/M |
3300010045|Ga0126311_10773071 | All Organisms → cellular organisms → Bacteria | 772 | Open in IMG/M |
3300010233|Ga0136235_1020290 | Not Available | 1522 | Open in IMG/M |
3300010339|Ga0074046_10606742 | All Organisms → cellular organisms → Bacteria | 647 | Open in IMG/M |
3300010343|Ga0074044_10045514 | All Organisms → cellular organisms → Bacteria | 3033 | Open in IMG/M |
3300010358|Ga0126370_10820538 | All Organisms → cellular organisms → Bacteria | 831 | Open in IMG/M |
3300010396|Ga0134126_11147384 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Selenomonadales → Sporomusaceae → Sporomusa | 865 | Open in IMG/M |
3300010399|Ga0134127_12340387 | All Organisms → cellular organisms → Bacteria | 614 | Open in IMG/M |
3300010400|Ga0134122_11664925 | All Organisms → cellular organisms → Bacteria | 665 | Open in IMG/M |
3300011407|Ga0137450_1007929 | All Organisms → cellular organisms → Bacteria | 1588 | Open in IMG/M |
3300011407|Ga0137450_1083048 | All Organisms → cellular organisms → Bacteria | 618 | Open in IMG/M |
3300011413|Ga0137333_1000881 | All Organisms → cellular organisms → Bacteria | 7742 | Open in IMG/M |
3300011421|Ga0137462_1030479 | All Organisms → Viruses → Predicted Viral | 1103 | Open in IMG/M |
3300011424|Ga0137439_1000541 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3852 | Open in IMG/M |
3300011428|Ga0137456_1078287 | All Organisms → cellular organisms → Bacteria | 829 | Open in IMG/M |
3300012164|Ga0137352_1001403 | All Organisms → cellular organisms → Bacteria | 3469 | Open in IMG/M |
3300012172|Ga0137320_1000988 | All Organisms → cellular organisms → Bacteria | 5080 | Open in IMG/M |
3300012361|Ga0137360_10437284 | All Organisms → Viruses → Predicted Viral | 1107 | Open in IMG/M |
3300012582|Ga0137358_10120234 | All Organisms → cellular organisms → Bacteria | 1785 | Open in IMG/M |
3300012676|Ga0137341_1044535 | All Organisms → cellular organisms → Bacteria | 739 | Open in IMG/M |
3300012929|Ga0137404_11285014 | All Organisms → cellular organisms → Bacteria | 674 | Open in IMG/M |
3300012944|Ga0137410_10110800 | All Organisms → cellular organisms → Bacteria | 2048 | Open in IMG/M |
3300012944|Ga0137410_12070415 | All Organisms → cellular organisms → Bacteria | 507 | Open in IMG/M |
3300012956|Ga0154020_10007187 | All Organisms → cellular organisms → Bacteria | 13017 | Open in IMG/M |
3300012956|Ga0154020_10446004 | All Organisms → cellular organisms → Bacteria | 1096 | Open in IMG/M |
3300014501|Ga0182024_10024690 | All Organisms → cellular organisms → Bacteria | 10665 | Open in IMG/M |
3300014811|Ga0119960_1016568 | All Organisms → cellular organisms → Bacteria | 875 | Open in IMG/M |
3300014879|Ga0180062_1151152 | All Organisms → cellular organisms → Bacteria | 534 | Open in IMG/M |
3300015024|Ga0167669_1103802 | All Organisms → cellular organisms → Bacteria | 894 | Open in IMG/M |
3300015360|Ga0163144_10001782 | All Organisms → cellular organisms → Bacteria | 50395 | Open in IMG/M |
3300015374|Ga0132255_102592245 | All Organisms → cellular organisms → Bacteria | 775 | Open in IMG/M |
3300017792|Ga0163161_10857282 | All Organisms → cellular organisms → Bacteria | 767 | Open in IMG/M |
3300020060|Ga0193717_1070596 | All Organisms → cellular organisms → Bacteria | 1169 | Open in IMG/M |
3300020199|Ga0179592_10341029 | All Organisms → cellular organisms → Bacteria | 661 | Open in IMG/M |
3300021420|Ga0210394_10959358 | All Organisms → cellular organisms → Bacteria | 742 | Open in IMG/M |
3300022878|Ga0247761_1005869 | All Organisms → cellular organisms → Bacteria | 2042 | Open in IMG/M |
(restricted) 3300023112|Ga0233411_10000670 | All Organisms → cellular organisms → Bacteria | 10196 | Open in IMG/M |
3300024262|Ga0210003_1013310 | All Organisms → cellular organisms → Bacteria | 5432 | Open in IMG/M |
3300025012|Ga0209727_1000490 | Not Available | 43090 | Open in IMG/M |
3300025635|Ga0208147_1002627 | All Organisms → cellular organisms → Bacteria | 5484 | Open in IMG/M |
3300025924|Ga0207694_10470300 | All Organisms → cellular organisms → Bacteria | 1051 | Open in IMG/M |
3300025941|Ga0207711_10154914 | All Organisms → cellular organisms → Bacteria | 2070 | Open in IMG/M |
3300026300|Ga0209027_1109422 | All Organisms → cellular organisms → Bacteria | 978 | Open in IMG/M |
3300026320|Ga0209131_1249228 | All Organisms → cellular organisms → Bacteria | 718 | Open in IMG/M |
3300026320|Ga0209131_1349950 | All Organisms → cellular organisms → Bacteria | 549 | Open in IMG/M |
3300026480|Ga0257177_1008562 | All Organisms → cellular organisms → Bacteria | 1317 | Open in IMG/M |
3300026557|Ga0179587_10580739 | All Organisms → cellular organisms → Bacteria | 737 | Open in IMG/M |
3300027499|Ga0208788_1000063 | All Organisms → cellular organisms → Bacteria | 60294 | Open in IMG/M |
3300027675|Ga0209077_1060423 | All Organisms → cellular organisms → Bacteria | 1037 | Open in IMG/M |
3300027805|Ga0209229_10002129 | All Organisms → cellular organisms → Bacteria | 8119 | Open in IMG/M |
3300027821|Ga0209811_10020042 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 2185 | Open in IMG/M |
3300027860|Ga0209611_10244117 | All Organisms → cellular organisms → Bacteria | 1076 | Open in IMG/M |
(restricted) 3300027861|Ga0233415_10008590 | All Organisms → cellular organisms → Bacteria | 4041 | Open in IMG/M |
3300027910|Ga0209583_10046240 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Hyphomicrobium → Hyphomicrobium sulfonivorans | 1515 | Open in IMG/M |
3300027968|Ga0209061_1101526 | All Organisms → cellular organisms → Bacteria | 1009 | Open in IMG/M |
3300028381|Ga0268264_10795502 | All Organisms → cellular organisms → Bacteria | 944 | Open in IMG/M |
3300028800|Ga0265338_10001083 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 45206 | Open in IMG/M |
3300028800|Ga0265338_10143188 | All Organisms → Viruses → Predicted Viral | 1869 | Open in IMG/M |
3300028806|Ga0302221_10370278 | All Organisms → cellular organisms → Bacteria | 624 | Open in IMG/M |
3300028882|Ga0302154_10466583 | All Organisms → cellular organisms → Bacteria | 603 | Open in IMG/M |
3300029907|Ga0311329_10781192 | All Organisms → cellular organisms → Bacteria | 611 | Open in IMG/M |
3300029917|Ga0311326_10006517 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 7527 | Open in IMG/M |
3300029956|Ga0302150_10114200 | All Organisms → cellular organisms → Bacteria | 1036 | Open in IMG/M |
3300030520|Ga0311372_11143635 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1005 | Open in IMG/M |
3300031232|Ga0302323_100194079 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 2048 | Open in IMG/M |
3300031539|Ga0307380_10018087 | All Organisms → cellular organisms → Bacteria | 8504 | Open in IMG/M |
3300031565|Ga0307379_10039850 | Not Available | 5531 | Open in IMG/M |
3300031565|Ga0307379_10605641 | All Organisms → cellular organisms → Bacteria | 1003 | Open in IMG/M |
3300031707|Ga0315291_10084264 | All Organisms → Viruses → Predicted Viral | 3469 | Open in IMG/M |
3300031708|Ga0310686_109373880 | All Organisms → cellular organisms → Bacteria | 1086 | Open in IMG/M |
3300031726|Ga0302321_102022835 | All Organisms → cellular organisms → Bacteria | 669 | Open in IMG/M |
3300031857|Ga0315909_10375926 | All Organisms → Viruses → Predicted Viral | 1028 | Open in IMG/M |
3300031951|Ga0315904_10263068 | All Organisms → cellular organisms → Bacteria | 1643 | Open in IMG/M |
3300032053|Ga0315284_11936717 | All Organisms → cellular organisms → Bacteria | 601 | Open in IMG/M |
3300032397|Ga0315287_10260571 | All Organisms → Viruses → Predicted Viral | 2037 | Open in IMG/M |
3300032515|Ga0348332_13170495 | All Organisms → cellular organisms → Bacteria | 1010 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
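Rows of the scaffold table above follow a regular layout and can be parsed programmatically. The sketch below is one possible approach, assuming the pipe-delimited layout shown on this page; the function name, field names, and the three sample rows (copied from the table) are illustrative and do not represent an IMG/M API.

```python
SHORT_SCAFFOLD_BP = 2000  # threshold used in the Quality Assessment table

def parse_scaffold_row(row: str) -> dict:
    """Parse one 'TaxonOID|ScaffoldID | Taxonomy | Length | Link |' table row."""
    cells = [cell.strip() for cell in row.split(" | ")]
    scaffold_cell, taxonomy, length = cells[0], cells[1], int(cells[2])
    # Restricted datasets carry a "(restricted)" prefix on the scaffold cell.
    restricted = scaffold_cell.startswith("(restricted)")
    scaffold_cell = scaffold_cell.replace("(restricted)", "").strip()
    taxon_oid, scaffold_id = scaffold_cell.split("|", 1)
    return {
        "taxon_oid": taxon_oid,
        "scaffold_id": scaffold_id,
        "lineage": taxonomy.split(" → "),
        "length_bp": length,
        "restricted": restricted,
    }

# Three rows copied from the table above, used only as sample input.
sample_rows = [
    "3300000579|AP72_2010_repI_A01DRAFT_1032567 | All Organisms → cellular organisms → Bacteria | 753 | Open in IMG/M |",
    "3300002460|C687J35021_10000503 | Not Available | 30594 | Open in IMG/M |",
    "(restricted) 3300023112|Ga0233411_10000670 | All Organisms → cellular organisms → Bacteria | 10196 | Open in IMG/M |",
]

parsed = [parse_scaffold_row(row) for row in sample_rows]
short = [p for p in parsed if p["length_bp"] < SHORT_SCAFFOLD_BP]
print(f"{len(short)} of {len(parsed)} sample scaffolds are shorter than {SHORT_SCAFFOLD_BP} bp")
```

Applied to all rows, the same filter gives the share of scaffolds below 2000 bp among the listed scaffolds.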
Habitat | Taxonomy | Distribution |
---|---|---|
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 9.62% |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 6.73% |
Host-Associated | Host-Associated → Plants → Peat Moss → Unclassified → Unclassified → Host-Associated | 4.81% |
Bog | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Bog | 3.85% |
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 2.88% |
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.88% |
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.88% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.88% |
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 2.88% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.88% |
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.92% |
Freshwater And Sediment | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater And Sediment | 1.92% |
Freshwater | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater | 1.92% |
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 1.92% |
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 1.92% |
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 1.92% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.92% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.92% |
Bog Forest Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Bog Forest Soil | 1.92% |
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.92% |
Deep Subsurface | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface | 1.92% |
Fen | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Fen | 1.92% |
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 1.92% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.92% |
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.92% |
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 1.92% |
Active Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Active Sludge | 1.92% |
Freshwater, Plankton | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton | 0.96% |
Freshwater Microbial Mat | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Microbial Mat | 0.96% |
Freshwater | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Freshwater | 0.96% |
Aquatic | Environmental → Aquatic → Freshwater → Lotic → Unclassified → Aquatic | 0.96% |
Soil | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil | 0.96% |
Aquatic | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Aquatic | 0.96% |
Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 0.96% |
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 0.96% |
Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment | 0.96% |
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.96% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.96% |
Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.96% |
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.96% |
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.96% |
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.96% |
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.96% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.96% |
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.96% |
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.96% |
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.96% |
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.96% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.96% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.96% |
Switchgrass Associated | Host-Associated → Plants → Phyllosphere → Leaf → Endophytes → Switchgrass Associated | 0.96% |
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 0.96% |
Wastewater Effluent | Engineered → Wastewater → Nutrient Removal → Unclassified → Unclassified → Wastewater Effluent | 0.96% |
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
3300000579 | Forest soil microbial communities from Amazon forest - Pasture72 2010 replicate I A01 | Environmental | Open in IMG/M |
3300002161 | Freshwater and sediment microbial communities from dead zone in Sandusky Bay, Ohio, USA | Environmental | Open in IMG/M |
3300002460 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank highO2_1.2 | Environmental | Open in IMG/M |
3300005096 | Hydrothermal chimney microbial communities from the East Pacific Rise - M vent 7 | Environmental | Open in IMG/M |
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental | Open in IMG/M |
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental | Open in IMG/M |
3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental | Open in IMG/M |
3300005664 | Freshwater viral communities from Emiquon reservoir, Havana, Illinois, USA | Environmental | Open in IMG/M |
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated | Open in IMG/M |
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated | Open in IMG/M |
3300005987 | Wastewater effluent complex algal communities from Wisconsin, to seasonally profile nutrient transformation and Carbon sequestration - JI 9/18/14 B DNA | Engineered | Open in IMG/M |
3300006638 | Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE PermafrostL2-A | Environmental | Open in IMG/M |
3300006639 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series FC 2014_7_11 | Environmental | Open in IMG/M |
3300006940 | Active sludge microbial communities from Illinois, USA, of municipal wastewater-treating anaerobic digesters - ADurb_H2B_02_SludgeMetaT (Metagenome Metatranscriptome) | Engineered | Open in IMG/M |
3300007363 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA | Environmental | Open in IMG/M |
3300008266 | Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample HABS-E2014-0108-C-NA | Environmental | Open in IMG/M |
3300008470 | Sediment core microbial communities from Adelie Basin, Antarctica. Combined Assembly of Gp0136540, Gp0136562, Gp0136563 | Environmental | Open in IMG/M |
3300009083 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-04 (megahit assembly) | Environmental | Open in IMG/M |
3300009146 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 | Environmental | Open in IMG/M |
3300009149 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG | Environmental | Open in IMG/M |
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated | Open in IMG/M |
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated | Open in IMG/M |
3300009500 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG | Host-Associated | Open in IMG/M |
3300009551 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG | Host-Associated | Open in IMG/M |
3300009701 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum fallax MG | Host-Associated | Open in IMG/M |
3300009709 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fb - Sphagnum magellanicum MG | Host-Associated | Open in IMG/M |
3300009787 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fa - Sphagnum fallax MG | Host-Associated | Open in IMG/M |
3300009983 | Switchgrass associated microbial communities from Austin, Texas, USA - LS_189 metaG | Host-Associated | Open in IMG/M |
3300010045 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot61 | Environmental | Open in IMG/M |
3300010233 | Filterable freshwater microbial communities from Conwy River, North Wales, UK. Fraction, filtered through 0.2 um filter. After WGA. | Environmental | Open in IMG/M |
3300010339 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM3 | Environmental | Open in IMG/M |
3300010343 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - Bog Forest MetaG ECP23OM1 | Environmental | Open in IMG/M |
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental | Open in IMG/M |
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental | Open in IMG/M |
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental | Open in IMG/M |
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental | Open in IMG/M |
3300011407 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT454_2 | Environmental | Open in IMG/M |
3300011413 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT231_2 | Environmental | Open in IMG/M |
3300011421 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT769_2 | Environmental | Open in IMG/M |
3300011424 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT200_2 | Environmental | Open in IMG/M |
3300011428 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT615_2 | Environmental | Open in IMG/M |
3300012164 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT730_2 | Environmental | Open in IMG/M |
3300012172 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT366_2 | Environmental | Open in IMG/M |
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
3300012676 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT433_2 | Environmental | Open in IMG/M |
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012956 | Active sludge microbial communities from wastewater, Klosterneuburg, Austria - Klosneuvirus_20160825_MG | Engineered | Open in IMG/M |
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental | Open in IMG/M |
3300014811 | Aquatic viral communities from ballast water - Michigan State University - AB_ballast water | Environmental | Open in IMG/M |
3300014879 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10D | Environmental | Open in IMG/M |
3300015024 | Arctic sediment microbial communities from supraglacial cryoconite, Rabots glacier, Tarfala, Sweden (Sample Rb cryoconite) | Environmental | Open in IMG/M |
3300015360 | Freshwater microbial mat bacterial communities from Lake Vanda, McMurdo Dry Valleys, Antarctica - Oligotrophic Lake LV.19.BULKMAT1 | Environmental | Open in IMG/M |
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated | Open in IMG/M |
3300017792 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG | Host-Associated | Open in IMG/M |
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental | Open in IMG/M |
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental | Open in IMG/M |
3300022878 | Plant litter microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-L111-311C-4 | Environmental | Open in IMG/M |
3300023112 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_2_MG | Environmental | Open in IMG/M |
3300024262 | Deep subsurface microbial communities from Baltic Sea to uncover new lineages of life (NeLLi) - Landsort_02402 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025012 | Soil microbial communities from Rifle, Colorado, USA - Groundwater C1 | Environmental | Open in IMG/M |
3300025635 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA (SPAdes) | Environmental | Open in IMG/M |
3300025924 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300026300 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental | Open in IMG/M |
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental | Open in IMG/M |
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental | Open in IMG/M |
3300027499 | Deep subsurface shale carbon reservoir microbial communities from Ohio, USA - Utica-2 Time Series FC 2014_7_11 (SPAdes) | Environmental | Open in IMG/M |
3300027675 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 (SPAdes) | Environmental | Open in IMG/M |
3300027805 | Freshwater and sediment microbial communities from dead zone in Sandusky Bay, Ohio, USA (SPAdes) | Environmental | Open in IMG/M |
3300027821 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes) | Environmental | Open in IMG/M |
3300027860 | Host-associated microbial communities from peat moss isolated from Minnesota, USA - S1T2_Fc - Sphagnum magellanicum MG (SPAdes) | Host-Associated | Open in IMG/M |
3300027861 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_12_MG | Environmental | Open in IMG/M |
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental | Open in IMG/M |
3300027968 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen10_05102014_R1 (SPAdes) | Environmental | Open in IMG/M |
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300028800 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seattle, WA, United States - 4-21-26 metaG | Host-Associated | Open in IMG/M |
3300028806 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - II_Palsa_E2_1 | Environmental | Open in IMG/M |
3300028882 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Bog_N2_3 | Environmental | Open in IMG/M |
3300029907 | I_Bog_N1 coassembly | Environmental | Open in IMG/M |
3300029917 | I_Bog_E1 coassembly | Environmental | Open in IMG/M |
3300029956 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - I_Bog_N1_2 | Environmental | Open in IMG/M |
3300030520 | III_Palsa_N2 coassembly | Environmental | Open in IMG/M |
3300031232 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_3 | Environmental | Open in IMG/M |
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental | Open in IMG/M |
3300031565 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-2 | Environmental | Open in IMG/M |
3300031707 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_20 | Environmental | Open in IMG/M |
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental | Open in IMG/M |
3300031726 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Fen_T0_1 | Environmental | Open in IMG/M |
3300031857 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125 | Environmental | Open in IMG/M |
3300031951 | Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA120 | Environmental | Open in IMG/M |
3300032053 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_16 | Environmental | Open in IMG/M |
3300032397 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_0 | Environmental | Open in IMG/M |
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
---|---|---|---|
AP72_2010_repI_A01DRAFT_10325672 | 3300000579 | Forest Soil | MFKVQRKPCATCIYRSDSVLDLGKLEAEIADPHMKGFFSGYRICHHSDDACCRGFWSRHKNKFAMGQIAQRLGLVEFVDVDTLAELKRKGASNAKAR* |
JGI24766J26685_100408234 | 3300002161 | Freshwater And Sediment | STCIYRPDSPLDLAKLEADVADGYGGFKGHRVCHHSDNACCAGFWARHKNEFQLGQIAQRLGMVEFVEDDKFRESGKRVEP* |
C687J35021_1000050349 | 3300002460 | Soil | VKVCSRQCPTCIYRPDSLFDLKKLEAQIADPYMAGFFKGHRVCHHTKDACCRGFWNKHKDSFALGQVAQRMNVVIFVAPGG* |
Ga0072503_1610522 | 3300005096 | Marine | MFRVQKKMCSTCIYRPDSTLDLDSLEDAVRDPHVGFKAHRICHHSADVCCRGFWEAHKGEFPAGQMAQRLGVVDFVDVDNLIEVTK* |
Ga0070680_1015082812 | 3300005336 | Corn Rhizosphere | VAMSFPVQRRQCATCIYRKDSPLDLKKLEREIADPYMKGFFSGHRICHHSDTACCAGFFARHKDHFPLGQIAQRLGFVEYVEHDKMVGIKRTTR* |
Ga0073909_100561985 | 3300005526 | Surface Soil | VKRGVGFKVQRRMCRTCIYHKGSPLDLAELERQVRDPHMGFKGFRICHHSKDACCRGFWDMHKDEFAVGQVAQRLGLVCFVDIDIIKRITHRERKVK* |
Ga0070679_1003617884 | 3300005530 | Corn Rhizosphere | MSFPVQRRQCATCIYRKDSPLDLKKLEREIADPYMKGFFSGHRICHHSDTACCAGFFARHKDHFPLGQIAQRLGFVEYVEHDKMVGIKRTTR* |
Ga0073685_11864781 | 3300005664 | Aquatic | MFLVQRKQCSTCIYRADSPLDLNQLEEQVKDPYGGFSGHRVCHHTGKGNEACCAGFWARHKDEFQLGQVAQRMGMV |
Ga0066903_1062965792 | 3300005764 | Tropical Forest Soil | MKEVPADLKTELGTLKVQARPCDTCIYRSDSPLDLESLEEVVRDPYIGFKGFRVCHHSDDACCRGFWNRHKDAFAAGQIAQRLGLVEYVNDD |
Ga0068863_1001721191 | 3300005841 | Switchgrass Rhizosphere | MLRVQRRQCATCIYRADSPLDIVKLENDVRDPYVGFKGHRICHHSRDAVCAGFWARHKWAFALGQIAQRLGMVEY |
Ga0068860_1008451461 | 3300005843 | Switchgrass Rhizosphere | MFKVQKRQCETCIYRKSSPLDIKRLEAQVADKYGGYKGHRICHHSKDACCRGFWDRHKDQFQMGQLAQ |
Ga0075158_103440101 | 3300005987 | Wastewater Effluent | MTFKVQEKMCKTCIYRPDSPLDLEKLEDQVRDNYGGFKGHRICHHASEACCAGFWAKHKDEFQMGQIAQRLNMVELVNIDETPA* |
Ga0075522_100690166 | 3300006638 | Arctic Peat Soil | MRSIGIKVQKRACSTCIYRKDSTLDIKELEREIADPRMPGFFRGHRICHHSKDVACRGFWNRHKNHFTLGQLAQRFRAVFFVSVDTLSRRKV* |
Ga0079301_100004285 | 3300006639 | Deep Subsurface | MRVQKTQCSTCIYRPDSPLDLAKLEADVADGYGGFKGHRICHHSDDACCAGFWARHKNEFQLGQIAQRFGVVEFVQDDTLKGKTK* |
Ga0079099_11503613 | 3300006940 | Anaerobic Digestor Sludge | MFRVQNRQCSTCIYRADNPLDIVELENQVRDPYGGFSGHRICHHTDGDQEACCAGFWARHKDEFQLGQVAQRLGMVRYVDVDTLAKDKE* |
Ga0075458_100030329 | 3300007363 | Aqueous | MTFKVQRKMCSTCIYRPDSPLDLAKLEADVADKHIGFRGHRICHHSDDVCCRGFWEAHKDEFQLGQVAQRLNLVEFVNVDNLKP* |
Ga0114363_10242423 | 3300008266 | Freshwater, Plankton | MRVQKTQCSTCIYRPDSPLDLAKLEAAVADGYGGFTGHRICHHSDDACCAGFWARHKNEFQLGKIAQRLGMVEFVEDDTLKGKTK* |
Ga0115371_107012603 | 3300008470 | Sediment | MKFKVQKKLCSTCIYRPSSPLDLKMLEDQVRDEYVGFKGHRMCHHSKDVCCRGFWEAHKDEFPMGQIAQRMGLVEFVREDTL* |
Ga0105047_101017113 | 3300009083 | Freshwater | MTGFRVQSKQCSTCIYRPDSPLDLENLESQIADPYGGFTGYRICHNSDDACCAGFWAKHKDEFPMGQVSQRLNLVDLVEDDRLKKTPA* |
Ga0105091_100267682 | 3300009146 | Freshwater Sediment | MCATCIYRPDSPLDLRRLENAIRDNYGGFKDYRVCHHSDDVCCRGFWLHHKNKFAMGQIAQRLNCVEFVDVDKS* |
Ga0114918_1000555812 | 3300009149 | Deep Subsurface | MFKVQRTACSTCIFKKSSPLDLDRLLNEIRDPYGGFSGHRICHHSEDACCAGFWKNHKDEFALGQIAQRLGMVEKVDADILPLKESDDVRS* |
Ga0105242_1000011044 | 3300009176 | Miscanthus Rhizosphere | LTGFAVQARACRTCIYRKDSPLDLAQLEAAVADDYGGFHSFRICHHSRDACCRGFWDRHKNKFALGQVAQRLGMVRFVKDDIPNDR* |
Ga0105248_10000102141 | 3300009177 | Switchgrass Rhizosphere | VAVGVGWYCRDAGCVGGDGRVVAPVTGFEVQKRQCATCIYRKDSPLDLKALERQIADPYGGFVGHRICHHSDTACCRGFWSRHKNKFPLGQIAQRLGMVRYVEHDKA* |
Ga0116229_101802104 | 3300009500 | Host-Associated | MLTVQRRLCKTCIYRPDSALDLTKLENDVRDPNMAGFFIGSRICHHSEDAVCRGFWNAHRNHFTAGQIAQRLGMVRFVDQDTLA* |
Ga0105238_113060412 | 3300009551 | Corn Rhizosphere | LTGFAVQARACRTCIYRKDSPLDLAQLEAAMADDYGGFHSFRICHHSRDACCRGFWDRHKNKFALGQVAQRLGMVRFVKDDIPNDR* |
Ga0116228_104431173 | 3300009701 | Host-Associated | MTFKVQSRPCESCIYRADSPLDLDRLEQCVKDKYGFFNGHRICHHTKDVCCRGFWNLHKDEFPLGQLAQRLDFVEFVDVDDWKQNENGAGKRP* |
Ga0116227_103149932 | 3300009709 | Host-Associated | MTGFLVMSKRCSTCIYRKDSHFDLKKHEDEVRDPHMGFKGHRICHHSSKSKPACCNGFWTEHKDEFAAGQLAQRLNLVEFIEPGKSA* |
Ga0116226_103520822 | 3300009787 | Host-Associated | MLTVQRRLCKTCIYRPDSALDLTKLENDVRDPNMSGYFIGSRICHHSEDVVCRGFWNAHRNHFTAGQIAQRLGMVRFVDVDTLA* |
Ga0105041_1223892 | 3300009983 | Switchgrass Associated | MTFKVQARMCATCIYRPDSPLDIEKLENDVRDPYVGFKGHRICHHSDDVCCRGFWEAHKDEFPAGQIAQRLGLVEFVNVDTLGDRHGE* |
Ga0126311_107730712 | 3300010045 | Serpentine Soil | VGFKVMDKPCGTCIYRKDSPLDLRGLEDQVRSPHGGFSGHRVCHHSEKGGGCCRGFWNAHKDEFQAGQIAQRLGVVDFIPADDEEQAA* |
Ga0136235_10202903 | 3300010233 | Freshwater | MFKVQSKQCSTCIYRKSSPLDLKKLEGDIKDNYGGFYGYRICHHSDDVCCRGFWNRHKDKFSLGQIAQRLGFVKFVSEDRLKRN* |
Ga0074046_106067421 | 3300010339 | Bog Forest Soil | MSFHVQKAACSTCIYRKDSPLDLEKLEGEIADAYGGFNGYRECHHAAPGSGVCCRGFWNRHKDRFAAGQIAQRLDLV* |
Ga0074044_100455144 | 3300010343 | Bog Forest Soil | VSKQAEEAGMTFRMQRYRCRTCIYRKDSALDLKVLEDAVRDRYVGFRGYRICHHSKALCCRGFWDRHKDEFQLGQIVQRLNLVEFVTEDKLA* |
Ga0126370_108205382 | 3300010358 | Tropical Forest Soil | MFKVQRRRCKTCIYRKDSTLDIEKLENDVRDKYMGFKGYRICHHSKDVCCKGFWDHHKDEFQAGQIAQRLKLVQFVDVDILT* |
Ga0134126_111473842 | 3300010396 | Terrestrial Soil | MPDRIRPREVDSGFRVQSKACATCIYRKDSPLNIKALEAQVADGYGGFRGHRICHHSKDVCCRGFWNRHKDKFQLGRIAQRLGMVRFVKVDNA* |
Ga0134127_123403872 | 3300010399 | Terrestrial Soil | MFEVQAVACSSCIYRKDSPLDVKKLEATIADGYGGFRSFRICHHSDSACCRGFWNRHKDKFQVGQLAQRLRAVAFVRHDNQKEKRR* |
Ga0134122_116649251 | 3300010400 | Terrestrial Soil | MSEGFRVQAKQCATCISRPGSPLDLKKLEAEVADDYGGFKTFRVCHHSEDACCRGFWNRHKDEFQVAQIAQRLNLVKFVEAEP* |
Ga0137450_10079293 | 3300011407 | Soil | MGFGFKVQKRACATCIYRKDSSLDIKKLENDVRDRFMGFKGHRICHHSKDVCCRGFWNRHKDEFPAGQIAQRLKLVKFVSVDTLRGKG* |
Ga0137450_10830482 | 3300011407 | Soil | MRKQDKPGFKVMKKPCPTCIYRKDSSLDLKKLEADVADKYGGFKGYRICHHSKDVCCRGFWNKHKDEFAAGQIAQRLNCVVFVEGNDERKTKRPD* |
Ga0137333_10008813 | 3300011413 | Soil | MGFKVRRTQCNTCIYRKDSPLDLKKLEDQVRDKYVGFKGHRICHHSKGICCRGFWVRHRDEFAAGQIAQRLNLVEFTNGDTSRTDGLLGM* |
Ga0137462_10304791 | 3300011421 | Soil | VQSKQCNTCIYRKDSPLDLKQLEAQIADPHGGFKGHRICHHSEDACCRGFWNRHKDKFAIGQIAQRLDAVMFVKDDNAESV* |
Ga0137439_10005413 | 3300011424 | Soil | MSFKVQRKQCATCIYRADSPLDLAKLEADVADPCGFGFKGHRICHHSAPGSDSCCRGFWNRHKDEFPAGQIAQRLGLVEFIK* |
Ga0137456_10782872 | 3300011428 | Soil | MRKKDKPGFKVMKKSCATCIYRKDSSLDLKKLEADVADKYGGFKGYRICHHSKDVCCRGFWNKHKDEFAAGQIAQRLNCVVFVEGNDERKTKRPR* |
Ga0137352_10014033 | 3300012164 | Soil | MGFKVRRTQCNTCIYRKDSPLDLKKLEDQVRDKYVGFKGHRICHHSKGICCRGFWVRHRDEFAAGQIAQRLNLVEFTNGDTSRTEGLLGM* |
Ga0137320_10009882 | 3300012172 | Soil | VNVDLGFKVQRRMCATCIYRPDSALDLKKLERDVADKHMAGYFRGHRICHHSKDVCCRGFWDKHKDDFTAGQVAQRLKLVRFVDVDTLKGKV* |
Ga0137360_104372842 | 3300012361 | Vadose Zone Soil | MKVQKVACSTCIYRKDSPLSIKKLEADVADKYGGFKGYRICHHSKDVCCRGFWNRHKNEFALGQIAQRLKLVEFVDVDTLTTRKS* |
Ga0137358_101202342 | 3300012582 | Vadose Zone Soil | MSFRVQKKQCKTCIYRSDCPLDIAKLEDQVRDKYIGFSGHRICHHPSKQEPICCRGFWNLHKDEFAAGQIAQRLNCVEFVTVDYLA* |
Ga0137341_10445351 | 3300012676 | Soil | VSRRHRGFVVQRKMCATCIYRPDSVLDLAELERQVKDRHMGFRGYRICHHSKDACCRGFWEAHKNEFALGQIAQRLNFVRFVDDDLLT* |
Ga0137404_112850143 | 3300012929 | Vadose Zone Soil | MTFKVQEKQCSTCIYRIESPLDLKVLEDQVRDPHVGFKGHRICHNSRDVCCRGFWNRHKDEFPMGQIAQRLNFVEFVKVREK* |
Ga0137410_101108003 | 3300012944 | Vadose Zone Soil | MMFKVQHKPCASCIYRKDSPLDLAKLEREIADPYGGFKGWRICHNSKDVCCRGFWNRHKDSFALGQIAQRLGMVEFVKEKTDGETRYRP* |
Ga0137410_120704152 | 3300012944 | Vadose Zone Soil | AMTFKVQRRMCATCIYRPDSTLDLVKLENDVRDPYVGFKGHRVCHHAPDRSAVCCRGFWDRHKDEFTAGQIAQRLNFVEFVDVDRFAKGAV* |
Ga0154020_1000718719 | 3300012956 | Active Sludge | MFKVQKTQCSTCIYRPDSPLDLEKLEAEIADSYGGFKGHRICHHSKDVCCAGFWARHKDEFQLGQVAQRLQGVEFVVCDTLSSKKDIENGSISD* |
Ga0154020_104460042 | 3300012956 | Active Sludge | MFKVQKTQCSTCIYRPDSPLDLEKLEAEIADPYGGFKGHRICHHSADVCCAGFWARHKNEFQLGQVAQRLQGVEFVVCDTLSSKKDIENDGDLD* |
Ga0182024_1002469024 | 3300014501 | Permafrost | MIGDTFRVQRRMCDTCIYRSDCPLELAMLEEQVRDPHIGFRSYRVCHHSDDVCCRGFWNAHKDEFPVGQIAQRLGLVELVDVDLLR* |
Ga0119960_10165682 | 3300014811 | Aquatic | MAEFEGFKVQRKQCSTCIYLTNSPLDLARLEAQVADKWGGFHSYRVCHHSEDVCCRGFWNRHKDKFAMGQIAQRLGFVRFVTVDTLAKEGTGK* |
Ga0180062_11511522 | 3300014879 | Soil | MLRVQKKQCETCIYRKDSPLDLKELEAAIADPYGGFKGHRICHHSKDACCQGFWKRHKDKFQLGQIAQRLNMVLFVQDDTLKGKKK* |
Ga0167669_11038023 | 3300015024 | Glacier Forefield Soil | MMSGFLVQTKPCSTCIYRKDSPLDIKKLEADVADSYGGFKGHRVCHHSDTACCRGFWNRHKDDFQMGQVAQRLGMVRMVDHDTLK* |
Ga0163144_100017825 | 3300015360 | Freshwater Microbial Mat | MTGFRVQSKQCSTCIYRPDSPLDLENLESQIADPYGGFTGHRICHNSDDACCAGFWAKHKDEFPMGQVSQRLNLVDLVEDDRLKKTPA* |
Ga0132255_1025922452 | 3300015374 | Arabidopsis Rhizosphere | MTKRAEPKVVGFRVQRKACATCIYRADSPLDLAHLESQVADKFGGFRGHRVCHHSRDACCRGFWNRHKDEFQMGQIAQRLGFVVFVDDDIRR* |
Ga0163161_108572822 | 3300017792 | Switchgrass Rhizosphere | MTFKVQKKACATCIYRKDSSLDIKKLENDVRDKHMGFKGHRICHHSKDVCCRGFWNRHKNEFALGQIAQRLNMVEFVTVDTIKKGKS |
Ga0193717_10705963 | 3300020060 | Soil | MTFKVQKRMCSTCIYRPDSPLDIEKLESDVRDPYVGFSGHRVCHHSADVCCRGFWEAHKDEFPMGQVAQRLGFVEFVYVDTLQDNGT |
Ga0179592_103410291 | 3300020199 | Vadose Zone Soil | MFKVQAKSCSTCIYRKDSSLDIKQLEEQIADGYGGFKGHRICHHSEDVCCRGFWNRHKDEFQAGQLAQRLGWVKFVNED |
Ga0210394_109593582 | 3300021420 | Soil | MTFGLKVQKKMCATCIYRPDSTLDLKKLEADVADPHMAGFFKGSRTCHHSEDAVCRGFWEAHKDSFTAGQIAQRLNMVEFVDEDVFDPLHGRYG |
Ga0247761_10058694 | 3300022878 | Plant Litter | VGFKVQQRMCATCIYKPNFNLDLRKLENDVRDQHIGFKEHRICHHSKDVCCRGFWDAHKDEFQAGQLAQRLGCVEFVDVDTLE |
(restricted) Ga0233411_1000067013 | 3300023112 | Seawater | MKTFKVQKTRCTTCIYKPDSPLDLKELEAQVADNYGGFQGHRICHHSEDACCSGFWKKNKDKFQLGQIAQRLGMVEEVEVDTLT |
Ga0210003_10133103 | 3300024262 | Deep Subsurface | MFKVQRTACSTCIFKKSSPLDLDRLLNEIRDPYGGFSGHRICHHSEDACCAGFWKNHKDEFALGQIAQRLGMVEKVDADILPLKESDDVRS |
Ga0209727_100049016 | 3300025012 | Soil | VKVCSRQCPTCIYRPDSLFDLKKLEAQIADPYMAGFFKGHRVCHHTKDACCRGFWNKHKDSFALGQVAQRMNVVIFVAPGG |
Ga0208147_10026279 | 3300025635 | Aqueous | MTFKVQRKMCSTCIYRPDSPLDLAKLEADVADKHIGFRGHRICHHSDDVCCRGFWEAHKDEFQLGQVAQRLNLVEFVNVDNLKP |
Ga0207694_104703002 | 3300025924 | Corn Rhizosphere | LTGFAVQARACRTCIYRKDSPLDLAQLEAAVADDYGGFHSFRICHHSRDACCRGFWDRHKNKFALGQVAQRLGMVRFVKDDIPNDR |
Ga0207711_101549141 | 3300025941 | Switchgrass Rhizosphere | VVAPVTGFEVQKRQCATCIYRKDSPLDLKALERQIADPYGGFVGHRICHHSDTACCRGFWSRHKNKFPLGQIAQRLGMVRYVEHDKA |
Ga0209027_11094222 | 3300026300 | Grasslands Soil | VGGVDPVRGFQVQRRMCATCIYRPTCALDVAKLENDVRDKHMGFKGHRVCHHAPDKSGICCRGFWDRHKDEFPAGQIAQRLRAVTFVDVDILVTRP |
Ga0209131_12492282 | 3300026320 | Grasslands Soil | MSGFRVMAKQCATCIYRKDSPLDIKKLEAQIKDRFMGFRTYRQCHHSRKGNTGCCRGFWNRHKDEFPAGQIAQRLNCVVFVNGQL |
Ga0209131_13499501 | 3300026320 | Grasslands Soil | MFKVQRKQCETCIYRKDSPLDLAQLEAAISDPYVGFRGWRICHHTDDVCCRGFSNRHKDEFQMG |
Ga0257177_10085624 | 3300026480 | Soil | MFKVQRKQCETCIYRKDSPLDLAQLEAAIADPHVGFRGWRICHHTDDVCCRGFWNRHKDEFQMGQIAQRLGFVEFV |
Ga0179587_105807393 | 3300026557 | Vadose Zone Soil | MFKVQAKSCSTCIYRKDSSLDIKQLEEQIADGYGGFKGHRICHHSEDVCCRGFWNRHKDEFQAGQLAQRLGWVKFVNEDNQ |
Ga0208788_100006345 | 3300027499 | Deep Subsurface | MRVQKTQCSTCIYRPDSPLDLAKLEADVADGYGGFKGHRICHHSDDACCAGFWARHKNEFQLGQIAQRFGVVEFVQDDTLKGKTK |
Ga0209077_10604232 | 3300027675 | Freshwater Sediment | MQEKSSVFKVQRRLCKTCIYRPSSTLDLKALEDQVRDPYVGFKDYRVCHHSIHACCRGFWNAHKDEFTLGQLAQRLNCVEFVNEETSVVDGMERRK |
Ga0209229_100021296 | 3300027805 | Freshwater And Sediment | MRVQKTQCSTCIYRPDSPLDLAKLEADVADGYGGFKGHRVCHHSDNACCAGFWARHKNEFQLGQIAQRLGMVEFVEDDKFRESGKRVEP |
Ga0209811_100200425 | 3300027821 | Surface Soil | VKRGVGFKVQRRMCRTCIYHKGSPLDLAELERQVRDPHMGFKGFRICHHSKDACCRGFWDMHKDEFAVGQVAQRLGLVCFVDIDIIKRITHRERKVK |
Ga0209611_102441172 | 3300027860 | Host-Associated | MLTVQRRLCKTCIYRPDSALDLTKLENDVRDPNMAGFFIGSRICHHSEDAVCRGFWNAHRNHFTAGQIAQRLGMVRFVDQDTLA |
(restricted) Ga0233415_100085909 | 3300027861 | Seawater | MFKVQQKQCKTCIYRPESPLDLKTLEQAIADQHGGFKGHRICHHSDDVCCRGFWERHKNQFQMGQIAQRLNMVEYVNIDTLNQVNDQQTK |
Ga0209583_100462401 | 3300027910 | Watersheds | CSTCIYRSDSPLDLKDLESAVADRFGGFRGHRICHHSDDVCCRGFWKRHKDKFAIGQIAQRLKMVEFVNVDTLSRHAKIEERER |
Ga0209061_11015262 | 3300027968 | Surface Soil | LFKVQAEACSTCIYRKDSPLDLNKLEAEIADGYGGFNGYRICHHSEDVCCRGFWNRHKDEFAAGQIAQRLDAVQFVTVDVSK |
Ga0268264_107955022 | 3300028381 | Switchgrass Rhizosphere | MFKVQKRQCETCIYRKSSPLDIKRLEAQVADKYGGYKGHRICHHSKDACCRGFWDRHKDQFQMGQLAQRLGW |
Ga0265338_1000108324 | 3300028800 | Rhizosphere | MTFKVQAKPCSTCIYRKDSPLDLKALEDAVRDPHMGFKGHRICHHSDDVYCRGFWNAHKDEFTAGQVAQRLGLVEFVEVDTLK |
Ga0265338_101431887 | 3300028800 | Rhizosphere | MRQKMQKMLKVQSKQCETCIYRKDSSLDIKQLESQVADPNMEGYFKGHRICHHSKDVCCRGFWNRHKDQFTLGQIAQRLDLVEFVTENTPK |
Ga0302221_103702782 | 3300028806 | Palsa | MKSGFKVQSKMCDTCIYRKDSPLDLQSLEAQIADKYGGFIGHRVCHHSKDVCCNGFWNAHKNEFQMGQVAQRLNMVKFVQVDSLKKKK |
Ga0302154_104665831 | 3300028882 | Bog | LAFKVQRKACATCIYRRDSPLNIKALEDQVRDKYMGFKGHRVCHHSKDACCRGFWNRHKNEFAMGQIAQRLKFVEFVDVDCRPPQATKGDLE |
Ga0311329_107811922 | 3300029907 | Bog | RKACATCIYRRDSPLNIKALEDQVRDKYMGFKGHRVCHHSKDACCRGFWNRHKNEFAMGQIAQRLKFVEFVDVDCRPPQATKGDLE |
Ga0311326_1000651717 | 3300029917 | Bog | KLAFKVQRKACATCIYRRDSPLNIKALEDQVRDKYMGFKGHRVCHHSKDACCRGFWNRHKNEFAMGQIAQRLKFVEFVDVDCRPPQATKGDLE |
Ga0302150_101142004 | 3300029956 | Bog | MFKVQARACPTCIYRSDSPLDIRKLEADVADKYGGFHGWRVCHHTRDVCCAGFWARHRNKFALGQIAQRLGLVE |
Ga0311372_111436352 | 3300030520 | Palsa | MADTSADRDGFLVQDRPCSTCIYRKDSTLDIKKLEADVADKHGGFKGHRICHHSKDACCRGFWNRHKDKFAMGQIAQRLGLVRFVRDDILKT |
Ga0302323_1001940791 | 3300031232 | Fen | MSYTFKVQRRLCATCIYRPDTPLDLAKLENDVRDRYMGFRGHRICHHSDDVCCRGFWNAHKDAFPAGQIAQRLDLVEFVDVDTLADPKPTRKRK |
Ga0307380_100180872 | 3300031539 | Soil | MKEKLGFKVQKVQCKTCIYRPDSPLDIQKLEADVADSYGGFKGHRICHHSVDACCAGFWARHKDKFQLGQIAQRLGMVTFVELETRT |
Ga0307379_100398501 | 3300031565 | Soil | MKEKLGFKVQKVQCKTCIYRPDSPLDIQKLEADVADSYGGFKGHRICHHSVDACCAGFWARHKDKFQLGQIAQRLGMVTFVELET |
Ga0307379_106056412 | 3300031565 | Soil | MFKVQAKQCSSCIYHTDSPLDLGKLEADVADGYGGFNGHRTCHHSDDVCCRGFWNKHKDKFQAGQIAQRLDAVEFVDVDKFSA |
Ga0315291_100842644 | 3300031707 | Sediment | VSRYGFKVQRRMCTTCIYRKDSPLSLKKLEADVADKFGGFRGHRICHHSKDVCCRGFWQRHKDKFAAGQIAQRLGLVVFVEVDTLKR |
Ga0310686_1093738802 | 3300031708 | Soil | MTFKVQRKPCSTCIYRADSTLDLAALEDAVRDEHVGFKGHRICHHSNDACCRGFWNAHKDEFAAGQIAQRLNFVEFVDDDNLSSKGVNPC |
Ga0302321_1020228353 | 3300031726 | Fen | IYRPDTPLDLAKLENDVRDRYMGFRGHRICHHSDDVCCRGFWNAHKDAFPAGQIAQRLDLVEFVDVDTLADPKPTRKRK |
Ga0315909_103759262 | 3300031857 | Freshwater | MRVQKTQCSTCIYRPDSPLDLAKLEAAVADGYGGFTGHRICHHSDDACCAGFWARHKNEFQLGKIAQRLGMVEFVEDDTLKGKTK |
Ga0315904_102630682 | 3300031951 | Freshwater | MRVQKTQCSTCIYRPDSPLDLAKLEADVADGYGGFKGHRVCHHSDDACCAGFWARHKNEFQLGQIAQRLGMVEFVEDDTLKGKTK |
Ga0315284_119367171 | 3300032053 | Sediment | MFKVQKKSCSTCIYRKDSPLDLKLLEAQVADKYGGFKGHRVCHHSKDVCCRGFWNRHKDKFQLGQIAQRMGWVRYVEVDTLKEKK |
Ga0315287_102605714 | 3300032397 | Sediment | VTKGFKVQKKMCSTCIYRPDSTLDLKVLEAQVADKYGGFKGHRICHHSDGACCQGFWNRHKDEFAAGQIAQRLGLVVFVEEDV |
Ga0348332_131704952 | 3300032515 | Plant Litter | RRPCSTCIYRADSTLDLAALEDAVRDEHVGFKGHRICHHSDDACCRGFWNAHKDEFAAGQIAQRLNFVEFVDDDNLSSKGVNPC |