Basic Information | |
---|---|
Family ID | F098974 |
Family Type | Metagenome / Metatranscriptome |
Number of Sequences | 103 |
Average Sequence Length | 47 residues |
Representative Sequence | MADLVFIVSRTEPKHYLYLKHEFADESRDVVLDRRAGERRRSQ |
Number of Associated Samples | 84 |
Number of Associated Scaffolds | 103 |
Quality Assessment | |
---|---|
Transcriptomic Evidence | Yes |
Most common taxonomic group | Unclassified |
% of genes with valid RBS motifs | 100.00 % |
% of genes near scaffold ends (potentially truncated) | 30.10 % |
% of genes from short scaffolds (< 2000 bps) | 25.24 % |
Associated GOLD sequencing projects | 78 |
AlphaFold2 3D model prediction | Yes |
3D model pTM-score | 0.21 |
Hidden Markov Model |
---|
Powered by Skylign |
Most Common Taxonomy | |
---|---|
Group | Unclassified (70.874 % of family members) |
NCBI Taxonomy ID | N/A |
Taxonomy | N/A |
Most Common Ecosystem | |
---|---|
GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (22.330 % of family members) |
Environment Ontology (ENVO) | Unclassified (35.922 % of family members) |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (49.515 % of family members) |
⦗Top⦘ |
⦗Top⦘ |
Predicted Topology & Secondary Structure | |||||
---|---|---|---|---|---|
Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 0.00% β-sheet: 8.45% Coil/Unstructured: 91.55% | Feature Viewer |
|
|||||
Powered by Feature Viewer |
Structure Viewer | |
---|---|
| |
Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.21 |
Powered by PDBe Molstar |
⦗Top⦘ |
Pfam ID | Name | % Frequency in 103 Family Scaffolds |
---|---|---|
PF00583 | Acetyltransf_1 | 6.80 |
PF02954 | HTH_8 | 2.91 |
PF00027 | cNMP_binding | 1.94 |
PF02518 | HATPase_c | 1.94 |
PF01207 | Dus | 1.94 |
PF09585 | Lin0512_fam | 0.97 |
PF01527 | HTH_Tnp_1 | 0.97 |
PF00709 | Adenylsucc_synt | 0.97 |
PF04366 | Ysc84 | 0.97 |
PF01553 | Acyltransferase | 0.97 |
PF00210 | Ferritin | 0.97 |
PF16450 | Prot_ATP_ID_OB | 0.97 |
PF00574 | CLP_protease | 0.97 |
PF02321 | OEP | 0.97 |
PF04932 | Wzy_C | 0.97 |
PF03992 | ABM | 0.97 |
PF04389 | Peptidase_M28 | 0.97 |
PF00682 | HMGL-like | 0.97 |
PF04392 | ABC_sub_bind | 0.97 |
PF00072 | Response_reg | 0.97 |
COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds |
---|---|---|---|
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 1.94 |
COG0616 | Periplasmic serine protease, ClpP class | Posttranslational modification, protein turnover, chaperones [O] | 1.94 |
COG0740 | ATP-dependent protease ClpP, protease subunit | Posttranslational modification, protein turnover, chaperones [O] | 1.94 |
COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.94 |
COG0104 | Adenylosuccinate synthase | Nucleotide transport and metabolism [F] | 0.97 |
COG1030 | Membrane-bound serine protease NfeD, ClpP class | Posttranslational modification, protein turnover, chaperones [O] | 0.97 |
COG2930 | Lipid-binding SYLF domain, Ysc84/FYVE family | Lipid transport and metabolism [I] | 0.97 |
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.97 |
COG3307 | O-antigen ligase | Cell wall/membrane/envelope biogenesis [M] | 0.97 |
⦗Top⦘ |
Name | Rank | Taxonomy | Distribution |
Unclassified | root | N/A | 70.87 % |
All Organisms | root | All Organisms | 29.13 % |
Visualization |
---|
Powered by ApexCharts |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
3300004281|Ga0066397_10022150 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 956 | Open in IMG/M |
3300005295|Ga0065707_10299034 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1008 | Open in IMG/M |
3300005468|Ga0070707_100014049 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 7502 | Open in IMG/M |
3300005574|Ga0066694_10624452 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 502 | Open in IMG/M |
3300006852|Ga0075433_10124800 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2287 | Open in IMG/M |
3300006854|Ga0075425_101256108 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 840 | Open in IMG/M |
3300009094|Ga0111539_10519772 | All Organisms → cellular organisms → Bacteria | 1387 | Open in IMG/M |
3300010043|Ga0126380_10064039 | Not Available | 2053 | Open in IMG/M |
3300010329|Ga0134111_10032107 | All Organisms → cellular organisms → Bacteria | 1836 | Open in IMG/M |
3300010336|Ga0134071_10514393 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 619 | Open in IMG/M |
3300010366|Ga0126379_11460498 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 790 | Open in IMG/M |
3300012205|Ga0137362_10046527 | All Organisms → cellular organisms → Bacteria | 3516 | Open in IMG/M |
3300012207|Ga0137381_10779286 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 830 | Open in IMG/M |
3300012207|Ga0137381_11513820 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 562 | Open in IMG/M |
3300012210|Ga0137378_10778984 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 868 | Open in IMG/M |
3300012396|Ga0134057_1262169 | All Organisms → cellular organisms → Bacteria | 512 | Open in IMG/M |
3300012930|Ga0137407_10586099 | Not Available | 1046 | Open in IMG/M |
3300015241|Ga0137418_10015209 | All Organisms → cellular organisms → Bacteria | 7109 | Open in IMG/M |
3300015245|Ga0137409_10019949 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 6615 | Open in IMG/M |
3300025922|Ga0207646_11241692 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 652 | Open in IMG/M |
3300025938|Ga0207704_10734378 | All Organisms → cellular organisms → Bacteria | 820 | Open in IMG/M |
3300025972|Ga0207668_10226084 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1506 | Open in IMG/M |
3300026118|Ga0207675_101312861 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 744 | Open in IMG/M |
3300026334|Ga0209377_1345425 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 502 | Open in IMG/M |
3300026529|Ga0209806_1075655 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1501 | Open in IMG/M |
3300026538|Ga0209056_10272779 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1188 | Open in IMG/M |
3300026547|Ga0209156_10079545 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1664 | Open in IMG/M |
3300027909|Ga0209382_10228506 | All Organisms → cellular organisms → Bacteria | 2121 | Open in IMG/M |
3300031720|Ga0307469_10785352 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 872 | Open in IMG/M |
3300031740|Ga0307468_100410010 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1038 | Open in IMG/M |
3300031740|Ga0307468_101098734 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 708 | Open in IMG/M |
3300031954|Ga0306926_11699473 | Not Available | 720 | Open in IMG/M |
3300032180|Ga0307471_102178169 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 698 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
⦗Top⦘ |
Habitat | Taxonomy | Distribution |
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 22.33% |
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 15.53% |
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 13.59% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 12.62% |
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.80% |
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.85% |
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.85% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.88% |
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.91% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.94% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.94% |
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.97% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.97% |
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.97% |
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.97% |
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.97% |
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.97% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.97% |
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.97% |
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.97% |
Visualization |
---|
Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Taxon OID | Sample Name | Habitat Type | IMG/M Link |
---|---|---|---|
2140918013 | Soil microbial communities from Great Prairies - Iowa soil (MSU Assemblies) | Environmental | Open in IMG/M |
2228664021 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
3300000363 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
3300004281 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio | Environmental | Open in IMG/M |
3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental | Open in IMG/M |
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental | Open in IMG/M |
3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental | Open in IMG/M |
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental | Open in IMG/M |
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental | Open in IMG/M |
3300005718 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 | Host-Associated | Open in IMG/M |
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental | Open in IMG/M |
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated | Open in IMG/M |
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated | Open in IMG/M |
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
3300009094 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2) | Host-Associated | Open in IMG/M |
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
3300009808 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 | Environmental | Open in IMG/M |
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental | Open in IMG/M |
3300010102 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300010132 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental | Open in IMG/M |
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental | Open in IMG/M |
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental | Open in IMG/M |
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental | Open in IMG/M |
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
3300016357 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 | Environmental | Open in IMG/M |
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental | Open in IMG/M |
3300025937 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300025972 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
3300026118 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes) | Host-Associated | Open in IMG/M |
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental | Open in IMG/M |
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental | Open in IMG/M |
3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental | Open in IMG/M |
3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental | Open in IMG/M |
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental | Open in IMG/M |
3300027874 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes) | Environmental | Open in IMG/M |
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated | Open in IMG/M |
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental | Open in IMG/M |
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental | Open in IMG/M |
3300032060 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18 | Environmental | Open in IMG/M |
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
Geographical Distribution | |
---|---|
Zoom: | Powered by OpenStreetMap |
⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
Protein ID | Sample Taxon ID | Habitat | Sequence |
Iowa-Corn-GraphCirc_01500660 | 2140918013 | Soil | MTDLLFIVSRAEPRQYLYLKHVFGDDSRDVVLDRRIGERRRTLRPPPVERRRV |
ICCgaii200_04618041 | 2228664021 | Soil | MTDLLFIVSRAEPRQYLYLKHVFGDDSRDVVLDRR |
ICChiseqgaiiFebDRAFT_143359761 | 3300000363 | Soil | MTDLLFIVSRAEPRQYLYLKHVFGDDSRDVVLDRRIGERRRTLRPPPVERR |
JGI1027J11758_123929231 | 3300000789 | Soil | VALVIVVSRTELKRYLYLKHLYADEGMDVVLDRRRG |
JGI25382J43887_100856763 | 3300002908 | Grasslands Soil | MSDLVFVVSRTEPKQYFYLKHVFADESRDVVLDRRLGERRRGSLSPPRAE |
JGI25382J43887_105029641 | 3300002908 | Grasslands Soil | MADLLFIVSRTEPKQYLYLKHVYADESRDVVLDRRMGERRRSLRPQ |
Ga0066397_100221501 | 3300004281 | Tropical Forest Soil | VPDLVFIVSRSEPKQYMYLKHFWADEGRDVILDRRTGERRQSLRPPPVERRHVERRRQ |
Ga0066395_109692681 | 3300004633 | Tropical Forest Soil | MADLVFILSRTELKQYLYLKHAWTDERRDVEVLLDRRTGERRRSPR |
Ga0066680_100351114 | 3300005174 | Soil | MGDFLFIVSRTKPIRYLRLKRAFADQTEDVVLDRRTGERRQSLRPPP |
Ga0066680_107959642 | 3300005174 | Soil | MGDFLFIVSRTEPKRYLRLKQAFADQTEDVVLDRRTGERRQSLRPAA |
Ga0065707_102990341 | 3300005295 | Switchgrass Rhizosphere | MADLVFIVSRTEAKQYFYLKHEFADESRDVVLDRRLGERRRSLRPPPIERRHID |
Ga0070687_1008696692 | 3300005343 | Switchgrass Rhizosphere | MADLVFIVSRSEPKQYLYLKHQFADESRDVVLDRRTGERRRSPMTQPRIERRH |
Ga0070694_1005516471 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESRDVVLDRRAGERRRSQ |
Ga0066686_109414371 | 3300005446 | Soil | VADLVFIVSRTEPQQYLYLKHVFADESRDVVLDRRMGERRRSVRP |
Ga0070706_1005437451 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MPDLVFIVSRTEPKRYLYLKHEFADESRDVVLDRRLGERRRSLRPPQ |
Ga0070707_1000140495 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MEDLLFIVSRTEPKRYLYLKHVYADESRDVVLDRRMGERRRSLRPQQLERRHIDRRIAKSRGNSSAR* |
Ga0066694_106244521 | 3300005574 | Soil | MADLLFIVSRTEPKRYMYLRHVYADESRDVILDRRGGERRQSWRQPPVERRHVERRH |
Ga0068866_112475821 | 3300005718 | Miscanthus Rhizosphere | MADLVFIVSRSEPKQYLYLKHQFADESRDVVLDRRTGERRRSPMTQPR |
Ga0066696_108075671 | 3300006032 | Soil | MGDFLFIVSRTKPQRYLRLKHAFADQTEDVVLDRR |
Ga0066659_114360413 | 3300006797 | Soil | VADLVFIVSRTEPKQYLYLKHVFADESRDVVLDRRTSERRRSLR |
Ga0075421_1011275802 | 3300006845 | Populus Rhizosphere | MPDLVFIVSRTEPKHYLYLKHEFANESSDVVLDRRAGERRRSQ |
Ga0075431_1003407595 | 3300006847 | Populus Rhizosphere | MADLVFIVSRTEAKQYFYLKHEFADESRDVVLDRRLGERR |
Ga0075433_101248001 | 3300006852 | Populus Rhizosphere | MADLVFIVSRTEPKQYFYLKHEFADESRDVVLDRRLGERR |
Ga0075425_1012561082 | 3300006854 | Populus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRR |
Ga0075425_1029554251 | 3300006854 | Populus Rhizosphere | MAELVFIVSRTEPKQYFYLKHEFADESRDVVLDRRLGER |
Ga0075434_1011894661 | 3300006871 | Populus Rhizosphere | MADLVFIVSRAEPKHYLYLKHEFADESSDVVLDRRAGDRRRSQRPLPT |
Ga0075434_1018306983 | 3300006871 | Populus Rhizosphere | MADLLFIVSRTESRQYLYLKQVFADESRDVVLDRRMGERRRS |
Ga0075435_1008640931 | 3300007076 | Populus Rhizosphere | VVDWLFIVSSTELERYLYLKHEYADEAREVIFDRR |
Ga0075435_1010256373 | 3300007076 | Populus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRRAGERRRS |
Ga0099794_106909792 | 3300007265 | Vadose Zone Soil | MADLVFIVSRSEPKHYLYLKHEFADERSDVVLDRRS |
Ga0066710_1003029285 | 3300009012 | Grasslands Soil | VADLVFIVSRTEPKQYLYLKHVFADESRDVVLDRRTSERRRSLRAPTIE |
Ga0066710_1047798562 | 3300009012 | Grasslands Soil | VADLLFIVSRSEPKRYMYLKHEYADEGKEVILDRRGG |
Ga0111539_105197721 | 3300009094 | Populus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESRDVVLDRRAGERRRSQGPVPSERRHMQRR |
Ga0066709_1045423481 | 3300009137 | Grasslands Soil | VADLLFIVSRSEPKRYMYLKHEYADEGKEVILDRRG |
Ga0114129_112447461 | 3300009147 | Populus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRRAG |
Ga0114129_129292382 | 3300009147 | Populus Rhizosphere | MADLVFIVSRAEPKHYLYLKHEFADESSDVVLDRRAGDRRRSQRPLPTERR |
Ga0075423_106209312 | 3300009162 | Populus Rhizosphere | MADMVFIVSRTDPKQYQYLKHEFADESTDVVLDRRAGERRR |
Ga0105071_10091854 | 3300009808 | Groundwater Sand | MVADLLFIVSRTEPKRYMYLKYVYADEGRDVILDRRTGERRRGRGQP |
Ga0126380_100640391 | 3300010043 | Tropical Forest Soil | LIFIVSRTSPRTYSYLKHVFADETRHVVLDRRAGERRRNQSWRLAERRHVER |
Ga0127453_10392062 | 3300010102 | Grasslands Soil | VADLLFIVSRNEPKQYLYLKHVYADASRDVVLDRRGGERRQGRRPPPLAER |
Ga0127455_10534661 | 3300010132 | Grasslands Soil | VADLLFIVSRNEPKQYLYLKHVYADASRDVVLDRRGGERRQ |
Ga0134088_102639951 | 3300010304 | Grasslands Soil | MMADLLFIVSRTEPNRYMYLKYVYADESRDVILDR |
Ga0134088_105336872 | 3300010304 | Grasslands Soil | MGDFLFIVSRTKPQRYLRLKHAFADQTEDVVLDRRTGERRQSLRPPPA |
Ga0134111_100321071 | 3300010329 | Grasslands Soil | MMADLLFIVSRTEPNRYMYLKYVYADESRDVILDRRQGERRRGQGQPPTERRHG |
Ga0134071_105143931 | 3300010336 | Grasslands Soil | VEAIMTELVFIVSRTEPKQYFYLKHEFADESRDVVLDRRMGERRRGLRPPPVERRHID |
Ga0134071_107650071 | 3300010336 | Grasslands Soil | VADLLFIVSRTEPKRYMYLRHVYADESRDVILDRRGGERRQSW |
Ga0126376_111584261 | 3300010359 | Tropical Forest Soil | VADLVFIVSRGEPKQYMYLKHFWADEGRDVILDRRMGERRQ |
Ga0126377_103982701 | 3300010362 | Tropical Forest Soil | VPDLVFIVSRSEPKQYMYLKHFWADEGRDVILDRRTGERRQSLRPPPV |
Ga0126379_114604982 | 3300010366 | Tropical Forest Soil | VGDVVFIVSRTEPKQYLYLKHYWADEKRDVILDRRTGERRRSLRPPPIERRRMERR |
Ga0134121_109431453 | 3300010401 | Terrestrial Soil | MAELVFIVWRTEPKQYFYLKHEFADESRDVVLDRRLGERRRSLRP |
Ga0137362_100465271 | 3300012205 | Vadose Zone Soil | MADLVFIVSRTEPKQYLYLKHVFADESRDVVLDRRTSVERRR |
Ga0137362_115132452 | 3300012205 | Vadose Zone Soil | MGDFLFIVSRTEPKRYLRLKQAFADQTEDVVLDRRTGERRQSLRPAAVER |
Ga0137381_107792862 | 3300012207 | Vadose Zone Soil | MADLVFIVSRTEPKHYLYLKHEFADERSDVILDRRVGERCRSQRPLPIERRHMQWRHRDVTWE |
Ga0137381_110528042 | 3300012207 | Vadose Zone Soil | MMADLLFIVSRTEPKRYMYLKYVYADERRDVILDRRQG |
Ga0137381_115138203 | 3300012207 | Vadose Zone Soil | MADLVFIVSRTEPKQYFHLKHEFADESRDVVLDRRLSERRRSLRPPPVERR |
Ga0137378_107789842 | 3300012210 | Vadose Zone Soil | MADLLFIVSRTEPKQYLYLKHVFADESRDVVFDRRIGGERRRSLSPLRVERRHIEP |
Ga0137377_107717142 | 3300012211 | Vadose Zone Soil | MADLVFIVSRTEPKQYFYLKHEFADESRDVVLDRRLGER |
Ga0134057_12621692 | 3300012396 | Grasslands Soil | VADLLFIVSRNEPKQYLYLKHVYADASRDVVLDRRGGERRQGRRPPPLAERRYGVERRRRDI |
Ga0137358_102360313 | 3300012582 | Vadose Zone Soil | MADLVFVVSRTEPKQYFYLKHVYADESRDVVLDRRLGERRRAWRPP |
Ga0137397_111102022 | 3300012685 | Vadose Zone Soil | MVADLLFIMARSEAKRYMDFKHVYADEGRDVILDRREGERR |
Ga0137394_103223922 | 3300012922 | Vadose Zone Soil | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRRAGERRRSQRPLPTE |
Ga0137394_112762581 | 3300012922 | Vadose Zone Soil | MMADLLFIVSRTEPKRYMYLKYVYADERRDVILDRRQGERR |
Ga0137407_105860993 | 3300012930 | Vadose Zone Soil | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRRAGERRRSQRPLPT |
Ga0137410_103548011 | 3300012944 | Vadose Zone Soil | VADFLFIVSRNEPKQYLYLKHVYADESREVVLDRRGGER |
Ga0134110_100694371 | 3300012975 | Grasslands Soil | MGDFLFIVSRTKPQRYLRLKHAFADQTEDVVLDRRTGERRQ |
Ga0134110_103273241 | 3300012975 | Grasslands Soil | MVADLVFIVSRTEPKQYLYLKHVFADETRDVVLDRRTVDRRRTLRPPPIERRH |
Ga0134076_106035601 | 3300012976 | Grasslands Soil | MSDLLFIVSRTEPKRYMYLKYVYADERRDVILDRRQ |
Ga0137418_100152097 | 3300015241 | Vadose Zone Soil | MPDLLFIVSRTEPKQYFYLKHVYADEGRDVVLDRRLGERRRSQRPPPAERRHVERRH |
Ga0137409_100199491 | 3300015245 | Vadose Zone Soil | MADLVFIVSRTEPKHYLYLKHEFADERSDVVLDRRAGERRRSQRPLPTERRHMQR |
Ga0182032_118893511 | 3300016357 | Soil | VGDVVFIVSRTEPKQYLYLKHYWADEKRDVILDRRTGERRRSLRPPPIER |
Ga0134069_13516221 | 3300017654 | Grasslands Soil | MADLVFIVSRTEPKHYLYLKHEFADGSSDVVLDRRASERR |
Ga0134083_102691632 | 3300017659 | Grasslands Soil | VASLLFIVSREAPGRYGYLKHVFAGESGDVIVDRRAGERRRREG |
Ga0207646_104698622 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MGDFLFIVSRTKPIRYLRLKRAFADQTEDVVLDRRTGERRQS |
Ga0207646_112416921 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MEDLLFIVSRTEPKRYLYLKHVYADESRDVVLDRRMGERRRSLRPQQLERRHIDRRIAKS |
Ga0207669_111275672 | 3300025937 | Miscanthus Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESRDVVLDRRAGE |
Ga0207704_107343781 | 3300025938 | Miscanthus Rhizosphere | MADLVFIVSRSEPKQYLYLKHQFADESRDVVLDRRTGERRRSPMPSLITSRRERPRSP |
Ga0207668_102260841 | 3300025972 | Switchgrass Rhizosphere | MADLVFIVSRNEPKHYLYLKHEFADESRDVVLDRRAGERRRSQRPQPTERRHM |
Ga0207675_1013128611 | 3300026118 | Switchgrass Rhizosphere | MADLVFIVSRTEPKHYLYLKHEFADESRDVVLDRRLGERRRSLRPPLIERRHIDRRHRDD |
Ga0209802_10372895 | 3300026328 | Soil | MGDFLFIVSRTKPIRYLRLKRAFADQTEDVVLDRRTGERRQSLRPPPVERRHVDRRRR |
Ga0209802_10430714 | 3300026328 | Soil | MADLVFVVSRTEPKQYFYLKHVFADESRDVVLDRRLGERRRGSL |
Ga0209267_11497662 | 3300026331 | Soil | MPDLVFIVSRTEPKRYLYLKHEFADESRDVVLDRRLGERRRSLRPPQLER |
Ga0209803_12771571 | 3300026332 | Soil | VASLLFIVSREAPGRYGYLKHVFAGESGDVIVDRRAGERRRR |
Ga0209158_10957241 | 3300026333 | Soil | MADLVFVVSRTEPKQYFYLKHVFADESRDVVLDRRRGE |
Ga0209377_13454251 | 3300026334 | Soil | VADLLFIVSRTEPKRYMYLRHVYADESRDVILDRRGGERRQSWRQPPVERRHVERRHRDI |
Ga0209057_10476741 | 3300026342 | Soil | VADLLFIVSRSEPKRYMYLKHEYADEGKEVILDRRGGDRRRSQKPPPVE |
Ga0209808_13030991 | 3300026523 | Soil | VADLVFVVSRTEPQQYLYLKHVFADESRDVVLDRRMGERRRSLSPS |
Ga0209806_10093609 | 3300026529 | Soil | MADLVFVVSRTEPKQYFYLKHVFADESRDVVLDRRRGERRRGSFTPPRA |
Ga0209806_10756551 | 3300026529 | Soil | VADLVFIVSRTEPKQYLYLKHVFADESRDVVLDRRTSERRRSLRAPTIERRHID |
Ga0209056_102727792 | 3300026538 | Soil | MADLVFIVSRTAPKQYFYLKHVYADEGRDVVLDRRGSERRRTQRPPPAERRHVERR |
Ga0209805_12040631 | 3300026542 | Soil | MADLVFVVSRTEPKQYFYLKHVFADESRDVVLDRRRGERRRGSFTPP |
Ga0209156_100795453 | 3300026547 | Soil | MADLVFIVSRTAPKQYFYLKHVFADDSRDVVLDRRVGERRRSSRPPPSERRHVER |
Ga0209465_106398301 | 3300027874 | Tropical Forest Soil | VADLIFIVPRTELKWYGYLKQIYADESRDVVLDRRTGERRRSLSPPPVME |
Ga0209382_102285061 | 3300027909 | Populus Rhizosphere | MADLVFIVSRTEAKQYFYLKHEFADESRDVVLDRRLGERRRSL |
(restricted) Ga0255312_10697661 | 3300031248 | Sandy Soil | MAALLFIVSRTEPKQYLYLKHAFADESRDVVLDRRTGERRRSLRPPPIE |
Ga0307469_105127012 | 3300031720 | Hardwood Forest Soil | MADLLFVVSRTEPKRYMYLKYVYADESRDVILDRRQGERRRG |
Ga0307469_107853521 | 3300031720 | Hardwood Forest Soil | MADLLFIVSRTEPKQYLYLKHVFADESRDVVLDRRIGERRLSLRSPQVERRHIDRRRRD |
Ga0307468_1004100102 | 3300031740 | Hardwood Forest Soil | MADLVFIVSRNEPKRYLYLKHECADESSDVVLDRRAGERRRIQRPLPTERRHMQR |
Ga0307468_1010987341 | 3300031740 | Hardwood Forest Soil | MADLVFIVSRTEPKQYFYLKHEFADESRDVVLDRRLGERRRSLRPPLIERRHIDRRH |
Ga0307473_111803492 | 3300031820 | Hardwood Forest Soil | VALVIVVSRTELKRYMYLKHLYADEGMDVVLDRRRGERRQRV |
Ga0306926_116994731 | 3300031954 | Soil | MADLFFVVSRTEQKQYTHLKHVYSNATEDVVLDRRTGER |
Ga0318505_105284951 | 3300032060 | Soil | VGDVVFIVSRTEPKQYLYLKHYWADEKRDVILDRRTGERRRSLRSPPIER |
Ga0307471_1021781691 | 3300032180 | Hardwood Forest Soil | VADLLFIVSRTEPKQYLYLKHVFADESRDVVLDRRMSERRRGLRPPPIERRHIDR |
Ga0307472_1014693471 | 3300032205 | Hardwood Forest Soil | MADLVFIVSRTEPKHYLYLKHEFADESSDVVLDRRAGER |
⦗Top⦘ |