| Basic Information | |
|---|---|
| Family ID | F104837 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 100 |
| Average Sequence Length | 45 residues |
| Representative Sequence | HLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG |
| Number of Associated Samples | 89 |
| Number of Associated Scaffolds | 100 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 86 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.55 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (16.000 % of family members) |
| Environment Ontology (ENVO) | Unclassified (28.000 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (36.000 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 53.42% β-sheet: 0.00% Coil/Unstructured: 46.58% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.55 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 100 Family Scaffolds |
|---|---|---|
| PF01264 | Chorismate_synt | 14.00 |
| PF00535 | Glycos_transf_2 | 7.00 |
| PF08448 | PAS_4 | 7.00 |
| PF05685 | Uma2 | 5.00 |
| PF00296 | Bac_luciferase | 5.00 |
| PF01613 | Flavin_Reduct | 5.00 |
| PF00496 | SBP_bac_5 | 3.00 |
| PF00254 | FKBP_C | 3.00 |
| PF13279 | 4HBT_2 | 2.00 |
| PF07355 | GRDB | 2.00 |
| PF02515 | CoA_transf_3 | 1.00 |
| PF13185 | GAF_2 | 1.00 |
| PF00126 | HTH_1 | 1.00 |
| PF01425 | Amidase | 1.00 |
| PF13231 | PMT_2 | 1.00 |
| PF00079 | Serpin | 1.00 |
| PF09720 | Unstab_antitox | 1.00 |
| PF00067 | p450 | 1.00 |
| PF01850 | PIN | 1.00 |
| PF09084 | NMT1 | 1.00 |
| PF00903 | Glyoxalase | 1.00 |
| PF04075 | F420H2_quin_red | 1.00 |
| PF01055 | Glyco_hydro_31 | 1.00 |
| PF00206 | Lyase_1 | 1.00 |
| COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
|---|---|---|---|
| COG0082 | Chorismate synthase | Amino acid transport and metabolism [E] | 14.00 |
| COG1853 | FMN reductase RutF, DIM6/NTAB family | Energy production and conversion [C] | 5.00 |
| COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 5.00 |
| COG4636 | Endonuclease, Uma2 family (restriction endonuclease fold) | General function prediction only [R] | 5.00 |
| COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 1.00 |
| COG0715 | ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 1.00 |
| COG1501 | Alpha-glucosidase/xylosidase, GH31 family | Carbohydrate transport and metabolism [G] | 1.00 |
| COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 1.00 |
| COG2124 | Cytochrome P450 | Defense mechanisms [V] | 1.00 |
| COG4521 | ABC-type taurine transport system, periplasmic component | Inorganic ion transport and metabolism [P] | 1.00 |
| COG4826 | Serine protease inhibitor | Posttranslational modification, protein turnover, chaperones [O] | 1.00 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 100.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 16.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 11.00% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.00% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 8.00% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 6.00% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.00% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.00% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 4.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.00% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.00% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 3.00% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 3.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.00% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 2.00% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 2.00% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.00% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.00% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.00% |
| Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.00% |
| Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.00% |
| Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.00% |
| Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.00% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.00% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.00% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005457 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG | Host-Associated | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental | Open in IMG/M |
| 3300005615 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaG | Environmental | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005841 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 | Host-Associated | Open in IMG/M |
| 3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental | Open in IMG/M |
| 3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
| 3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated | Open in IMG/M |
| 3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated | Open in IMG/M |
| 3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated | Open in IMG/M |
| 3300006881 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 | Host-Associated | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated | Open in IMG/M |
| 3300009802 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_50_60 | Environmental | Open in IMG/M |
| 3300009816 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
| 3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental | Open in IMG/M |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
| 3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012514 | Unplanted soil (control) microbial communities from North Carolina - M.Soil.1.old.130510 | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300013100 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaG | Host-Associated | Open in IMG/M |
| 3300013308 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaG | Host-Associated | Open in IMG/M |
| 3300014883 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT760_16_10D | Environmental | Open in IMG/M |
| 3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental | Open in IMG/M |
| 3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental | Open in IMG/M |
| 3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental | Open in IMG/M |
| 3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2 | Environmental | Open in IMG/M |
| 3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental | Open in IMG/M |
| 3300025899 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental | Open in IMG/M |
| 3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental | Open in IMG/M |
| 3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
| 3300027490 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes) | Environmental | Open in IMG/M |
| 3300027654 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio (SPAdes) | Environmental | Open in IMG/M |
| 3300027954 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes) | Environmental | Open in IMG/M |
| 3300028809 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48 | Environmental | Open in IMG/M |
| 3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental | Open in IMG/M |
| 3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental | Open in IMG/M |
| 3300031680 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22 | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031768 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22 | Environmental | Open in IMG/M |
| 3300031770 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17 | Environmental | Open in IMG/M |
| 3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental | Open in IMG/M |
| 3300031792 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f23 | Environmental | Open in IMG/M |
| 3300031797 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23 | Environmental | Open in IMG/M |
| 3300031799 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21 | Environmental | Open in IMG/M |
| 3300031831 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f20 | Environmental | Open in IMG/M |
| 3300031859 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f25 | Environmental | Open in IMG/M |
| 3300032000 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D3 | Environmental | Open in IMG/M |
| 3300032052 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19 | Environmental | Open in IMG/M |
| 3300032076 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2) | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental | Open in IMG/M |
| 3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental | Open in IMG/M |
| 3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental | Open in IMG/M |
| 3300034666 | Metatranscriptome of lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8R2 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI1027J12803_1009486501 | 3300000955 | Soil | AVKERRHLKDVAVAARPSLAGRIDDVFSFEAVVTELAPVLDETLAGIPADP* |
| JGI1027J12803_1072747643 | 3300000955 | Soil | ARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGENP* |
| Ga0062595_1017465182 | 3300004479 | Soil | VRDRRNLKEVARAARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGENP* |
| Ga0066680_104700281 | 3300005174 | Soil | VKERRHLKDVAIAARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG* |
| Ga0066685_107484632 | 3300005180 | Soil | ERRHLKDVALAARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG* |
| Ga0066678_105244151 | 3300005181 | Soil | KDVALAARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG* |
| Ga0066676_107096032 | 3300005186 | Soil | AAVRERRHLKDVALAARPALAGQIDDVFSLEAIVTELAPVLDETLAGIPSAG* |
| Ga0070690_1002812192 | 3300005330 | Switchgrass Rhizosphere | VKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA* |
| Ga0066388_1077801761 | 3300005332 | Tropical Forest Soil | RQLRDVALAARPALAGAIDGVFSLEAVVRELAPVLDEVLAEIDTDR* |
| Ga0070680_1005542582 | 3300005336 | Corn Rhizosphere | LKEVALAARPSLAAHIEEVFSLDALVKELAPVLDETLTGVPGDR* |
| Ga0070708_1001796823 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | HLRDVALAARPELAGQLDEVFSLEAVVRELSPVLDETLAGIPGDG* |
| Ga0070662_1000683381 | 3300005457 | Corn Rhizosphere | QLRDVALAAKPELAGAIDNVFSLEAVVTELAPVLSEVLAEVND* |
| Ga0070662_1013196552 | 3300005457 | Corn Rhizosphere | VTVKEGRHLKDVAVAARPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA* |
| Ga0070707_1018860731 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | RHLRDVALAARPELAGQLDEVFSLEAIVRDLTPVLDETLAGIPSEG* |
| Ga0066698_107336851 | 3300005558 | Soil | AARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG* |
| Ga0066708_101453312 | 3300005576 | Soil | PALAGQIDDVFSLEAIVIALAPVLDETLAGIPSDG* |
| Ga0070702_1008719981 | 3300005615 | Corn, Switchgrass And Miscanthus Rhizosphere | EHRHLRDIAGAARPALARQLDDVFSFEVVVAELAPVLDEVLAPVAGDAAEPP* |
| Ga0066905_1003195282 | 3300005713 | Tropical Forest Soil | LAVKERRQLKDVALAAKPALAGRIDDVFSLDALVAELAPVLEETLAGIPTE* |
| Ga0066905_1012863012 | 3300005713 | Tropical Forest Soil | RNLKEVARAARPSLAGEIDTVFSLEAIVAELAPVLEETLSGVGGENP* |
| Ga0068863_1014050641 | 3300005841 | Switchgrass Rhizosphere | LAARPGLAGRLDDVFSLEAIVRELTPVLDETLAGITADG* |
| Ga0075432_100355812 | 3300006058 | Populus Rhizosphere | SSLAVKDHRHLKGVALAARPSLAAHIEDVFSMDALVKELAPVLDETLAGVPTEA* |
| Ga0066665_109019781 | 3300006796 | Soil | VKERRHLKDVAIAARPSLAGQIDDVFSLDAIVGELAPVLEEALAGIADG* |
| Ga0066665_109498411 | 3300006796 | Soil | DVAIAARPSLAGQIDDVFSLDAIVGELAPVLEETLAGIAEG* |
| Ga0066665_110754481 | 3300006796 | Soil | IKERRHLRDVALAARPELAGQLDQVFALDAIVVEAAPVLEETLAGIAREG* |
| Ga0066660_103219811 | 3300006800 | Soil | VAVNERRHLRDVALAARPSLAARIEEVFSLEAIVTDLASVLDETLQSCA* |
| Ga0075421_1013247192 | 3300006845 | Populus Rhizosphere | RHLRAVALAERPSLGAEIDSVFSLEAVVAELAPVLDETLSGVGGENP* |
| Ga0075421_1015989822 | 3300006845 | Populus Rhizosphere | DVALAARPDLAGQLADVFSLEAIVAELAPVLDETLAGIP* |
| Ga0075420_1013344861 | 3300006853 | Populus Rhizosphere | RDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND* |
| Ga0075434_1022944241 | 3300006871 | Populus Rhizosphere | HLRDVALAARPELAGQLDQVFAPDAIVVEAAPVLEETLAGIAGEG* |
| Ga0075429_1015398342 | 3300006880 | Populus Rhizosphere | LAARPALAAQIADVFSLEAIVADLAPVLDETLAEVPAER* |
| Ga0068865_1009346912 | 3300006881 | Miscanthus Rhizosphere | VKERRHLKDVAMAERPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA* |
| Ga0099827_107074541 | 3300009090 | Vadose Zone Soil | RPELADRLDEVFSLEAIVRELTPVLDETLAAIPADG* |
| Ga0114129_114208932 | 3300009147 | Populus Rhizosphere | LRDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND* |
| Ga0114129_120962722 | 3300009147 | Populus Rhizosphere | ARPALAAQIADVFSLEAIVADLAPVLDETLAEVPAER* |
| Ga0105241_125007242 | 3300009174 | Corn Rhizosphere | EGRHLKDVAVAARPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA* |
| Ga0105073_10467551 | 3300009802 | Groundwater Sand | HLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG* |
| Ga0105076_10634721 | 3300009816 | Groundwater Sand | AVREHRHLRDVAVAARPSLAGQIDDVFSLEAIVTELAPVVDETLAGIAGEG* |
| Ga0126382_106046741 | 3300010047 | Tropical Forest Soil | DWSAAAVKERRHLRDIAVAARPALAGEIDDVFSLEAIVTELAAVLEETLTGIAEA* |
| Ga0126372_104081061 | 3300010360 | Tropical Forest Soil | TEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE* |
| Ga0126377_101275735 | 3300010362 | Tropical Forest Soil | VKELRHLRDVAVAAQPALAGRIDGVFSLEAVVTELAPVLDETLTEVPLSGDQP* |
| Ga0126379_101124821 | 3300010366 | Tropical Forest Soil | ARPSLGAEIDSVFSLEAVVAELAPVLEETLSGVGGENP* |
| Ga0134121_111975403 | 3300010401 | Terrestrial Soil | AARPSLAGRIDDVFSFDAVVTELAPVLDETLAGIPADP* |
| Ga0137389_104406091 | 3300012096 | Vadose Zone Soil | KDVAIAARPSLAGQIDDVFSLEAIVTELAPVLDETLAAIPGA* |
| Ga0137383_103223133 | 3300012199 | Vadose Zone Soil | AARPSLAGQIDDVFSLDAVVAELVPVLDETLEGVPG* |
| Ga0137399_100204816 | 3300012203 | Vadose Zone Soil | LAGGIDTVFSLEAIVAELASLLDETLSGVGGENP* |
| Ga0157330_10763602 | 3300012514 | Soil | LKEVARAARPALAGEIDTVFSLETVVAELAPVLEETLSGVGGENP* |
| Ga0137358_103048391 | 3300012582 | Vadose Zone Soil | AQPSLAGHIDDVFSLDAIATELAPILGETLAGIPSDASRL* |
| Ga0137394_103323982 | 3300012922 | Vadose Zone Soil | VSLAARPELAGQLDQVFAPDAIVVEAAPVLEETLAGIAGEG* |
| Ga0137419_104372301 | 3300012925 | Vadose Zone Soil | VKERRQLRDVALAARPSLAGQIDDVFSLDAVVAELVPVLDETLEGVPG* |
| Ga0137416_120413282 | 3300012927 | Vadose Zone Soil | VAVREHRHLRDIAVAARPSLAGEIDDVFSLDAIVTELAPIVDETLTGIAGEG* |
| Ga0157373_111199301 | 3300013100 | Corn Rhizosphere | RDVALAARPALTGAIDGVFSLESVVTELAPVLDEVLAEIA* |
| Ga0157375_114201851 | 3300013308 | Miscanthus Rhizosphere | VAVAARPSLAGRIDDVFSFEAVVTELAPVLDETLAGIPADP* |
| Ga0180086_11746251 | 3300014883 | Soil | RRNLRDVALAARPELASQLDEVFSLEAIVRELAPVLEETLARISGDG* |
| Ga0134089_103765382 | 3300015358 | Grasslands Soil | STVAVREHRHLRDVAIAAQPSLAGQVDVVFSLEAIVTDLAPVLDETLVGVAGEG* |
| Ga0182038_110140311 | 3300016445 | Soil | VAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0184640_102385161 | 3300018074 | Groundwater Sediment | RHLRDVAIAARTSLAGQIDDVFSLEAVVTELAPVLDEVLAEVAGDA |
| Ga0190265_126770331 | 3300018422 | Soil | RDVALAARPSLAGQIHDVFSLEAIVGELAPVLDEVLAEVV |
| Ga0066667_104984251 | 3300018433 | Grasslands Soil | RHLRDVALAARPSLAARIDDVFSLEAIVTDLASVLDETLQSCA |
| Ga0187894_103930641 | 3300019360 | Microbial Mat On Rocks | VAVAERRHLREVALAARPELAGLLDEVFSLEAIVGELAPVLDETLAGLAGDA |
| Ga0193755_11398372 | 3300020004 | Soil | VGVADTAGARPSLARRIDHVFSLEAIVTELAPVLDEVLAEVAG |
| Ga0126371_104692241 | 3300021560 | Tropical Forest Soil | AARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAG |
| Ga0207642_105873391 | 3300025899 | Miscanthus Rhizosphere | VALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA |
| Ga0207662_113807221 | 3300025918 | Switchgrass Rhizosphere | KERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA |
| Ga0207681_100156545 | 3300025923 | Switchgrass Rhizosphere | VDDWSSVAVKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA |
| Ga0207706_116170582 | 3300025933 | Corn Rhizosphere | SSVAVKERRPLRDVALAARPELAGAIHGVFSLDAVVAELAPVLDEVLRDVDA |
| Ga0207704_108708031 | 3300025938 | Miscanthus Rhizosphere | VKERRHLKDVAMAERPSLAGRIDGVFSFEAVVAELVPVLDETLAGLPADA |
| Ga0207674_112127673 | 3300026116 | Corn Rhizosphere | RDIARAARPALARQLDDVFSFEVVVAELAPVLDEVLAPVEGDTAEPP |
| Ga0209803_13362051 | 3300026332 | Soil | AARPSLAGQIDAVFSLEAIVAELAPVLEETLEGVAG |
| Ga0209160_11432841 | 3300026532 | Soil | VNERRHLRDVALAARPSLASRIDDVFSLEAIVTDLASVLDETLQSCA |
| Ga0209376_11976381 | 3300026540 | Soil | ERRHLKDVAIAARPSLAGQIDDVFSLDGIVGELAPVLEETLAGIAEG |
| Ga0209805_11259811 | 3300026542 | Soil | DDWSAVAVNERRHLRDVALAARPSLAARIDDVFSLEAIVTDLASVLDETLQSCA |
| Ga0209899_10439061 | 3300027490 | Groundwater Sand | PSLAGQIDDVFSLEAVVAELASVLDETLAGIAGEG |
| Ga0209799_10892001 | 3300027654 | Tropical Forest Soil | LKDVALAAKPALAGRIDDVFSLDALVAELAPVLEETLAGIPTE |
| Ga0209859_10324101 | 3300027954 | Groundwater Sand | VVKERRHLKEVALAARPTLAGRIDDVFSLESVVVELAPVLDETLAGVAG |
| Ga0247824_102292211 | 3300028809 | Soil | AVREGRQLRDVALAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND |
| (restricted) Ga0255311_10818411 | 3300031150 | Sandy Soil | VATTARPSLAGQIDDVFSLEAIVTELAPVLDETLAAIPGD |
| Ga0310915_107933362 | 3300031573 | Soil | ERLHLKDVATAARPSLASEIGGVFSLETIVTELGPVLEETLAGIPASA |
| Ga0318574_101407001 | 3300031680 | Soil | ARVVKERLHLKDVATAARPSLASEIGGVFSLETIVTELGPVLEETLAGIPAGA |
| Ga0307469_109850591 | 3300031720 | Hardwood Forest Soil | LRDVALAARPALAGTIHDVFSLDAVVAEMAPVLDEVLRDVHA |
| Ga0307469_112316302 | 3300031720 | Hardwood Forest Soil | RRHLKDVAMAARPSRAGRIDGVFSFEAVVAELVPVLDETLAGLPADA |
| Ga0307469_112804611 | 3300031720 | Hardwood Forest Soil | HLKDVAVAARPSLAGHIDDVFSFEAVIAELAPVLDETLEGIPAGA |
| Ga0318509_106428001 | 3300031768 | Soil | TAARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA |
| Ga0318521_102702931 | 3300031770 | Soil | VKERLHLKDVATAARPSQASEIAGVFSLETIVTELGPVLEETLAGIPAGA |
| Ga0318546_104935131 | 3300031771 | Soil | VVKERLHLKDVATAARPSQASEIAGVFSLETIVTELGPVLEETLAGIPAGA |
| Ga0318529_102983251 | 3300031792 | Soil | DWSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0318550_100633671 | 3300031797 | Soil | WSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0318565_101696292 | 3300031799 | Soil | LLDDAGPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA |
| Ga0318564_101697292 | 3300031831 | Soil | HLKDVATAARPSLASEIAGVFSLETIVTELAPVLEETLAGIPAGA |
| Ga0318527_103116851 | 3300031859 | Soil | AIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0310903_100510581 | 3300032000 | Soil | DRRHLKDVALAARPSLAAHIEDVFSLDALVKELAPVLDETLAGVPTEA |
| Ga0318506_102780291 | 3300032052 | Soil | VDDWSTEAIKERRHLKDVAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0306924_121154602 | 3300032076 | Soil | VAVAARPALAGQIDDVFSFDAIVAELAPVFEEALAGVPTE |
| Ga0307471_1005797881 | 3300032180 | Hardwood Forest Soil | DWSGVAVRDRRHLKLVALAARPALAGEIDAVFSLEAIVAELAPLLDETLSGVGGENP |
| Ga0307471_1006335111 | 3300032180 | Hardwood Forest Soil | CRHLRAVALAERPSLGAEIDSVFSLEAVVAELAPVLDETLSGVGGENP |
| Ga0307472_1002174771 | 3300032205 | Hardwood Forest Soil | RSARPSLAGEIDTVFSLETVVAELAPVLEETLSGVGGESP |
| Ga0335084_113254822 | 3300033004 | Soil | RQLRDVALAARPELAGTIDDVFSLAAIVTELAPVLDEVLAEIDA |
| Ga0335084_115612421 | 3300033004 | Soil | GARPALAGQIDAVFSPETMVTELAPVLDETLAVLSEGA |
| Ga0364930_0146106_3_143 | 3300033814 | Sediment | RHLRDVVIAARPALAGQIDDVFSLEAVVTELAPVLDEVLAEVAADG |
| Ga0326723_0332406_1_144 | 3300034090 | Peat Soil | RRHLKDVAVAARPSLTGQIDEVFSLEAIVAELAPVLDETLAGIPADA |
| Ga0314788_044523_705_818 | 3300034666 | Soil | LAAKPELAGAIDDVFSLEAVVTELAPVLSEVLAEVND |
| ⦗Top⦘ |