| Basic Information | |
|---|---|
| Family ID | F103796 |
| Family Type | Metagenome |
| Number of Sequences | 101 |
| Average Sequence Length | 55 residues |
| Representative Sequence | AGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP |
| Number of Associated Samples | 93 |
| Number of Associated Scaffolds | 101 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 89 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.52 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (10.891 % of family members) |
| Environment Ontology (ENVO) | Unclassified (33.663 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (39.604 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 26.19% β-sheet: 0.00% Coil/Unstructured: 73.81% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.52 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 101 Family Scaffolds |
|---|---|---|
| PF00072 | Response_reg | 19.80 |
| PF00664 | ABC_membrane | 15.84 |
| PF13633 | Obsolete Pfam Family | 8.91 |
| PF12697 | Abhydrolase_6 | 7.92 |
| PF04199 | Cyclase | 3.96 |
| PF00296 | Bac_luciferase | 2.97 |
| PF13544 | Obsolete Pfam Family | 2.97 |
| PF08334 | T2SSG | 2.97 |
| PF07963 | N_methyl | 1.98 |
| PF02515 | CoA_transf_3 | 1.98 |
| PF12146 | Hydrolase_4 | 1.98 |
| PF01145 | Band_7 | 1.98 |
| PF01522 | Polysacc_deac_1 | 0.99 |
| PF00005 | ABC_tran | 0.99 |
| PF00656 | Peptidase_C14 | 0.99 |
| PF04794 | YdjC | 0.99 |
| PF00004 | AAA | 0.99 |
| PF02779 | Transket_pyr | 0.99 |
| PF00529 | CusB_dom_1 | 0.99 |
| PF00441 | Acyl-CoA_dh_1 | 0.99 |
| PF00561 | Abhydrolase_1 | 0.99 |
| COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds |
|---|---|---|---|
| COG1878 | Kynurenine formamidase | Amino acid transport and metabolism [E] | 3.96 |
| COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 2.97 |
| COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 1.98 |
| COG0726 | Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 family | Cell wall/membrane/envelope biogenesis [M] | 0.99 |
| COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.99 |
| COG3394 | Chitooligosaccharide deacetylase ChbG, YdjC/CelG family | Carbohydrate transport and metabolism [G] | 0.99 |
| COG4249 | Uncharacterized conserved protein, contains caspase domain | General function prediction only [R] | 0.99 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 100.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.89% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 7.92% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 6.93% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.93% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.94% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 5.94% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.95% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 3.96% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.97% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.97% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.97% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.98% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.98% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.98% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.98% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.98% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.98% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.99% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.99% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.99% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.99% |
| Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.99% |
| Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.99% |
| Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.99% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.99% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.99% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.99% |
| Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.99% |
| Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.99% |
| Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.99% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.99% |
| Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.99% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2189573000 | Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis 0-21cm (T0 for microcosms) | Environmental | Open in IMG/M |
| 3300001545 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 | Environmental | Open in IMG/M |
| 3300003319 | Sugarcane bulk soil Sample L2 | Environmental | Open in IMG/M |
| 3300004020 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 | Environmental | Open in IMG/M |
| 3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental | Open in IMG/M |
| 3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental | Open in IMG/M |
| 3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated | Open in IMG/M |
| 3300005353 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG | Host-Associated | Open in IMG/M |
| 3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005457 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG | Host-Associated | Open in IMG/M |
| 3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated | Open in IMG/M |
| 3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental | Open in IMG/M |
| 3300005547 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaG | Environmental | Open in IMG/M |
| 3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300005840 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2 | Host-Associated | Open in IMG/M |
| 3300005874 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_404 | Environmental | Open in IMG/M |
| 3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated | Open in IMG/M |
| 3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated | Open in IMG/M |
| 3300009821 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental | Open in IMG/M |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
| 3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental | Open in IMG/M |
| 3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental | Open in IMG/M |
| 3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
| 3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
| 3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
| 3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental | Open in IMG/M |
| 3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental | Open in IMG/M |
| 3300014882 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231B'_16_10D | Environmental | Open in IMG/M |
| 3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300017939 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MG | Environmental | Open in IMG/M |
| 3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental | Open in IMG/M |
| 3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental | Open in IMG/M |
| 3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
| 3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental | Open in IMG/M |
| 3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental | Open in IMG/M |
| 3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental | Open in IMG/M |
| 3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental | Open in IMG/M |
| 3300022534 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1 | Environmental | Open in IMG/M |
| 3300025560 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 (SPAdes) | Environmental | Open in IMG/M |
| 3300025914 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025931 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026047 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rd (SPAdes) | Environmental | Open in IMG/M |
| 3300026089 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental | Open in IMG/M |
| 3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental | Open in IMG/M |
| 3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental | Open in IMG/M |
| 3300027383 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027681 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental | Open in IMG/M |
| 3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental | Open in IMG/M |
| 3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027995 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MG | Environmental | Open in IMG/M |
| 3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300028715 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203 | Environmental | Open in IMG/M |
| 3300028716 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198 | Environmental | Open in IMG/M |
| 3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental | Open in IMG/M |
| 3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental | Open in IMG/M |
| 3300028796 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141 | Environmental | Open in IMG/M |
| 3300028803 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120 | Environmental | Open in IMG/M |
| 3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031765 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22 | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| 3300032008 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f18 | Environmental | Open in IMG/M |
| 3300032143 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_0 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300032211 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D1 | Environmental | Open in IMG/M |
| 3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental | Open in IMG/M |
| 3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental | Open in IMG/M |
| 3300033551 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5 | Environmental | Open in IMG/M |
| 3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental | Open in IMG/M |
| 3300034820 | Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2 | Host-Associated | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| N55_00320720 | 2189573000 | Grass Soil | MVVYEFDSEESLRDFAASDTLKAMTEDYEARFGGAGDRARFTYRQVFP |
| JGI12630J15595_100989162 | 3300001545 | Forest Soil | YVPVPVDVGAHPGSEPWQYMVVYEFDSEEALRAFAASDTLKAMTQDYEARFGGAGDRVRFTYRQVFP* |
| soilL2_102274711 | 3300003319 | Sugarcane Root And Bulk Soil | YMVCYEFDSEASLQAFVASDTLQAMTRDYDSRFRGDRARFAYRQIHP* |
| Ga0055440_101730031 | 3300004020 | Natural And Restored Wetlands | ESPHAGSEPWQYMVCYEFDSEESLQAFVDRTLRAMTTDYNARFGGAGDRARLAYRQIYP* |
| Ga0062589_1007991122 | 3300004156 | Soil | GSEPWQYMVCYEFDSEASLEAFVRSDTLRAMTGDYNARFGGAGDRTRLAYRQIYP* |
| Ga0070690_1007211441 | 3300005330 | Switchgrass Rhizosphere | MVCYEFDSEASLQAFVQSDTLRAMTKDYDTRFANTSTRARFAYRQIYP* |
| Ga0070690_1007378962 | 3300005330 | Switchgrass Rhizosphere | EPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDARFRGERARFAYQQIFP* |
| Ga0068869_1000172306 | 3300005334 | Miscanthus Rhizosphere | EFDSEASLQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP* |
| Ga0070669_1002568602 | 3300005353 | Switchgrass Rhizosphere | SEPWQYMVCYEFDSEASLQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP* |
| Ga0070694_1000239511 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | HAGSEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFGGSGDRARLAYRQIYP* |
| Ga0070708_1010157543 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | TPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP* |
| Ga0070662_1009943571 | 3300005457 | Corn Rhizosphere | EPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP* |
| Ga0068867_1001704911 | 3300005459 | Miscanthus Rhizosphere | VCYEFDSEASLQAFVHSDTLRAMTKDYDTRFANTSTRARFAYRQIYP* |
| Ga0068867_1002445863 | 3300005459 | Miscanthus Rhizosphere | CYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP* |
| Ga0070699_1003579243 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MRRYAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNERFGGSGARARLAYRQIYP* |
| Ga0070693_1003680181 | 3300005547 | Corn, Switchgrass And Miscanthus Rhizosphere | SEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGDRARLAYRQIYP* |
| Ga0068859_1028255951 | 3300005617 | Switchgrass Rhizosphere | LETPHAGSEPWQYMVCYEFDSEASLEAFVRSDTLRAMTGDYNARFGGAGDRTRLAYRQIYP* |
| Ga0066905_1001858013 | 3300005713 | Tropical Forest Soil | VPIALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP* |
| Ga0066905_1005161331 | 3300005713 | Tropical Forest Soil | VPIALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLAAMTRDYEARFGGAGHRVRFTYRQVFP* |
| Ga0066903_1005920593 | 3300005764 | Tropical Forest Soil | MVVYEFDSEEALRDFAASDTLKAMTQDYEARFGGAGDRVRFTYRQIFP* |
| Ga0068870_111103352 | 3300005840 | Miscanthus Rhizosphere | EPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDARFRGERARFAYQQIFP* |
| Ga0075288_10649942 | 3300005874 | Rice Paddy Soil | YAALELDSAHAGAEPWQYMVCYEFDSEASLQAFVRSDTLRAMTQDYDSRFAGARARFAYRQIFP* |
| Ga0070716_1001262861 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | YMVCYEFDSEESLRAFVDSDTLRAMTRDYDARFGGARARLAYRQIYP* |
| Ga0079221_112888042 | 3300006804 | Agricultural Soil | YAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP* |
| Ga0075436_1011680491 | 3300006914 | Populus Rhizosphere | GTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP* |
| Ga0105249_103630553 | 3300009553 | Switchgrass Rhizosphere | VDVGVHPGSEPWQYMVVYEFDSEESLRDFAASDTLKAMTEDYEARFGGAGDRARFTYRQVFP* |
| Ga0105249_117434251 | 3300009553 | Switchgrass Rhizosphere | GREPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP* |
| Ga0105064_10594232 | 3300009821 | Groundwater Sand | WQYMVCYEFDSEESLRAFVNSDTLRAMTKDYNARFGGAGERARLAYRQIYP* |
| Ga0126382_109979672 | 3300010047 | Tropical Forest Soil | WQYMVCYEFDSEDSMQAFVRSDTLRAMTKDYDSRFRGDRARFAYRQIFP* |
| Ga0126373_125381432 | 3300010048 | Tropical Forest Soil | ALDVGAHPGSEPWQYMVVYEFDSEQALRNFTASETLAAMTRDYEARFGGAGNRARFTYRQVFP* |
| Ga0126379_133703201 | 3300010366 | Tropical Forest Soil | EPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP* |
| Ga0126381_1034001751 | 3300010376 | Tropical Forest Soil | SEPWQYMVVYEFDSEQALRNFTASETLAAMTRDYEARFGGAGHRARFTYRQVFP* |
| Ga0134127_118034471 | 3300010399 | Terrestrial Soil | TVCYEFDSEASLQAFVQSDTLKAMTRDYDSRFGGDRARFAYRQIYP* |
| Ga0137374_108452532 | 3300012204 | Vadose Zone Soil | SLDVGVHAGSEPWQYMVVYEFDSEESLREFAASDTLKAMTQDYEARFGGAGDRARLTYRQVFP* |
| Ga0137381_116558322 | 3300012207 | Vadose Zone Soil | VCYEFDSEESLQAFVRSDMLRAMTRDYNARFGGAGDRARLAYRQIYP* |
| Ga0137367_106378681 | 3300012353 | Vadose Zone Soil | GSEPWQYMVVYEFDSEESLREFAASETLKAMTQDYEARFGGAGDRARLAYRQVFP* |
| Ga0137397_100298081 | 3300012685 | Vadose Zone Soil | EPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGDHARFAYRQIYP* |
| Ga0137394_101444613 | 3300012922 | Vadose Zone Soil | AGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGDRARFAYRQIYP* |
| Ga0137410_100633521 | 3300012944 | Vadose Zone Soil | EPWQYMVCYEFDSEASLQAFVRSDTLQAMTRDYNARFGGAGDRARLAYRQIYP* |
| Ga0126375_105868261 | 3300012948 | Tropical Forest Soil | GSEPWQYMVVYEFDSEEALRDFAASDTLKAMTEDYEARFGGAGDRVHFTYRQIFP* |
| Ga0126375_106378601 | 3300012948 | Tropical Forest Soil | ALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASDTLRAMTRDYEARFGDAGHRVRFTYRQVFP* |
| Ga0126369_125725432 | 3300012971 | Tropical Forest Soil | ALDVGAHPGSEPWQYMVVYEFDSEEALRNFTASETLVAMTRDYEARFGGAGHRARFTYRQVFP* |
| Ga0164306_108276131 | 3300012988 | Soil | ALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP* |
| Ga0180069_10321452 | 3300014882 | Soil | MRRYAPIPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP* |
| Ga0134089_100862392 | 3300015358 | Grasslands Soil | YEFESEAALHAFVHSDTLKAMTRDYNARFAGAGERARFTYRQIFP* |
| Ga0187775_100524371 | 3300017939 | Tropical Peatland | EPWQYMVCYEFDSEESLRAFVASDTLRAMTRDYNARFGGAGERARLAYRQIYP |
| Ga0184638_10989041 | 3300018052 | Groundwater Sediment | MVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| Ga0184632_104696061 | 3300018075 | Groundwater Sediment | SPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| Ga0190265_114291882 | 3300018422 | Soil | CYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGERARLAYRQIYP |
| Ga0187893_102024891 | 3300019487 | Microbial Mat On Rocks | AGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0210401_111359831 | 3300020583 | Soil | AIPLDSPHAGSGPWQYMVCYEFDSEESLRAFVVSDTLRAMTKDYDSRFGGGKRARLAYRQIYP |
| Ga0210382_105686102 | 3300021080 | Groundwater Sediment | GGRLESAHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP |
| Ga0179596_101303853 | 3300021086 | Vadose Zone Soil | AATRPFPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP |
| Ga0210404_100378784 | 3300021088 | Soil | SEPWQYMVCYEFDSEESLRAFVVSDTLRAMTKDYDSRFGGGGERARLAYRQIYP |
| Ga0224452_10327081 | 3300022534 | Groundwater Sediment | EFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| Ga0210108_10745162 | 3300025560 | Natural And Restored Wetlands | PWQYMVVYEFDSEEALRAFTASDTLKAMTRDYEARFGGAGDRVRFTYRQVFP |
| Ga0207671_109857621 | 3300025914 | Corn Rhizosphere | IETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| Ga0207693_101735191 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | WQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP |
| Ga0207646_107500081 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | TPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP |
| Ga0207681_103161511 | 3300025923 | Switchgrass Rhizosphere | YEFDSEAALQAFVRSETLQDMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0207681_117446592 | 3300025923 | Switchgrass Rhizosphere | LPLESAHAGREPWQYMVCYEFDSEASLQAFVRSDTLRAMKKDYDSRFRGERARFAYQQIF |
| Ga0207644_105263211 | 3300025931 | Switchgrass Rhizosphere | AALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| Ga0207706_101611773 | 3300025933 | Corn Rhizosphere | MALDTPHAGSEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFGGSGDRARLAYRQIYP |
| Ga0207704_103205083 | 3300025938 | Miscanthus Rhizosphere | MVCYEFDSEASLQAFVQSDTLRAMTKDYDTRFANTSTRARFAYRQIYP |
| Ga0208658_10228342 | 3300026047 | Natural And Restored Wetlands | ESDHAGAEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0207648_100142699 | 3300026089 | Miscanthus Rhizosphere | CYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| Ga0257169_10094642 | 3300026469 | Soil | PWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| Ga0209378_10306043 | 3300026528 | Soil | MVIYEFESEAALHAFVHSDTLKAMTRDYNTRFAGAGERARFTYRQIFP |
| Ga0179587_107438451 | 3300026557 | Vadose Zone Soil | QYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGERARLAYRQIYP |
| Ga0209213_11062621 | 3300027383 | Forest Soil | HAGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0208991_11488121 | 3300027681 | Forest Soil | MVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGERARLAYR |
| Ga0209178_13826781 | 3300027725 | Agricultural Soil | PLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP |
| Ga0209177_100252763 | 3300027775 | Agricultural Soil | GGSEPWQYMVCYEFDSEESLRAFVQSDTLRAMTRDYNARFTGDRARFAYRQIYP |
| Ga0209180_105559391 | 3300027846 | Vadose Zone Soil | CYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP |
| Ga0209701_101408781 | 3300027862 | Vadose Zone Soil | YAPISLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| Ga0209488_102505493 | 3300027903 | Vadose Zone Soil | RRYAALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGSGARARLAYRQIYP |
| Ga0207428_101915141 | 3300027907 | Populus Rhizosphere | HAGREPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYQQIFP |
| (restricted) Ga0233418_101187201 | 3300027995 | Sediment | YAALSLDTPHAGSEPWQYMVCYEFDSEESLEAFIASDTLRAMTRDYNARFGGAGERARLAYRQIYP |
| Ga0268265_101318201 | 3300028380 | Switchgrass Rhizosphere | ETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| Ga0307313_101194712 | 3300028715 | Soil | ALPLESAHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP |
| Ga0307311_101614961 | 3300028716 | Soil | VCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP |
| Ga0307323_101763982 | 3300028787 | Soil | PWQYMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP |
| Ga0307504_101498082 | 3300028792 | Soil | AHAGSEPWQYMVCYEFDSEASLQAFVGSDTLQAMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0307287_101238811 | 3300028796 | Soil | YMVCYEFDSEESLQAFVRSDTLRAMTQDYNARFGGAGDRARLAYRQIYP |
| Ga0307281_100268991 | 3300028803 | Soil | PIPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQIYP |
| (restricted) Ga0255310_101699512 | 3300031197 | Sandy Soil | GSEPWQYMVCYEFDSEESLRAFVTSDTLRAMTKDYNARFGGGSDRARLAYRQIHP |
| Ga0307469_102824461 | 3300031720 | Hardwood Forest Soil | VPVPVDVGVHPGSEPWQYMVVYEFDSEESLRDFAASNTLRAMTEDYEARFGGAGDRARFTYRQVFP |
| Ga0307468_1003669593 | 3300031740 | Hardwood Forest Soil | VGVHPGAEPWQYMVVYEFDSEESLRDFAASDTLRAMTVDYEARFGGAGDRARLTYRQVFP |
| Ga0307468_1008475722 | 3300031740 | Hardwood Forest Soil | EPWQYMVCYEFDSEASLQAFVRSDTLRAMTKDYDSRFRGERARFAYRQIYP |
| Ga0307468_1024517351 | 3300031740 | Hardwood Forest Soil | REPWQYMVCYEFDSEAALQAFVRSDTLRAMTKDYDARFRGERARLAYRQIFP |
| Ga0318554_102285221 | 3300031765 | Soil | QYMVCYEFDSEASLRAFAASDTLRAMTEDYERRFGGAGERVRLAYRQIYP |
| Ga0307473_104213092 | 3300031820 | Hardwood Forest Soil | WQYMVCYEFDSEASLQAFVSSDTLRAMTKDYNSRFRGERARFAYRQIYP |
| Ga0318562_105463201 | 3300032008 | Soil | PHAGAEPWQYMVCYEFDSEASLRAFAASDTLRAMTEDYERRFGGAGERVRLAYRQIYP |
| Ga0315292_108509392 | 3300032143 | Sediment | VALDVGVHQGSEPWQYMVVYEFDSEEALRAFAASDTLKAMTQDYEARFGGAGDRVRLTYRQVFP |
| Ga0307472_1015306461 | 3300032205 | Hardwood Forest Soil | AALPLETPHAGSEPWQYMVCYEFDSEESLRAFVESDTLRAMTRDYNARFGGDRARLAYRQIYP |
| Ga0310896_107084002 | 3300032211 | Soil | ALPIETPHGGTEPWQYMVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| Ga0335085_103700262 | 3300032770 | Soil | SEPWQYMVCYAFDSEAALQAFVRSDTLRAMTKDYDSRFRGERARFAYRQIFP |
| Ga0214472_117559101 | 3300033407 | Soil | IPLESPHAGSEPWQYMVCYEFDSEESLQAFVRSDTLRAMTRDYNTRFGGAGDRARLAYRQVYP |
| Ga0247830_113303531 | 3300033551 | Soil | MRRYAALPLESAHAGSEPWQYMVCYEFDSEASLQAFVRSDTLRAMTRDYNARFGGAGDRARLAYRQIYP |
| Ga0326723_0437433_415_594 | 3300034090 | Peat Soil | MRRYVPVPLDVGAHPGSEPWQYMVVYEFDSEEALRAFTASDTLKAMTRDYEARFGGAGDR |
| Ga0373959_0093183_16_156 | 3300034820 | Rhizosphere Soil | MVCYEFDSEASLRAFVQSDTLRAMTRDYNARFKGDRARLAYRQIYP |
| ⦗Top⦘ |