| Basic Information | |
|---|---|
| Family ID | F091688 |
| Family Type | Metagenome |
| Number of Sequences | 107 |
| Average Sequence Length | 53 residues |
| Representative Sequence | FGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD |
| Number of Associated Samples | 96 |
| Number of Associated Scaffolds | 107 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 92 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.27 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (19.626 % of family members) |
| Environment Ontology (ENVO) | Unclassified (30.841 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (57.009 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 23.08% β-sheet: 0.00% Coil/Unstructured: 76.92% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.27 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 107 Family Scaffolds |
|---|---|---|
| PF00296 | Bac_luciferase | 31.78 |
| PF01040 | UbiA | 11.21 |
| PF00378 | ECH_1 | 5.61 |
| PF00578 | AhpC-TSA | 2.80 |
| PF06742 | DUF1214 | 1.87 |
| PF04542 | Sigma70_r2 | 1.87 |
| PF01346 | FKBP_N | 1.87 |
| PF02777 | Sod_Fe_C | 1.87 |
| PF07883 | Cupin_2 | 1.87 |
| PF01797 | Y1_Tnp | 0.93 |
| PF13193 | AMP-binding_C | 0.93 |
| PF04545 | Sigma70_r4 | 0.93 |
| PF07690 | MFS_1 | 0.93 |
| PF12697 | Abhydrolase_6 | 0.93 |
| PF13450 | NAD_binding_8 | 0.93 |
| PF13469 | Sulfotransfer_3 | 0.93 |
| PF00975 | Thioesterase | 0.93 |
| PF04493 | Endonuclease_5 | 0.93 |
| PF01593 | Amino_oxidase | 0.93 |
| PF05721 | PhyH | 0.93 |
| PF00903 | Glyoxalase | 0.93 |
| PF12680 | SnoaL_2 | 0.93 |
| PF00795 | CN_hydrolase | 0.93 |
| PF00106 | adh_short | 0.93 |
| COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds |
|---|---|---|---|
| COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 31.78 |
| COG0545 | FKBP-type peptidyl-prolyl cis-trans isomerase | Posttranslational modification, protein turnover, chaperones [O] | 1.87 |
| COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 1.87 |
| COG0605 | Superoxide dismutase | Inorganic ion transport and metabolism [P] | 1.87 |
| COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 1.87 |
| COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 1.87 |
| COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 1.87 |
| COG5361 | Uncharacterized conserved protein | Mobilome: prophages, transposons [X] | 1.87 |
| COG5402 | Uncharacterized protein, contains DUF1214 domain | Function unknown [S] | 1.87 |
| COG1515 | Deoxyinosine 3'-endonuclease (endonuclease V) | Replication, recombination and repair [L] | 0.93 |
| COG1943 | REP element-mobilizing transposase RayT | Mobilome: prophages, transposons [X] | 0.93 |
| COG5285 | Ectoine hydroxylase-related dioxygenase, phytanoyl-CoA dioxygenase (PhyH) family | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.93 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 100.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 19.63% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.41% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 6.54% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 6.54% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 6.54% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.61% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.74% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 3.74% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.74% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 2.80% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.80% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.80% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 2.80% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.80% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 1.87% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.87% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.87% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.87% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.87% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.87% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Sediment | 0.93% |
| Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.93% |
| Hot Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Hot Spring | 0.93% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.93% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.93% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.93% |
| Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.93% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.93% |
| Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.93% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.93% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.93% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2088090009 | Freshwater sediment microbial communities from Lake Washington, Seattle, for methane and nitrogen Cycles - SIP 13C-methane anaerobic+nitrate | Environmental | Open in IMG/M |
| 3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental | Open in IMG/M |
| 3300004025 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1 | Environmental | Open in IMG/M |
| 3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental | Open in IMG/M |
| 3300004633 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated | Open in IMG/M |
| 3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental | Open in IMG/M |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010313 | Hot spring microbial communities from South Africa to study Microbial Dark Matter (Phase II) - Sagole hot spring metaG | Environmental | Open in IMG/M |
| 3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
| 3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012903 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S134-311R-1 | Environmental | Open in IMG/M |
| 3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental | Open in IMG/M |
| 3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300013127 (restricted) | Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300014321 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D1 | Environmental | Open in IMG/M |
| 3300015053 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated | Open in IMG/M |
| 3300015374 | Col-0 rhizosphere combined assembly | Host-Associated | Open in IMG/M |
| 3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental | Open in IMG/M |
| 3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental | Open in IMG/M |
| 3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental | Open in IMG/M |
| 3300017947 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_4_20_MG | Environmental | Open in IMG/M |
| 3300017959 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MG | Environmental | Open in IMG/M |
| 3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental | Open in IMG/M |
| 3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental | Open in IMG/M |
| 3300020084 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015032 Kigoma Deep Cast 1200m | Environmental | Open in IMG/M |
| 3300025939 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300026067 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026314 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes) | Environmental | Open in IMG/M |
| 3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
| 3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
| 3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental | Open in IMG/M |
| 3300026354 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B | Environmental | Open in IMG/M |
| 3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental | Open in IMG/M |
| 3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
| 3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
| 3300030619 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq) | Environmental | Open in IMG/M |
| 3300031668 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f23 | Environmental | Open in IMG/M |
| 3300031680 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22 | Environmental | Open in IMG/M |
| 3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental | Open in IMG/M |
| 3300031751 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24 | Environmental | Open in IMG/M |
| 3300031765 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f22 | Environmental | Open in IMG/M |
| 3300031799 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f21 | Environmental | Open in IMG/M |
| 3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental | Open in IMG/M |
| 3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental | Open in IMG/M |
| 3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental | Open in IMG/M |
| 3300032177 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G05_0 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental | Open in IMG/M |
| 3300032782 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1 | Environmental | Open in IMG/M |
| 3300032828 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4 | Environmental | Open in IMG/M |
| 3300032829 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3 | Environmental | Open in IMG/M |
| 3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental | Open in IMG/M |
| 3300033233 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottom | Environmental | Open in IMG/M |
| 3300033419 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day5_noCT | Environmental | Open in IMG/M |
| 3300033434 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day10_CT_b | Environmental | Open in IMG/M |
| 3300033486 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_A | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| LWAnN_07626760 | 2088090009 | Freshwater Sediment | ALELAEFGFDAAKGLPVENPGRVCAMLATAKHPLLFSGKDLYGPTFHAEHANVSFE |
| F14TC_1025677782 | 3300000559 | Soil | FGFDASKGLPTDNPGRVCAMLATADDPMFFSGRDVHGPTFYSEHALTRFA* |
| Ga0055433_101745261 | 3300004025 | Natural And Restored Wetlands | VENPGRVCAMLATSKNPMHFSGRDLRGPELYLEHSLLRFDG* |
| Ga0055490_101895081 | 3300004052 | Natural And Restored Wetlands | PGFVGTERMAAELGEFGFDASKALPVENPGRVCAMLATATDPMVFSGRDLRGPEVYREHAVLRFEP* |
| Ga0066395_103666581 | 3300004633 | Tropical Forest Soil | MPGFVGTERMAQELGEFGFDAAKALPVENPGRVCAMLATAKDPMHFSGKDIYGPAFHAEHALVRFETEGNR* |
| Ga0066672_106907572 | 3300005167 | Soil | SKALPVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD* |
| Ga0066685_106537041 | 3300005180 | Soil | ELGEFGFDASKALPVENPGRVCAMLATAKDPMHFSGTDLYGPAFYAEHALVRFEA* |
| Ga0066388_1003254563 | 3300005332 | Tropical Forest Soil | GFVATERMAAELGPFGFDASKGLPVENPGRVCAMLATAKDPMFFTGRDVHGPTFHAEHAVTRFDG* |
| Ga0066388_1072052742 | 3300005332 | Tropical Forest Soil | FVATERMAAELAAFGFDASKGLPTENPGRVCAMLATADDPMLFSGRDIHGPTFHAEHATTRFA* |
| Ga0068869_1010026081 | 3300005334 | Miscanthus Rhizosphere | GLMPGFVGTERMAIELKEFGFDAARGLPVENPGRVCAMLATAKDPLYFSGKDIFGPGFHAEHSQVRFD* |
| Ga0070673_1012828582 | 3300005364 | Switchgrass Rhizosphere | MPGFVGTERMAIELGEFGFDASKALPVENPGRVCAMLATAKDPMHFSGRDLRGPELYREHSLLRFDD* |
| Ga0070708_1018941131 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MPGFVGTERMAQELGEFGFDATKALPVENPGRVCAMLATARNPMYFSGKDIYGPAFHAEHALVRFDAEEGDR* |
| Ga0066701_100889593 | 3300005552 | Soil | EFGFDASKALPVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD* |
| Ga0066701_101674781 | 3300005552 | Soil | LPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD* |
| Ga0066701_107207752 | 3300005552 | Soil | KALPVENPGRVCAMLATAKDPMHFSGTDLYGPAFYAEHALVRFEA* |
| Ga0066698_103073721 | 3300005558 | Soil | VQNPGRVCAMLATAKDPMYFSGKDVYGPAFHAEHSLISLDE* |
| Ga0066706_107245902 | 3300005598 | Soil | GFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVHFD* |
| Ga0066905_1015099231 | 3300005713 | Tropical Forest Soil | FVGTERMAAELGEFGFDASKALPVENPGRVCAMLATATDPMVFSGRDLDGPTFYAEHAGVTFAR* |
| Ga0066903_1021428361 | 3300005764 | Tropical Forest Soil | PVENPGRVCAMLATAADPMYFSGKDVYGPAFHAEHANVRFEV* |
| Ga0066903_1033953972 | 3300005764 | Tropical Forest Soil | PGRVCAMLATAENPLYFSGKDVYGPAFHAEHAIVRFD* |
| Ga0066903_1043815311 | 3300005764 | Tropical Forest Soil | LPPDNPGRVCAMLATADDPMWFSGRDVHGPTFHTEHAITRFA* |
| Ga0066656_106638592 | 3300006034 | Soil | VENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFD* |
| Ga0066665_115209321 | 3300006796 | Soil | RMAIELGEFGFDASKALPVENPGRVCAMLATAKDPMHFSGTDLYGPAFYAEHALVRFEA* |
| Ga0066659_105615172 | 3300006797 | Soil | LPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHADHALVRFD* |
| Ga0079221_114226251 | 3300006804 | Agricultural Soil | GRVCGMLATAKDPMFFSGKDIFGPGFHAEHALVRFDP* |
| Ga0075433_109509241 | 3300006852 | Populus Rhizosphere | EFGFDASKALPVENPGRVCAMLATAADPMHFSGRDLRGPELYREHELLRFEP* |
| Ga0075425_1009015741 | 3300006854 | Populus Rhizosphere | EFGFDASKALPVENPGRVCAMLATAENPLYFSGKDVYGPAFHAEHAIVRFD* |
| Ga0079215_102292452 | 3300006894 | Agricultural Soil | REYGFDPSKGLPVENPGRVCAMLATARDPMFFSGRDLRGPEFFAEHEQVRFSE* |
| Ga0079219_113041321 | 3300006954 | Agricultural Soil | FGFDAAKALPVENPGRVCAMLATATDPMHFSGKDIYGPAFHAEHALVRFETGGNR* |
| Ga0099793_106550911 | 3300007258 | Vadose Zone Soil | GFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHTEHALVRFD* |
| Ga0066709_1018397901 | 3300009137 | Grasslands Soil | NPGRVCAMLATAKDPMHFSGTDLYGPAFYAEHALVRFEA* |
| Ga0075423_109122622 | 3300009162 | Populus Rhizosphere | VGTERMAAELGEFGFDASKALPVENPGRVCAMLATAENPLYFSGKDVYGPAFHAEHAIVRFD* |
| Ga0126374_115535151 | 3300009792 | Tropical Forest Soil | SKALPVENPGRVCAMLATAKDPMFFTGKDIYGPAFHAEHMLVTP* |
| Ga0126382_106412772 | 3300010047 | Tropical Forest Soil | GFVGTERMAAELGEFGFDASKALPVENPGRVCAMLATATDPMVFSGRDLDGPTFYAEHAGVTFAGQVAPGSSR* |
| Ga0116211_11134391 | 3300010313 | Hot Spring | RIAAELAEFGFDASRGLPPENPGRVCAMLATATDPMFFSGRDVHGPTFHAEHALTRFEG* |
| Ga0134064_104219322 | 3300010325 | Grasslands Soil | GFDASKALPVENPGRVCAMLATSTNPMRFSGRDLRGPEMYLDHAALKFD* |
| Ga0134080_100729883 | 3300010333 | Grasslands Soil | GFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFD* |
| Ga0134063_102931402 | 3300010335 | Grasslands Soil | FVGTERMAAELAEFGFDASKALPVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD* |
| Ga0134063_106500752 | 3300010335 | Grasslands Soil | FVGTERMAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEQALVRFD* |
| Ga0126372_106752821 | 3300010360 | Tropical Forest Soil | IELAEFGFDASKGLPVENPGRVCAMLATAAEPLYFSGKDIYGPAFHEEHRLVRFD* |
| Ga0126379_105902442 | 3300010366 | Tropical Forest Soil | FVATERMAAELGPFGFDASKGLPVENPGRVCGMLATADDPMFFSGRDVHGPTFHAEHALTRFAG* |
| Ga0136847_112113502 | 3300010391 | Freshwater Sediment | VPTTVEPVEHPGRVCAMLATASAPLWFSGRDVHGPTFYAEHAGVRFAPA* |
| Ga0136847_125900952 | 3300010391 | Freshwater Sediment | GRVCAMLATAKDPMFFSGKDVYGPGFHAEHALVRFE* |
| Ga0126383_127214461 | 3300010398 | Tropical Forest Soil | KGLPTDNPGRVCAMLATADDPMLFSGHDVYGPRFYAEHAGTRFAP* |
| Ga0134122_123202812 | 3300010400 | Terrestrial Soil | ENPGRVCAMLATARDPMVFTGKDIHGPTFYAEHEGVIFN* |
| Ga0137358_100609871 | 3300012582 | Vadose Zone Soil | MAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD* |
| Ga0157289_103596152 | 3300012903 | Soil | MDSTREALRRLVVGTLMPGFVGTERMAIELGQFGFDASKALPVENPGRVCAMLATSTNPMHFSGRDLRGPELYLEHSLLRFDD* |
| Ga0126369_121144372 | 3300012971 | Tropical Forest Soil | ENPGRVCAMLATAKDPMHFSGRDLRGPELHLEHSLLRFDG* |
| Ga0134077_101121031 | 3300012972 | Grasslands Soil | PVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD* |
| (restricted) Ga0172365_100971253 | 3300013127 | Sediment | VCAMLATAKDPMVFSGRDIFGPTFHDDHANVRFT* |
| Ga0134078_100590473 | 3300014157 | Grasslands Soil | SKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFE* |
| Ga0075353_11270061 | 3300014321 | Natural And Restored Wetlands | GFDPAKGLSVENPGRVCAMLASARDPMVFTGRDLRGPEFYAEHEGVQFDPPGDG* |
| Ga0137405_14204916 | 3300015053 | Vadose Zone Soil | MAAELGEFGFDASKALPVENPGRVCAMLATAEDPMHFSGRDIHGPSFHAEHALVRFEG* |
| Ga0137418_105966752 | 3300015241 | Vadose Zone Soil | MAAELGEFGFDASKALPVENPGRVCAMLATAKDPMFFSGKDIYGPGFHAEHALVRFD* |
| Ga0180085_11360141 | 3300015259 | Soil | GFVGTERMAIELGEFGFDASKALPVENPGRVCAMLATSANPMHFSGRDLRGPELYLEHSLLRFDG* |
| Ga0134085_103625782 | 3300015359 | Grasslands Soil | LPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVHFD* |
| Ga0132257_1019884262 | 3300015373 | Arabidopsis Rhizosphere | GTERMAAELGEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPAFHAEHANVQFEV* |
| Ga0132255_1057131801 | 3300015374 | Arabidopsis Rhizosphere | KGLPTDNPGRVCAMLATADDPMFFSGRDVHGPTFHAEHALTRFG* |
| Ga0132255_1059654082 | 3300015374 | Arabidopsis Rhizosphere | NPGRVCAMLATASDPMYFSGKDIFGPGFHAEHSLVRFDR* |
| Ga0182034_115079381 | 3300016371 | Soil | ELGPFGFDASKGLLVENPGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0134112_100443541 | 3300017656 | Grasslands Soil | AELAEFGFDASKALPVENPGRVCAMIATAKDPMYFSGKDIYGPGFHAEHTLVRFD |
| Ga0134074_10210911 | 3300017657 | Grasslands Soil | FGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD |
| Ga0187785_107537172 | 3300017947 | Tropical Peatland | GEFGFDASKALPVENPGRVCAMLATSSDPMFFSGRDLRGPELYAEHARLRFADDA |
| Ga0187779_106494792 | 3300017959 | Tropical Peatland | SKGLPVENPGRVCAMLATAADPLYFSGKDIYGPAFHEDHRLVRFD |
| Ga0184615_102447891 | 3300018059 | Groundwater Sediment | MPGFVGTERMAIELKDFGFDASKGLPVENPGRVCAMLATAKDPLYFSGKDIYGPGFYAEHEQVRFE |
| Ga0066669_110628072 | 3300018482 | Grasslands Soil | GLAAVRALPVEHPGRVCAMLATAKDPMHFSGRDVHGPSFHAEHALVHFE |
| Ga0187894_104658672 | 3300019360 | Microbial Mat On Rocks | GLMPGFVGTERMAIELKEFGFDASKGLPVENPGRVCAMLATAKDPLYFSGTDLYGPSFHAEHELVRFE |
| Ga0173479_102567271 | 3300019362 | Soil | MAIELGQFGFDASKALSVENPGRVCAMLATSTNPMHFSGRDLRGPELYLEHSLLRFDD |
| Ga0194110_105139441 | 3300020084 | Freshwater Lake | VDNPGRVCAMLATATDPMVFSGRDIFGPTFHEDHANVRFS |
| Ga0207665_108429641 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | FGFDASKALPVENPGRVCAMLATATDPMHFSGKDIYGPAFHAEHALVRFEA |
| Ga0207678_116607062 | 3300026067 | Corn Rhizosphere | MPGFVGTERMAIELADFGFDASRGLPVENPGRVCAMLATATDPLLFSGKDIYGPLFHAEHAAVRFDAS |
| Ga0207678_119918302 | 3300026067 | Corn Rhizosphere | TERMAIELGDFGFDASKALPVDNPGRVCAMLATAKDPLFFSGRDIYGPDLYAEMARLRFE |
| Ga0209761_11362041 | 3300026313 | Grasslands Soil | SKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFE |
| Ga0209268_10040651 | 3300026314 | Soil | TERMAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVHFD |
| Ga0209802_10134778 | 3300026328 | Soil | GLMPGFVGTERMAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD |
| Ga0209267_12985101 | 3300026331 | Soil | PVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD |
| Ga0209803_10199081 | 3300026332 | Soil | ELAEFGFDASKALPVENPGRVCAMLATARDPMYFSGKDVYGPGFHAEHALVRFD |
| Ga0209159_11536172 | 3300026343 | Soil | ERMAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFE |
| Ga0257180_10459821 | 3300026354 | Soil | ELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPGFHAEHALVRFD |
| Ga0209378_11698971 | 3300026528 | Soil | ALPVENPGRVCAMLATAKDPMHFSGKDIYGPDFHAEHALVRFD |
| Ga0209058_11962461 | 3300026536 | Soil | ENPGRVCAMLATAKDPLYFSGKDVYGPGFHAEHALVRFE |
| Ga0209058_12208962 | 3300026536 | Soil | ASKALPVENPGRVCAMLATAKDPMHFSGKDVYGPDFHAEHALVRFD |
| Ga0209157_13296442 | 3300026537 | Soil | MAAELAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDIYGPGFHAEHALVRFD |
| Ga0209056_102235753 | 3300026538 | Soil | MPGFVGTERMAIELGEFGFDASKALPVENPGRVCAMLATAKDPMHFSGTDLYGPAFYAEHALVRFEA |
| Ga0209161_105732451 | 3300026548 | Soil | KALPVENPGRVCAMLATARDPMYFSGKDVYGPGFHAEHALVRFD |
| Ga0268386_102031971 | 3300030619 | Soil | LGEFGFDASKALPVEHPGRVCAMLATARDPLFFSGRDLRGPEVYAQAERLRFDG |
| Ga0318542_104806011 | 3300031668 | Soil | AELGPFGFDASKGLPVENPGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0318574_107154361 | 3300031680 | Soil | FGFDASKALPVENPGRVCAMLATATDPMFFSGRDVFGPAFFEEHRQVRFD |
| Ga0310813_118691321 | 3300031716 | Soil | LAEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPAFHAEHANVQFEV |
| Ga0318494_106931401 | 3300031751 | Soil | AELGPFGFDASKGLPVENPGRVCAMLATADDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0318554_101233463 | 3300031765 | Soil | IGLMPGFVGTERMAAELGEFGFDATKALPVENPGRVCAMLATATDPMFFSGRDVFGPAFFEEHRQVRFD |
| Ga0318565_102017332 | 3300031799 | Soil | KGLPVENPGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0306919_110309972 | 3300031879 | Soil | FVATERMAAELGPFGFDASKGLPVENPGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0310912_113061221 | 3300031941 | Soil | VATERMAAELGPFGFDASKGLPVENPGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0306926_116584492 | 3300031954 | Soil | PGRVCAMLATAEDPMFFSGRDVHGPTFHAEHALTRFAG |
| Ga0315276_123746571 | 3300032177 | Sediment | FDAAKGLPPENPGRVCAMLATADDPMWFSGRDVHGPTFHAEHAITRFD |
| Ga0307471_1010178771 | 3300032180 | Hardwood Forest Soil | ELGEFGFDASKALPVENPGRVCAMLATAKDPMYFSGKDVYGPAFHAEHANVQFEV |
| Ga0307471_1031370091 | 3300032180 | Hardwood Forest Soil | GFVATERMAAELAEFGFDASKGLPVENPGRVCAMLATAKDPLHFSGKDIFGPGFYAEHSLVRFET |
| Ga0306920_1033933641 | 3300032261 | Soil | GDYGFDATKALPVENPGRVCAMLATAKDPLHFSGRDILGPAFYAEHSQVRFD |
| Ga0335082_104235893 | 3300032782 | Soil | EMADFGFDASKGLPVENPGRVCAMLATARDPLYFSGRDIYGPTFYSEHELVDFD |
| Ga0335080_118777652 | 3300032828 | Soil | GDFGFDASKGLPVENPGRVCAMLATARDPLYFSGRDIYGPTFYSEHELVDFD |
| Ga0335070_111649581 | 3300032829 | Soil | VGTERMAIELGEFGFDASKALPVENPGRVCAMLATAKDPMFFTGRDVYGPTFFDEHRLVSFE |
| Ga0335084_112877613 | 3300033004 | Soil | SKGLPVENPGRVCAMLATARDPLYFSGRDIYGPTFYSEHELVDFD |
| Ga0334722_107740631 | 3300033233 | Sediment | ASKALAVENPGRVCAMLATATDPLWFSGRDVHGPTFHAEHALVRFDPA |
| Ga0316601_1017060652 | 3300033419 | Soil | LMPGFVGTERMAIELGEYGFDASKALPVENPGRVCAMLATAKDPMYFSGRDLRGPELYLEHSLLRFED |
| Ga0316613_108251571 | 3300033434 | Soil | RTAVELREYGFDPAKGLPVENPGRVCAMLATARDPMFFTGRDLRGPEFHAEHEQVQFAPPGDG |
| Ga0316624_101088561 | 3300033486 | Soil | AIELGDFGFDASRGLPVENPGRVCAMLATAHDPLVFSGRDIYGPTFYEDHAQTRFDWS |
| ⦗Top⦘ |