| Basic Information | |
|---|---|
| Family ID | F074845 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 119 |
| Average Sequence Length | 75 residues |
| Representative Sequence | MAVAEAGSQPALRRRRLRLDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFDVYSLFPGPHGFTLRAWHDFLH |
| Number of Associated Samples | 112 |
| Number of Associated Scaffolds | 119 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 53.85 % |
| % of genes near scaffold ends (potentially truncated) | 9.24 % |
| % of genes from short scaffolds (< 2000 bps) | 10.08 % |
| Associated GOLD sequencing projects | 109 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.42 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (89.076 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (15.126 % of family members) |
| Environment Ontology (ENVO) | Unclassified (28.571 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (51.261 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Transmembrane (alpha-helical) | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 35.00% β-sheet: 0.00% Coil/Unstructured: 65.00% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.42 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 119 Family Scaffolds |
|---|---|---|
| PF13416 | SBP_bac_8 | 50.42 |
| PF07883 | Cupin_2 | 2.52 |
| PF09992 | NAGPA | 0.84 |
| PF13560 | HTH_31 | 0.84 |
| PF01797 | Y1_Tnp | 0.84 |
| PF11799 | IMS_C | 0.84 |
| PF05922 | Inhibitor_I9 | 0.84 |
| PF01019 | G_glu_transpept | 0.84 |
| PF03167 | UDG | 0.84 |
| COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds |
|---|---|---|---|
| COG0405 | Gamma-glutamyltranspeptidase | Amino acid transport and metabolism [E] | 0.84 |
| COG0692 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.84 |
| COG1404 | Serine protease, subtilisin family | Posttranslational modification, protein turnover, chaperones [O] | 0.84 |
| COG1573 | Uracil-DNA glycosylase | Replication, recombination and repair [L] | 0.84 |
| COG1943 | REP element-mobilizing transposase RayT | Mobilome: prophages, transposons [X] | 0.84 |
| COG3663 | G:T/U-mismatch repair DNA glycosylase | Replication, recombination and repair [L] | 0.84 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 89.08 % |
| All Organisms | root | All Organisms | 10.92 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300006058|Ga0075432_10559468 | All Organisms → cellular organisms → Bacteria | 518 | Open in IMG/M |
| 3300006577|Ga0074050_11222038 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 601 | Open in IMG/M |
| 3300006579|Ga0074054_11178446 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 540 | Open in IMG/M |
| 3300009088|Ga0099830_10873270 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 743 | Open in IMG/M |
| 3300010047|Ga0126382_11750995 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 582 | Open in IMG/M |
| 3300010321|Ga0134067_10497062 | All Organisms → cellular organisms → Bacteria | 505 | Open in IMG/M |
| 3300010322|Ga0134084_10152369 | All Organisms → cellular organisms → Bacteria | 778 | Open in IMG/M |
| 3300010375|Ga0105239_10096086 | All Organisms → cellular organisms → Bacteria | 3274 | Open in IMG/M |
| 3300010400|Ga0134122_13190042 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 514 | Open in IMG/M |
| 3300012351|Ga0137386_11294379 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 506 | Open in IMG/M |
| 3300012402|Ga0134059_1400079 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 515 | Open in IMG/M |
| 3300012907|Ga0157283_10365451 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 525 | Open in IMG/M |
| 3300012976|Ga0134076_10495861 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 557 | Open in IMG/M |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.13% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.29% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.08% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.40% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.56% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.88% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.04% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.04% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.36% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 3.36% |
| Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 2.52% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.68% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.68% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.68% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.68% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.68% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.84% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.84% |
| Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.84% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.84% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.84% |
| Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.84% |
| Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland | 0.84% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.84% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.84% |
| Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 0.84% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.84% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.84% |
| Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.84% |
| Visualization |
|---|
| Powered by ApexCharts |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2032320005 | Soil microbial communities from sample at FACE Site 5 Oak Ridge CO2- | Environmental | Open in IMG/M |
| 2170459007 | Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect in plug lysis (for fosmid construction) 10-21cm | Environmental | Open in IMG/M |
| 3300001537 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A20-65 cm-11A)- 1 week illumina | Environmental | Open in IMG/M |
| 3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005334 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 | Host-Associated | Open in IMG/M |
| 3300005347 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG | Host-Associated | Open in IMG/M |
| 3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental | Open in IMG/M |
| 3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental | Open in IMG/M |
| 3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental | Open in IMG/M |
| 3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental | Open in IMG/M |
| 3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental | Open in IMG/M |
| 3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental | Open in IMG/M |
| 3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300006038 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5 | Host-Associated | Open in IMG/M |
| 3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated | Open in IMG/M |
| 3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental | Open in IMG/M |
| 3300006058 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 | Host-Associated | Open in IMG/M |
| 3300006577 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHPA (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300006579 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtLAB (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300006605 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAB (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental | Open in IMG/M |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental | Open in IMG/M |
| 3300010375 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaG | Host-Associated | Open in IMG/M |
| 3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental | Open in IMG/M |
| 3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012402 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated | Open in IMG/M |
| 3300012903 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S134-311R-1 | Environmental | Open in IMG/M |
| 3300012907 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S044-104R-1 | Environmental | Open in IMG/M |
| 3300012909 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S149-409B-1 | Environmental | Open in IMG/M |
| 3300012915 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S103-311B-2 | Environmental | Open in IMG/M |
| 3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental | Open in IMG/M |
| 3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300013772 | Permafrost microbial communities from Nunavut, Canada - A10_80_0.25M | Environmental | Open in IMG/M |
| 3300014823 | Permafrost microbial communities from Nunavut, Canada - A3_80cm_0M | Environmental | Open in IMG/M |
| 3300015077 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S178-409R-2 (version 2) | Environmental | Open in IMG/M |
| 3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental | Open in IMG/M |
| 3300019867 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m1 | Environmental | Open in IMG/M |
| 3300020062 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a1 | Environmental | Open in IMG/M |
| 3300024245 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK18 | Environmental | Open in IMG/M |
| 3300025911 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025913 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025939 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025972 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes) | Environmental | Open in IMG/M |
| 3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental | Open in IMG/M |
| 3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
| 3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
| 3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental | Open in IMG/M |
| 3300027907 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300028043 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MG | Environmental | Open in IMG/M |
| 3300028138 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK25 | Environmental | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028717 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158 | Environmental | Open in IMG/M |
| 3300028718 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194 | Environmental | Open in IMG/M |
| 3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental | Open in IMG/M |
| 3300028811 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149 | Environmental | Open in IMG/M |
| 3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental | Open in IMG/M |
| 3300028872 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_204 | Environmental | Open in IMG/M |
| 3300031114 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental | Open in IMG/M |
| 3300032009 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f19 | Environmental | Open in IMG/M |
| 3300032094 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300033290 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15 | Environmental | Open in IMG/M |
| 3300033805 | Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_50_10 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| FACEORA_3897750 | 2032320005 | Soil | MAVAEAGSQPALRRRRLRLDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFNVYSLFPGPHGFTLRAWHDFLHSSVYLAL |
| L02_07089750 | 2170459007 | Grass Soil | VQTEARLRRRGLPRLDLTTPGLLGLPLSWLLVFFVVPIAIVAAYSFDVYSLFPGPHGFTLSGWRGFVHDAV |
| A2065W1_111696231 | 3300001537 | Permafrost | MTVEGEAGPPPVHLPRRRRVRLDLTTPTLLGLPVAWLVVFFLVPIGIVAAYSFDVYSLFPGKHGFTLTAWREFVHSSVYLALFW |
| Ga0062591_1020760472 | 3300004643 | Soil | VTDLVSPAPRRRRIDLTTPSLLGLPLAWLGVFFLAPIAIVLLYSFNVYSLYPGEQGFTLKAWHDFFHSS |
| Ga0066685_105481782 | 3300005180 | Soil | MAARRERVRLDLTTPTLLGLPVAWLVVFFIVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVH |
| Ga0068869_1020158291 | 3300005334 | Miscanthus Rhizosphere | MAITEAREPPAQRRRRRLRLDLSAPSLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGPHHF |
| Ga0070668_1001766931 | 3300005347 | Switchgrass Rhizosphere | MAVAEAGSQPALRRRRLRLDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFDVYSLFPGPHGFTLRAWHDFL |
| Ga0070713_1000479211 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MAIADAGSSRERVPRARRLRLDLTTPTLLGLPLAWLAVFFLVPIAIVAAYSFDVYSLNSGPHGFTLTAWHD |
| Ga0070711_1005860952 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MAIADAGSSRERVPRAGRLRLDLTTPTLLGLPLAWLAVFFLVPIAIVAAYSFDVYSLNSGPHGFTLT |
| Ga0070705_1001830941 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVEGEAGPLPARLPRRRRLRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLA |
| Ga0070708_1016088952 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VATEARAGSPPRAGLRGRLRLDLTTPSLLGLPLAWLAVFFLAPIGIVAGYSFDAFSLNPGPHALTLRAWHDFLHSATYLRLFWSSVKLSLIVSAIV |
| Ga0070706_1016231791 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVAEAGSQPALRRRRLRVDLTTPTLLGLPLAWLGVFFLVPITIVAAYSFNVYSLFPGPHGFTLRAWHDFLHSSVYL |
| Ga0070707_1009519661 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MAVEAEAGGAPAPLRRRRLRVDLSTPGLLGLPIGWLVVFFIAPIGIVAAYSFDVLSLLPGGHAFTLTAWHDFLHSSV |
| Ga0073909_101778681 | 3300005526 | Surface Soil | MALSEVRDPPARLGRRRWLRLDLTTPTLLGLPLAWLVVFFLAPIAIVAAYSFDVYSLNPGPHGFTLTAWHDFLHSSVYLRL |
| Ga0070695_1006838712 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MAIAARASPSARRRRLRLDLTTPTLLGLPLAWLAVFFLAPIAIVAAYSFNVYSFESGPHGFTLDAW |
| Ga0066692_102361262 | 3300005555 | Soil | MALTEVRDPPARLGRRRWLRLDLTTPTLLGLPLAWVAVFFLVPIAIVAAYSFDVYSLNPGPHGFTLQAWHAFLHSSVYLG |
| Ga0066699_105585442 | 3300005561 | Soil | VTLVDAGSSPELVPRPRRLRLDLTTPALLGLPLAWLAVFFLVPIAIVAAYSFDVYSLNSGPHGFTLTAWHDFL |
| Ga0066703_108108932 | 3300005568 | Soil | VTRHRRFSSRLDLTTPGLLGLPIGWLLVFFLAPIAIVAAYSFDVYSLNPGPHGFTTTAWHDFLHSSVYL |
| Ga0066705_108383631 | 3300005569 | Soil | MALVDARSSPEDVPRPRRLRLDLTTPTLLSLPLAWLAVFFLVPIAIVAAYSFDVYSLNPGPHGFTLTAWHDFLHSS |
| Ga0068864_1018726911 | 3300005618 | Switchgrass Rhizosphere | MALLHASSSAGRLGRPRRRRLDLTTPALLGLPLAWLAAFFLVPIAIVAAYSFDVYSLDPGPHGFTLTAWHDFLHSSVYLKLFW |
| Ga0066905_1020541452 | 3300005713 | Tropical Forest Soil | VAAEAPAEVSPPRVRRRRLRVDLTTPGLLGLPLAWLAIFFLVPIGIVGAYSVDALYLPLGLGPHPVTLKAWH |
| Ga0075365_112013802 | 3300006038 | Populus Endosphere | MAIAEAREPPAPRRRRLRLDLTTPSLLGLPLVWLAVFFLVPIAIVGAYSFDVYSLDPGPHGFTLTAWHDF |
| Ga0075417_104531461 | 3300006049 | Populus Rhizosphere | MAIAEAREPPAPRRRRLRLDLTTPSLLGLPLVWLAVFFLVPIAIVGAYSFDVYSLDPGPHGFTLTAWHD |
| Ga0075028_1003262841 | 3300006050 | Watersheds | LARPRPDLTTPSLLGLPLAWLAVFFVLPILIVAAYSFDIYSLNPGPHGFTLGAWRAFVHDSVYLRLFWK |
| Ga0075432_105594681 | 3300006058 | Populus Rhizosphere | MAISEAREPPARTSRRRLRLDLSTPSLLGLPLAWLVVFFLVPIGIVAAYSFNVYSLDPGPHHFTLTAWHDFLH |
| Ga0074050_112220382 | 3300006577 | Soil | MAVEAGASPQPRRRRRPHVDLTTPSLLGLPLGWLAVFFLAPIVIVGAYSLGIFLLEESGRHPATLTAWHDFLHS |
| Ga0074054_111784461 | 3300006579 | Soil | MAVEAEAVVHPPRRRRRLRVDLTTPSLLGLPLAWLVVFFLVPIAIVGAYSLGVLSLDEPGRHPFTLTAWHDFLHS |
| Ga0074057_112922731 | 3300006605 | Soil | MAVEARAEEPPARQRVRRRSRPNLTTPSLLGLPLAWLAVFFVVPIAIVAAYSFDVYTLFPGPHGFTLA |
| Ga0066658_107716092 | 3300006794 | Soil | MTLVDAGSSPEHVPRPRRLRLDLTTPALLGLPLAWLAVFFLVPIAIVAAYSFDVYSLNSGPHGFTLMAWHD |
| Ga0066659_106116761 | 3300006797 | Soil | MALEGEAGTPPVRLPRRGRARPDVTTPTLLGLPVAWLVVFFVVPIAIVAAYSFDVYSLFPGKHGF |
| Ga0079220_113312452 | 3300006806 | Agricultural Soil | MSIAEAREPPAPRRRRRRLDLTTPTLLGPPVAWLAVFFLAPIAIVAAYSLNVYSLDPGPHSLTLSAWHD |
| Ga0075433_112760382 | 3300006852 | Populus Rhizosphere | MAIAEAREPPAPRRRRPRLDLTTPTLLGLPLAWLVVFFLAPIAIVAAYSLDVYSLDPGPHSLTLSAWHDFVHSSIYLK |
| Ga0075425_1008257082 | 3300006854 | Populus Rhizosphere | MAISEAREPPARTSRRRLRLDLSTPSLLGLPLAWLVVFFLVPIGIVAAYSFNVYSLDPGPHHFTLTAWHDFLHGSVY |
| Ga0075425_1030727801 | 3300006854 | Populus Rhizosphere | VAVEAPAAPPPSGIRRRRLRFDLTTPGLLGLPLGWLVAFFLLPIAIVGAYSFDVYSINPGPHGFTLT |
| Ga0099830_106574212 | 3300009088 | Vadose Zone Soil | MAAEAKADVPPAQRQRRLRPDLTTPGLLGLPLAWLVVFFVVPIAIVAGYSFDVYSLFPGKHGFTLGAWRGFMHDPVYLRLFW |
| Ga0099830_108732702 | 3300009088 | Vadose Zone Soil | MAVEGEAGTPPVRLARRRRVRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAAWRAFAHSHS* |
| Ga0099827_110315772 | 3300009090 | Vadose Zone Soil | MAVEGEAGTPPVRLPRRGRVRLDLTTPTLLGLPVAWLVVFFLVPIAIVVAYSFDVYSLFPGKHGFTLAAWREFVHSSVYLALFWKS |
| Ga0105247_103086821 | 3300009101 | Switchgrass Rhizosphere | VGRGQGVLSVPEAQARRRRLRLDPTTPALLGLPLAWLVVFFLVPIAIVGLYSVGRLSLDPGPHAITLEAWRAFLHSPIYLKLFWKS |
| Ga0066709_1043888312 | 3300009137 | Grasslands Soil | VATDAPAAVSPPRAGGRRRLRLDPTTPSLLGLPIGWLIVFFVVPIAIVAAYSFDVYSLIPGPHRFTLTAWHDFLH |
| Ga0126382_117509951 | 3300010047 | Tropical Forest Soil | MPFAETGDAPPHPRRRRFRPDLTTPTLLGLPVAWLGVFFLVPIGIVAAYSVNVYSLDPGPHEFTTEAWRSFLHGS |
| Ga0134070_102223592 | 3300010301 | Grasslands Soil | MAIADAGSSPEHIPRRRRLRVDLTTPTLLGLPLAWLAVFFLVPIGIVAAYSFDVYSLDPGPHSFTLT |
| Ga0134067_104970622 | 3300010321 | Grasslands Soil | MAIADAGSSSQRAFRPRRLRLDLTTPTLLGLPLAWLGVFFLVPIAIVAAYSFDVYSLDPGPHGFTLTAWHDFLHSSVDLALFWKSCQAVSVKPCGPGASE* |
| Ga0134084_101523691 | 3300010322 | Grasslands Soil | MAVEGEARVPAVSLSSRRRRRLDAAAPALLGLPIAWLVAFFLVPIAIVALYSFDVYSLFPGTHGFTLAAWK |
| Ga0134062_105122102 | 3300010337 | Grasslands Soil | VATTEAGAAAPLRRRRHLRLDLSTPALLGLPLGWLAVFFLVPIGIVAAYSFNVYSIDPGPHSFTLKAWH |
| Ga0134128_128340952 | 3300010373 | Terrestrial Soil | MAVAETPVPADTRRRRRRLDLTTPALLGLPLGWLVVFVLARIAIVGAYSFDVYSLDPGPHGFTLTAWREFLHGGI* |
| Ga0105239_100960861 | 3300010375 | Corn Rhizosphere | MAIAEAREPPAPRRRRLRLDLTTPSLLGLPLAWLAVFFLVPIAIVSAYSFNVYSLDPGPHGFTLTAWHDFIHS |
| Ga0105239_108011492 | 3300010375 | Corn Rhizosphere | MAVAEAGSQPALRRRRLRVDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFDVYSLFPGPHGFTLRAWHDFLHSSVYLALFW |
| Ga0134122_131900421 | 3300010400 | Terrestrial Soil | MAITEAREPPAQRRRRRLRLDLSTPTLLGLPLAWLAAFFLVPIGIVAAYSFNLYPLDPGAHHFTLAGWHDFLHGS |
| Ga0137392_112419891 | 3300011269 | Vadose Zone Soil | VGRSEGLLTSAAGDTPAPSRRRRLLRLDLTTPGLLGLPLAWLAVFFFVPIAIVAAYSFDVYSLNPGPHGFTLGGWRRFVHDPVYLKLFWKS |
| Ga0137393_102440381 | 3300011271 | Vadose Zone Soil | MAVEGEAGTPPVRLPRRRRVRLDLTTPTLLGPPVAWLVVFFLVPIAIVAAYSFDLYSLFPGKHGFTLAAWRAFAHSHS* |
| Ga0137389_104062141 | 3300012096 | Vadose Zone Soil | MAVEGEAGTPPVRLQRRRRVRLDLTTPTLLGLPVAWLIVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAAWRAFAHSHSSQPPASRC* |
| Ga0137364_107591481 | 3300012198 | Vadose Zone Soil | MAVDAKAATVRSRRKRRRLDLTTPGLLGLPLAWLAVFFVAPIAIVAAYSVDALSLYPGAHPLTLQAWHDFLHSAVYLKLFWKRGKMWWPASPLSVLLAF |
| Ga0137365_100891361 | 3300012201 | Vadose Zone Soil | MAIEGEAGAPPVRLPRRRRVRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAAWHEFVHSSVYLALF |
| Ga0137379_102864161 | 3300012209 | Vadose Zone Soil | MATAVRASPAPSRRRRLRLALTTPTLLGLPLAWLAVFFLVPIAIVAAYSFDVYSIDPGPHGFTLEAWHHFLH |
| Ga0137378_102129211 | 3300012210 | Vadose Zone Soil | MATAVRASPAPSRRRRLRLALTTPTLLGLPLAWLAVFFLVPIAIVAAYSFDVYSLNPGPHGFTLTAWHDFLHSSVYLRLFWK |
| Ga0137377_100466561 | 3300012211 | Vadose Zone Soil | MALTEVRDPPARLGRRRWLRLDLTTPTLLGLPLAWVAVFFLVPIAIVAAYSFDVYSLNPGPHGFTLT |
| Ga0137377_106728791 | 3300012211 | Vadose Zone Soil | MALSEVRDSPPARLGRRRWLRLDLTTPTLLGLPLAWVAVFFLVPIAIVAAYSFDVYSLNPGPHGFTLTAWHDFLHS |
| Ga0137370_102059981 | 3300012285 | Vadose Zone Soil | MAVEGEAGVPAVSASGRRRTRLDLTAPALLGFPVVWLVAFFLVPIAIVAAYSFDVYSLFPGKHGFTVAAWRAFLHSS |
| Ga0137386_112943792 | 3300012351 | Vadose Zone Soil | MAVEGEAGTPPVRLPRRGRARLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVHSSVYL |
| Ga0137384_106230552 | 3300012357 | Vadose Zone Soil | MVGAKTGGSAAPHRRRVRLDLTTPGLLSLPLAWLIVFFVVPIGIVAAYSVDALSLYPGAHAVTLSAWHDFLHSS |
| Ga0134059_14000791 | 3300012402 | Grasslands Soil | MAVEGEAGTPAVGLSSRRRWRRDLTTPTLLGLPVAWLVVFFVVPIAIVAAYSFDVYSLFPGKHGFTFAAWNSVAHDSIYL |
| Ga0150984_1099567012 | 3300012469 | Avena Fatua Rhizosphere | MGVETRADTPPAPQRRRRPRLDLTTPTLLGPPLAWLAVFFVVPIAIVGAYSVNALSFDPGPHAVTLSAWHSFL |
| Ga0157289_102485241 | 3300012903 | Soil | VGRGQGVLSVPEAQARRRRLRLDFTTPGLLGLPLAWLAVFFVVPIAIVTCYSLNVYSIDPGPHSLTLSAWHDFLHSSIYLKLFWK |
| Ga0157283_103654512 | 3300012907 | Soil | MAIAEAREPPAPRRRRRRLDLSTPTLLGLPLAWLAVFFVVPIALVTCYSLNVYSIDPGPHSLTLSAWHDFLHSSI |
| Ga0157290_104257282 | 3300012909 | Soil | MAIAEAREPPAPRRRRLRLDFTTPSLLGLPLAWLAVFFLVPIAIVSAYSFDVYSLDPGPHGFTLTAWHDFIH |
| Ga0157302_102354222 | 3300012915 | Soil | MAIAETRAPAEPRRRRLRLDLSTPALLGLPLGWLVLFFLAPIAIVAAYSFDVYSLDPGPHGFTVAAW |
| Ga0137395_105576022 | 3300012917 | Vadose Zone Soil | MAVRRGRVRLDLTTPTLLGLPVAWLVVFFIVPIAIVAAYSFDVYSLFPGKHGFTLAAWRAFAHSHS* |
| Ga0126375_104008882 | 3300012948 | Tropical Forest Soil | MAIAEAREPPAPRRRRLRLDLTTPTLLGLPLAWLAVFFVAPIAIVTCYSLNVYSLDPGPHSLTLSAWHDFLHSSIYLKLFW |
| Ga0164301_107918791 | 3300012960 | Soil | MALSEVHDPPARLGRRRWLRLDLTTPTLLGLPLAWLVVFFLAPIAIVAAYSFDVYSLNPGPHGFTLTAWHDFLHSSVYL |
| Ga0134077_103846542 | 3300012972 | Grasslands Soil | MAVEAETEVPPERRRPRRHRRPDLTTPGLLGLPLAWLVVFFVVPIAIVAAYSFDALSLYPDAHPLTLRAWHDFLHSSIYLR |
| Ga0134076_104958611 | 3300012976 | Grasslands Soil | MALAEAGDSPVRLRRRRLRLDLTTPTLLGLPLAWLAVFFVVPIAIVAAYSFDVYSLYPGTHGFTLAAWHDFVHSSVYLKLFWK |
| Ga0134087_104378763 | 3300012977 | Grasslands Soil | MALAEAGDSPVPLRRRRLRLDLTTPTLLGLPLAWLAVFFVVPIAIVAAYSFDVYSLFPGKHGFTLAAW |
| Ga0120158_100467784 | 3300013772 | Permafrost | MARRRRRLDLATPSLLGLPLAWLAVFFVLPIAIVAAYSFDVYSLNPGPHGFTLGAWRSFVHDAV |
| Ga0120170_11137641 | 3300014823 | Permafrost | MTVEGEAGPPPVHLPRRRRVRLDLTTPTLLGLPVAWLVVFFIVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVHSSVY |
| Ga0173483_1000104613 | 3300015077 | Soil | MAIAEAREPPAPRRRRLRLDFTTPSLLGLPLAWLAVFFLVPIAIVSAYSFDVYSLDPGPHGFTLTAWHDF |
| Ga0173483_100469593 | 3300015077 | Soil | VTDLVSPAPRRRRIDLTTPSLLGLPLAWLGVFFLAPIAIVLLYSFNVYSLYPGEQGFTLKAWHDFFHSSLYLKLFWK |
| Ga0134072_101935502 | 3300015357 | Grasslands Soil | MALAEAGDSPVRLRRRRLRLDLTTPTLLGLPLAWLAVFFVVPIAIVAAYSFDVYSLNPGPHGFTLAAWRAFAHSSVY |
| Ga0134089_102074651 | 3300015358 | Grasslands Soil | MAVEGEAGPPPVRLPRRRRLRLDLTTPTLLGLPVAWLVVFFVVPIAIVAAYSFDVYSLFPGKHGFTLAAW |
| Ga0066667_113496941 | 3300018433 | Grasslands Soil | VALADARTSPEPVPRRRRLRLDLTTPSLLGLPLAWLGVFFLVPIAIVAAYSFDVYSLDPGPHGFTLTAWHDFLH |
| Ga0066662_114036182 | 3300018468 | Grasslands Soil | MAIEGEAGVPAVSLSRRRRMRLGLVSPALLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLDAWRAFLHSSIYLGKSE |
| Ga0066669_120047051 | 3300018482 | Grasslands Soil | MAVEGEAGVPAAGLSGRRRRRFDLAAPALLGLPVAWLVVFFVVPIMIVAAYSFDVYSLFPGKHGFTLAA |
| Ga0173482_103250512 | 3300019361 | Soil | MAVPRRRPRLDLTTPTLLGLPVAWLLVFFIVPVAIVALYSFDVYSLFPGKHGFTLSAWHEFLHSSVYLK |
| Ga0193704_10135093 | 3300019867 | Soil | MAVEGEAGPPPVRLPRPRRLRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAAWR |
| Ga0193724_10132961 | 3300020062 | Soil | VGRGQGVLSVPEAQARRRRLRLDLTTPTLLGLPLAWLTVFFLVPIAIVGLYSVGRLSLDPGPHAVTLAAWHDFLHSPIYLK |
| Ga0247677_10059203 | 3300024245 | Soil | LRPSRRRRLRLDLTTPGLLAPPLAWLGVFFVVPIAIVGAYSFDVYSLDPGPHGFTTDAWHAFLHSSVYL |
| Ga0207654_100996741 | 3300025911 | Corn Rhizosphere | MAIAEAREPPAPRRRRRRLDLTTPTLLGLPLAWLVVFFLAPIAIVAAYSLDVYSLDPGPHSLTLSAWHDFVHSSIYL |
| Ga0207695_113604391 | 3300025913 | Corn Rhizosphere | MAITEAREPPAQRRRRRLRLDLSTPTLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGAHHFTLA |
| Ga0207663_104675622 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MALDTDAGRPPAPVRRRRGRPDLTTPGLLGLPLSWLAVFFVVPIAIVAAYSVDLYSLFPGPHGFTLAGWRSFFHDPVYLRL |
| Ga0207687_107429221 | 3300025927 | Miscanthus Rhizosphere | MAITEAREPPAQRRRRRLRLDLSTPTLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGAHHFTLAG |
| Ga0207704_105969651 | 3300025938 | Miscanthus Rhizosphere | MAITEAREPPAQRRRRRLRLDLSAPSLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGPHHFTLAGWHDFLHGSVY |
| Ga0207665_110794342 | 3300025939 | Corn, Switchgrass And Miscanthus Rhizosphere | VAVTEAPVSPPRARARRRPRFDLTTPSLLGLPLAWLGVFFLAPVAIVAAYSFDVYSLDPGPHGFTVDAWRAFLHS |
| Ga0207668_101444351 | 3300025972 | Switchgrass Rhizosphere | MAVAEAGSQPALRRRRLRLDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFDVYSLFPGPHGFTLRAWHDFLH |
| Ga0209761_11874661 | 3300026313 | Grasslands Soil | MALEGEAGTPPVRLPRRGRARPDVTTPTLLGLPVAWLVVFFVVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVHSS |
| Ga0209761_12674431 | 3300026313 | Grasslands Soil | MAGEGEAGRVPVRLPRRRRVRLDLTTPTLLGLPVAWLVVFFIVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVHS |
| Ga0209801_10975652 | 3300026326 | Soil | MAVEGEAGVPAVSASGRRRTRLDLTAPALLGFPVVWLVAFFLVPIAIVAAYSFDIYSLFPGKHGFTLAAWRA |
| Ga0209473_10380931 | 3300026330 | Soil | MPRRRSLLRVDLATPALLGLPLAWLGVFFVVPIAIVAAYSFDVYSLEPGPHGFTLAAWRAFAHSSIYLALFWK |
| Ga0209267_12485872 | 3300026331 | Soil | MSIVARGSPAAPRRRRLRLALTTPALLGLPLAWLAVFFLVPIAIVAAYSFDVYSIDPGPHGFTLEAWRHFL |
| Ga0209056_102346641 | 3300026538 | Soil | MAVEGEAGPPPARLPRRRRLRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFT |
| Ga0209474_106438861 | 3300026550 | Soil | MSVVARASPAAPRRRRLRLALTTPALLGLPLAWLAVFFLVPIAIVAAYSFDVYSLDPGPHGFTLEAWRHFAHSSIYLSLF |
| Ga0207428_108121922 | 3300027907 | Populus Rhizosphere | MAISEAREPPARTSRRRLRLDLSTPSLLGLPLAWLVVFFLVPIGIVAAYSFNVYSLDPGPHHFTLTAWHDFLHGSV |
| (restricted) Ga0233417_101218662 | 3300028043 | Sediment | VATDSPSAVSPAPARGRRRLRLDLTTPALLGLPLGWLAVFFLVPIGIVAAYSVDALHLPLFPGPHPITLQAWHDF |
| Ga0247684_10119601 | 3300028138 | Soil | MAIAEAREPPAPRRWRRRLDLTTPTLLGLPLAWLVVFFLAPIAIVAAYSLDVYSLDPGPHSLTLSAWHDFVHSSI |
| Ga0137415_111130331 | 3300028536 | Vadose Zone Soil | MAVESRAEDLPAQRRGRRRQRPDLTKPSLLGLPLAWLAVFFVVPIAIVAAYSFDVFSFGTGSHAFTLQAWHDFLSNGVYLRLFW |
| Ga0307298_100019991 | 3300028717 | Soil | MAITEAREPPARTSRRRLRLDLSTPSLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGPHHFTLAGWHDFLHGSVYLSLFWK |
| Ga0307298_100021341 | 3300028717 | Soil | MAVEGEAGPPPVRLPRPRRLRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFTLAA |
| Ga0307307_102492571 | 3300028718 | Soil | MALVEAGDSQARLPGRRRLRLDLTTPTLLGLPLAWLIVFFVVPIAIVAAYSVDVYSLNPGPHAFTLAAWHDFLHSSVYLKLFWKS |
| Ga0307323_103305632 | 3300028787 | Soil | MAITEAREPPAQRRRRRLRLDLSTPSLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGPHHFTLAGWH |
| Ga0307292_102680801 | 3300028811 | Soil | MAIAAGTSPARRRRRFRLDLTTPTLLGLPLAWLAVFFVVPIAIVAAYSFDVYSIDPGPHGFTLTAWHDFLHSSIYLKLF |
| Ga0307312_104877481 | 3300028828 | Soil | VGRSEGLLTSVAGDTPAPSRRRRLLRLDLTTPGLLGLPLAWLVVFFVVPIAIVAAYSFDVYSLFPGPHGFTFAGWRGFFHQAIYLR |
| Ga0307314_101596612 | 3300028872 | Soil | MAITEAREPPARTSRRRLRLDLSTPSLLGLPLAWLAAFFLVPIGIVAAYSFNLYSLDPGPHHFTLAGWHDFLHGSVYL |
| Ga0308187_100810041 | 3300031114 | Soil | MAVEGEAGPPPVRLPRPRRLRLDLTTPTLLGLPVAWLVVFFLVPIAIVAAYSFDVYSLFPGKHGFT |
| Ga0310915_109017472 | 3300031573 | Soil | MAVDSGARLPARRAPLRTRARLGLVTPGLLGLPVAWLVIFFIVPIAIVAAYSFDLYSLFPGPHGFTFGAWRSFVHDRVYLRLFWKS |
| Ga0307468_1013692961 | 3300031740 | Hardwood Forest Soil | MAVAEAGSQPALRRRRLRVDLTTPTLLGLPLAWLGVFFVVPIAIVAAYSFDVYSLYPGPHGFTLSAWHSFVHDPVYLRLFWK |
| Ga0310910_112721492 | 3300031946 | Soil | MAITEAGETPARSPRRRRLRFDLTTPTLLGPPVAWLVVFFVVPIAIVAAYSFNVYSLYPGQQGFTL |
| Ga0318563_100874631 | 3300032009 | Soil | MAVDSGARLPARRAPLRTRARLGLVTPGLLGLPVAWLVIFFIVPIAIVAAYSFDLYSLFPGPHGFTFGAWRSFVHDRVYLRLF |
| Ga0318540_102226932 | 3300032094 | Soil | MAVDSGARLPARRAPLRTRARLGLVTPGLLGLPVAWLVIFFIVPIAIVAAYSFDLYSLFPGPHGFTFGAWRSFVHDRVYLRLFWK |
| Ga0307472_1005910071 | 3300032205 | Hardwood Forest Soil | VTTRRERVRLDLTTPTLLGLPVAWLVVFFIVPIAIVAAYSFDVYSLFPGKHGFTLAAWREFVH |
| Ga0318519_106146861 | 3300033290 | Soil | MAVDSGARLPARRAPLRTRARLGLVTPGLLGLPVAWLVIFFIVPIAIVAAYSFDLYSLFPGPHGVTF |
| Ga0314864_0203769_282_518 | 3300033805 | Peatland | VPEDLMAVDSTARLPAKRLPLRTRARLGMVTPGLLGLPVAWLVVFFVVPIAIVAAYSFDVYSLNAGPHGFTVSAWQSFV |
| ⦗Top⦘ |