| Basic Information | |
|---|---|
| Family ID | F103728 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 101 |
| Average Sequence Length | 73 residues |
| Representative Sequence | MSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFG |
| Number of Associated Samples | 92 |
| Number of Associated Scaffolds | 101 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 90 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.45 |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (15.842 % of family members) |
| Environment Ontology (ENVO) | Unclassified (36.634 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (40.594 % of family members) |
| Predicted Topology & Secondary Structure | |
|---|---|
| Classification | Globular |
| Signal Peptide | No |
| Secondary Structure distribution | α-helix: 38.38%, β-sheet: 0.00%, Coil/Unstructured: 61.62% |
| Pfam ID | Name | % Frequency in 101 Family Scaffolds |
|---|---|---|
| PF13469 | Sulfotransfer_3 | 22.77 |
| PF13489 | Methyltransf_23 | 3.96 |
| PF00685 | Sulfotransfer_1 | 3.96 |
| PF07995 | GSDH | 1.98 |
| PF13692 | Glyco_trans_1_4 | 0.99 |
| PF05050 | Methyltransf_21 | 0.99 |
| PF12708 | Pectate_lyase_3 | 0.99 |
| PF14581 | SseB_C | 0.99 |
| COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds |
|---|---|---|---|
| COG2133 | Glucose/arabinose dehydrogenase, beta-propeller fold | Carbohydrate transport and metabolism [G] | 1.98 |
| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 100.00 % |
| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.84% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 14.85% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.91% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.95% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.96% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 3.96% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.97% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.97% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.97% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.97% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.97% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.97% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.98% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 1.98% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.98% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.98% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.99% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.99% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99% |
| Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 0.99% |
| Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grass Soil | 0.99% |
| Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.99% |
| Grass Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil | 0.99% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.99% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.99% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.99% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.99% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.99% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.99% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.99% |
| Agave | Host-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave | 0.99% |
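
The distribution column can be read back as whole member counts. The following is a minimal sketch, assuming each percentage is a share of the 101 family sequences listed under Basic Information (the ecosystem summary describes the top value as a percentage "of family members"). The names `FAMILY_SIZE`, `habitat_percentages`, and `percent_to_count` are illustrative and not part of the source page.

```python
# Number of sequences in the family, taken from the Basic Information table.
FAMILY_SIZE = 101

# Distribution values from the habitat table, in row order:
# six distinct leading values, then 6 rows at 2.97, 6 at 1.98, and 18 at 0.99.
habitat_percentages = (
    [15.84, 14.85, 8.91, 4.95, 3.96, 3.96]
    + [2.97] * 6
    + [1.98] * 6
    + [0.99] * 18
)


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a rounded percentage of the family into a whole member count."""
    return round(percent / 100 * total)


counts = [percent_to_count(p) for p in habitat_percentages]

# Under the stated assumption, the per-habitat counts should add up to the family size.
print(counts[:6], sum(counts))
```

If the counts did not sum to the family size, that would suggest the percentages are computed against a different denominator (for example, samples rather than sequences).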
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 2189573000 | Grass soil microbial communities from Rothamsted Park, UK - July 2010 direct MP BIO 1O1 lysis 0-21cm (T0 for microcosms) | Environmental | Open in IMG/M |
| 2189573004 | Grass soil microbial communities from Rothamsted Park, UK - FG2 (Nitrogen) | Environmental | Open in IMG/M |
| 3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental | Open in IMG/M |
| 3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300000890 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
| 3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental | Open in IMG/M |
| 3300003321 | Sugarcane bulk soil Sample H1 | Environmental | Open in IMG/M |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
| 3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental | Open in IMG/M |
| 3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005335 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG | Host-Associated | Open in IMG/M |
| 3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental | Open in IMG/M |
| 3300005356 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG | Host-Associated | Open in IMG/M |
| 3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental | Open in IMG/M |
| 3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005562 | Agave microbial communities from Guanajuato, Mexico - As.Ma.e | Host-Associated | Open in IMG/M |
| 3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental | Open in IMG/M |
| 3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated | Open in IMG/M |
| 3300005618 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 | Host-Associated | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300006173 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated | Open in IMG/M |
| 3300010040 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot55 | Environmental | Open in IMG/M |
| 3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental | Open in IMG/M |
| 3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental | Open in IMG/M |
| 3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
| 3300012898 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S194-509B-1 | Environmental | Open in IMG/M |
| 3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental | Open in IMG/M |
| 3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental | Open in IMG/M |
| 3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300012989 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MG | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated | Open in IMG/M |
| 3300015372 | Soil combined assembly | Host-Associated | Open in IMG/M |
| 3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated | Open in IMG/M |
| 3300015374 | Col-0 rhizosphere combined assembly | Host-Associated | Open in IMG/M |
| 3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental | Open in IMG/M |
| 3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental | Open in IMG/M |
| 3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental | Open in IMG/M |
| 3300019867 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3m1 | Environmental | Open in IMG/M |
| 3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental | Open in IMG/M |
| 3300019998 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m1 | Environmental | Open in IMG/M |
| 3300020006 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2 | Environmental | Open in IMG/M |
| 3300020018 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2 | Environmental | Open in IMG/M |
| 3300020062 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a1 | Environmental | Open in IMG/M |
| 3300021078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redo | Environmental | Open in IMG/M |
| 3300021339 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c1 | Environmental | Open in IMG/M |
| 3300021411 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3c2 | Environmental | Open in IMG/M |
| 3300021413 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1c1 | Environmental | Open in IMG/M |
| 3300021418 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3s2 | Environmental | Open in IMG/M |
| 3300021510 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_coex | Environmental | Open in IMG/M |
| 3300022534 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1 | Environmental | Open in IMG/M |
| 3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental | Open in IMG/M |
| 3300025903 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025905 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025919 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025930 | Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes) | Environmental | Open in IMG/M |
| 3300025938 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental | Open in IMG/M |
| 3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental | Open in IMG/M |
| 3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental | Open in IMG/M |
| 3300026508 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-A | Environmental | Open in IMG/M |
| 3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental | Open in IMG/M |
| 3300030917 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FB5 Emin (Eukaryote Community Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031538 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1 | Environmental | Open in IMG/M |
| 3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental | Open in IMG/M |
| 3300031854 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1 | Environmental | Open in IMG/M |
| 3300031938 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1 | Environmental | Open in IMG/M |
| 3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental | Open in IMG/M |
| 3300032000 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D3 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
|---|---|---|---|
| N55_09436730 | 2189573000 | Grass Soil | MSESSKRAAEWWSDPDRVVPGTQWLQVPGTSENMNRRATGDPEMNWITHSAALLAKFPKPIKALSLGC |
| FG2_07901860 | 2189573004 | Grass Soil | MSETTEKAAEGWSAPDRDVPGTQWLQIPGAVENMNRRATGDPEMDWITHSAGLLAKFANPIKALSLGCGFGVIERVLRRCDYCQL |
| F14TC_1013448561 | 3300000559 | Soil | MSETTEKAAEWWSDPDRDVPGTQWLQIPGAVKNMNRRATGDPEMDWITHSAGLLAKFAKP |
| JGI1027J11758_124059872 | 3300000789 | Soil | MVEGTKSDAAKKAGEWWSDPERQVTGTQWVEVPGTFENLNRRATGDPAIDWITHSGSLLATFTKPVKALSLGCGFGVIERILRRRDYCQLI |
| JGI11643J12802_107207782 | 3300000890 | Soil | MSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASLLASFTKPINLLSVGCGFGAIERLLRRRDYCQHI |
| JGI1027J12803_1016701444 | 3300000955 | Soil | MSKATKKAAEWWSDPQSEAPGTQWVQVPGVFESLNRRATGDPSIDWINHSASLLANFAKPIKALSAGCGFGGIERILRR |
| JGI1027J12803_1036274372 | 3300000955 | Soil | MSEPTKKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEVNWITHSAGLLAQFPKP |
| JGI10216J12902_1030194062 | 3300000956 | Soil | MSEATQKAAEWWSDPESGDSETQWVRVPGVAENMNRRATGDPAIDWIHHSAGLLASFAKPIKALSLGCGFGIIERVLRRSDFCQIIH |
| soilH1_104008121 | 3300003321 | Sugarcane Root And Bulk Soil | MTEPAKKAAEWWSDPESEARETQWVRVPGVQENMNRRATGDPEMDWISHSGGLLVKFAKPVKALSLGCGFGVIERVIRRR |
| Ga0062595_1009097301 | 3300004479 | Soil | MSEAVKKAAEWWSDPERGVTGTQWLDVPGAIENMNLRATGDPKLDWISHSASLLASLSKPVKALSVGCGFGVI |
| Ga0062592_1006703781 | 3300004480 | Soil | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRR |
| Ga0062591_1015970991 | 3300004643 | Soil | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDW |
| Ga0066683_100129841 | 3300005172 | Soil | MTDTSVSEAAKRSARWWNDPQSEAPGTQWVEVPGVAENINRRATGDPEIDWIGHSAGLLLKSKRPIEALSIGCGFGRI |
| Ga0066673_104002711 | 3300005175 | Soil | MAEPINTKAIKKVAERWGDPQSEPPGTQWVGVPGVAENINRRATGDPKIDWINHSGSLLARFKKPIKALSLGCGFGAIERILR |
| Ga0066675_109663321 | 3300005187 | Soil | MIETSVSEATKRAAKWWSDPQSEVPGTQWVEVPGVAETINRRATGDPEIDWISHSAGLLAKSKRPIKALSIGCGFGGIERLLRRRDYCQLIH |
| Ga0066388_1010619551 | 3300005332 | Tropical Forest Soil | MSESTEKAAEWWSVPDQNPPGTQWLQIPGATENMNRRATGDPEMDWITHSAGLLSKFQKPIKALSLGCGFGVIERVLR |
| Ga0066388_1069745412 | 3300005332 | Tropical Forest Soil | MNESTSEKAAEWWSDPGREIPGTQWLQVPGAIQNMNSRATGDPEMDWITHSAGLLAKFAKPVKALSLGCGFGVIERVLRRSD |
| Ga0070666_113635932 | 3300005335 | Switchgrass Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALL |
| Ga0070687_1005983211 | 3300005343 | Switchgrass Rhizosphere | MSETTEKAAEWWSDPGRDVPGTQWLQIPGAIENMNRRATGDPEMDWITHSAGLLSKF |
| Ga0070674_1012411931 | 3300005356 | Miscanthus Rhizosphere | MSDSTEKAAEFWSDPGRDVPGTQWLQIPGAIENMNLRATGDPEMDWITHSAALLAKFKKP |
| Ga0070710_107134182 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFGIIERVLRRCDYCQLIHGVDVAEGAIEGARKAAQDEGL |
| Ga0070711_1000807061 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSKTANKAAEWWSDPETEGPETHWVRVPGVVENMNRRATGDPAIDWISHSASLLARFAKPIKALSVGCGFGGIERALRRRN |
| Ga0070711_1017778771 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MSETTEKAAEWWSDPGRIAAGTQWLEIPGATENMNHRATGDAEMDWITHSAGLLAKFAKPIKALSL |
| Ga0066689_103579271 | 3300005447 | Soil | MSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSLGCGFGVIE |
| Ga0070679_1013154951 | 3300005530 | Corn Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRADY |
| Ga0066692_106731222 | 3300005555 | Soil | MSEATKKAAEWWSDPQSEAPETQWVRVPGVVQNMNRRATGDMAIDWINHSATLLSRFAKPIKALSIGCGFGIIERVL |
| Ga0058697_103618581 | 3300005562 | Agave | MSEAIRKAADWWSDPQSEAPETQWVRVPGVVENMNRRATGDPAIDWINHSATLLTSLAKPIKALSIGCGFGVIERTLRRQDFCQLIHGVD |
| Ga0066702_100192621 | 3300005575 | Soil | MIETSVSEATKRAAKWWSDPQSEVPGTQWVEVPGVAENINRRATGDPEIDWISHSAGLLAKSKRPIKALSIGCG |
| Ga0068859_1024447892 | 3300005617 | Switchgrass Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHS |
| Ga0068864_1020886491 | 3300005618 | Switchgrass Rhizosphere | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRR |
| Ga0066903_1067969781 | 3300005764 | Tropical Forest Soil | MSESTEKAAEWWSDPSHDVPGTQWLGVPGATENMNRRATGDPEMDWIAHSAALLSRFAKPIKALSLGCGFGVIE |
| Ga0070716_1017777441 | 3300006173 | Corn, Switchgrass And Miscanthus Rhizosphere | MSEPTKKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAKFAKPIKALSLGCGFGVIERV |
| Ga0066665_114995512 | 3300006796 | Soil | MSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSLGCGFGVI |
| Ga0075428_1004166252 | 3300006844 | Populus Rhizosphere | MSEASKKAAEWWSDPKSEAPETQWVRVPGVAENMNRRATGDPAINWINHSAGLLTSFAKPIKALSLSCGFGIIERVLRRSDFCQIIHGVDVAENAIESAR |
| Ga0075425_1007693661 | 3300006854 | Populus Rhizosphere | MSESTEKAAEWWSDPGRDVPGTQWLQVPGAIENMNLRATGDPEMDWITHSAGLLAKFAKPIKALSLGCGFGV |
| Ga0105245_104083172 | 3300009098 | Miscanthus Rhizosphere | MSKSTNKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALL |
| Ga0126308_103890232 | 3300010040 | Serpentine Soil | MSESTKKAADFWSDPGRDVPGTQWLHIPGAVENMNLRATGDPEMDWIT |
| Ga0126372_114927101 | 3300010360 | Tropical Forest Soil | MNEPTKKAAEWWSDPDSEAPETQWVRVPGVAENMNRRATGDPEMDWITHSAGLLAKFEKPVKALSLGCGFGVIERVLRRCDYCQLIHGL |
| Ga0126372_121391071 | 3300010360 | Tropical Forest Soil | MSESTEKAAEWWSDPSHDVPGTQWLGVPGATENMNRRATGDPEMDWITHSAGL |
| Ga0134125_126721512 | 3300010371 | Terrestrial Soil | MSEASRKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWISHSAGLLAKFAKPIK |
| Ga0105246_107458612 | 3300011119 | Miscanthus Rhizosphere | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYC |
| Ga0137364_100857841 | 3300012198 | Vadose Zone Soil | MSEASKKAAEWWSDPKSEAPETQWVRVPGVAENMNRRATGDPAINWINHSAGLLSGFAKPIKALSLGCGFGIIERVLRRSDFC |
| Ga0137399_115405881 | 3300012203 | Vadose Zone Soil | MSEATRKAAEWWSDPQSEASETQWVRVPGVAENMNRRATGDPAINWINHSAGLLSGFAKPIKALSLGCGFGIIERVL |
| Ga0137377_117671002 | 3300012211 | Vadose Zone Soil | MSESTKKAGEWWSDPDRDIPGTQWLQIPGAVENMNRRATGDPEMDWI |
| Ga0137371_104851731 | 3300012356 | Vadose Zone Soil | MIESTEKAAEWWSDSDRDVPGTQWLLVPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL |
| Ga0137360_117286022 | 3300012361 | Vadose Zone Soil | MSEATKKAAEWWSDPQSEAPETQWVRIPGVVENMNCRATGDPAMDWINHSAGLLASFAKPVKALSVGC |
| Ga0157293_101571162 | 3300012898 | Soil | MSKSTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKP |
| Ga0137413_100450403 | 3300012924 | Vadose Zone Soil | MSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASFLASFTKPINVLSVGCGFGTIERLLRRRDNCQQVNRVDIAGAVIEATTKTAEAERLEGLT* |
| Ga0137407_105513541 | 3300012930 | Vadose Zone Soil | MSESTEKAAEWWSDPDRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL |
| Ga0137407_123707451 | 3300012930 | Vadose Zone Soil | MSESTEKAAEWWSDPGRDVPGTQWLQIPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVL |
| Ga0126375_112249402 | 3300012948 | Tropical Forest Soil | MSESTEKAAEWWSDPDRDVPGTQWLQVPGAIENMNLRATGDPEMDWITHSAGLLAKFAKPIKALSPGCGFGVIE |
| Ga0164300_108473702 | 3300012951 | Soil | MSEATKKAAEWWSDPKSEAPETQWVRVPGVVENMNRRATGDPAIDWINHSAGLLTGFARPIKALSVGCGFGVIERTLRRHDFCQLIHGVDVAENA |
| Ga0164302_106522072 | 3300012961 | Soil | MSEPTKKAAEWWSDPESEAPETQWVRVPGVVENMNRRATGDPEMDWITHSAGLLAKFAK |
| Ga0134076_104729571 | 3300012976 | Grasslands Soil | MSEATRKAANWWSDPQSEAPETQWVRVPGISENMNRRATGDPAIDWIHHSAGLLRSFAKPIKALSIGCGFGI |
| Ga0164305_110691271 | 3300012989 | Soil | MVLFDMSETTEKAAEWWSDPGRIAAGTQWLEIPGATENMNHRATGDPEMDWITHSA |
| Ga0134078_100992572 | 3300014157 | Grasslands Soil | MAEGTKSDATKKVAEWWSDSQREVPGTQWVEVPGALENMNRRATGDPGIDWINHSASVLAHFKKPIKALSLECGFGLIERVLRRGNFCQLVHGVDVAEGAMKALGKRPKQRGWMV* |
| Ga0157379_122021331 | 3300014968 | Switchgrass Rhizosphere | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWI |
| Ga0132256_1020676731 | 3300015372 | Arabidopsis Rhizosphere | MSESTEKAAEWWSDPDRDVPGTQWLQVPGAIENMNRRATGEPEMNWITHS |
| Ga0132257_1028838001 | 3300015373 | Arabidopsis Rhizosphere | VNVTEKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFG |
| Ga0132255_1008783141 | 3300015374 | Arabidopsis Rhizosphere | MSESSKRAAEWWSDPDRDVPGTQWLQVPGTSENMNRRATGDPEMDWITHSAGLLAK |
| Ga0132255_1030306601 | 3300015374 | Arabidopsis Rhizosphere | MSKSTEKAAEWWSDPDREVPGTQWLLVPGASENMNRRATGDPEMDWITHSAALLAKFAKPIKALSLGC |
| Ga0132255_1041081712 | 3300015374 | Arabidopsis Rhizosphere | VNVTEKAAEWWSDPDRDVPGTQWLQIPGAVQNMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFGVIER |
| Ga0182041_115599591 | 3300016294 | Soil | MSTLKSDVSKKVAVSWSDPQSEAPGTQWVQVPGVKESVNRRATGDPAIEWIDHSASLLAGFTKPINVLSVGCGFGAIERLLR |
| Ga0134069_12071061 | 3300017654 | Grasslands Soil | MSEATRKAANWWSDPQSEAPETQWVRVPGISENMNRRATGDPAIDWIHHSAGLLRSFAKPIKALSIGCGFGIIERTLRRRDFCQ |
| Ga0184620_102236561 | 3300018051 | Groundwater Sediment | MTKTLESDAAEKVAVWWSDPQSEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASLLASFTKPIKVLSVGCGFGAIERLL |
| Ga0173482_103616932 | 3300019361 | Soil | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITH |
| Ga0193704_10420031 | 3300019867 | Soil | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLARFKKPIKVLSLGCGFGVIERVLRRSDSCQLIH |
| Ga0193707_10913922 | 3300019881 | Soil | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDW |
| Ga0193710_10189201 | 3300019998 | Soil | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRSDSCQLIHG |
| Ga0193735_10790522 | 3300020006 | Soil | MSEATKKAAEWWSDPQSEAPETQWVRVPGVVENMNYRATGDPAIDWINHSAGLLATFAKPVKALSA |
| Ga0193721_10616081 | 3300020018 | Soil | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAAL |
| Ga0193721_11731841 | 3300020018 | Soil | MSLGAPLQNAVLSRMDEVSKKVAEWWGDPQSEAPGTQWVEVPGISENTKFRASGDPAIDWVNHSASLLSRFTRPIKALSLGCGFGVIERILRRRDYCQLIHG |
| Ga0193724_10011912 | 3300020062 | Soil | MSEPTKKAAEWWSDPESEAPETQWVRVPGVLENMNRRATGDPEMDWITHSAGLLAKFAKPVKALSV |
| Ga0210381_100263543 | 3300021078 | Groundwater Sediment | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIK |
| Ga0193706_12018961 | 3300021339 | Soil | MRLKPKPLSNEALFRMSEATKKAAEYWSSAQSQAFGNNWVGVPGVVENMNRRASGDPAINWINHSAALLSRFAKPIKALSIGCGLGIIERVLRRHDFCQLIHGVDVAENSIKSARQT |
| Ga0193709_11034512 | 3300021411 | Soil | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLA |
| Ga0193750_10300211 | 3300021413 | Soil | MALLHMVEDTKIEAAKKAGEWWSDPEREIPGTQWLQIPGASENMNHRATGDPEMDWITHSASLLAKFAKPIKALSLGCGFGVIER |
| Ga0193695_11087801 | 3300021418 | Soil | VNVTEKAAEWWSDPERDVPGTQWLLVPGAVENMNRRATGDPAINWITHSAGLLAKFAKPIKALSLGCGFGIIERVL |
| Ga0222621_10379261 | 3300021510 | Groundwater Sediment | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRRSDSCQ |
| Ga0224452_11041762 | 3300022534 | Groundwater Sediment | MSESTKKAAEFWSDPGRDVPGTQWLQIPGAVENMNLRATGDPEMDWITHSAALLARF |
| Ga0179589_100407051 | 3300024288 | Vadose Zone Soil | MSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASFLASFTKPINMLSVGCGFGAIERLLR |
| Ga0207680_110292232 | 3300025903 | Switchgrass Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGC |
| Ga0207685_104356062 | 3300025905 | Corn, Switchgrass And Miscanthus Rhizosphere | MSESTEKAAEWWSDPDRDVPGTQWLQVPGAVENMNRRATGDPEMDWITHSAGLLAKFTKPIKALSVGCGFG |
| Ga0207657_113200332 | 3300025919 | Corn Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWIT |
| Ga0207701_104876622 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQFIH |
| Ga0207704_114611032 | 3300025938 | Miscanthus Rhizosphere | MSETTEKAAEWWSDPGRDVPGTQWLQIPGAIENMNRRATGDPEMDWITHSAGLLSKFPKPVKVLSLGCGFGVIERALRR |
| Ga0207711_118047601 | 3300025941 | Switchgrass Rhizosphere | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDWITHSAALLAKFKKPIKVLSLGCGFGVI |
| Ga0209473_10738142 | 3300026330 | Soil | MAEGTKSDATKKVAEWWSDSQREVPGTQCVEVPGALENMNRRATGDPGIDWINHSASVLAHFKKPIKALSLGC |
| Ga0209158_13385311 | 3300026333 | Soil | MIESTEKAAEWWSDPDRDVPGTQWLLVPGASENMNRRATGDPEMNWITHSAGLLAQFPKPVKVLSL |
| Ga0209057_10536202 | 3300026342 | Soil | MTDTSVSEAAKRSARWWNDPQSEAPGTQWVEVPGVAENINRRATGDPEIDWIGHSAGLLLKSKRPIEALSIGCGFGRIERLLRRSDYCQLIHGRGLAGFDV |
| Ga0257161_10638683 | 3300026508 | Soil | MSEATKKAAEYWSSAQSHAPGNNWLGVPGVVENMNRRATGDPAIDWINHSAALLSRFAKPIKALSIGCGFGIIERVLR |
| Ga0209474_100049641 | 3300026550 | Soil | VCYNDIIHMAEGTKSDATKKVAEWWSDSQREVPGTQCVEVPGALENMNRRATGDPGIDWINHSASLLAHFKKPIKALSLGCGFGVIERV |
| Ga0075382_108464541 | 3300030917 | Soil | MSETTEKAAEWWSNPDRDVPGTQWLQIPGAVENMNRRATGNPEMDWITHSAGLLAKFAKPIKALSLGCGFGVIERVLRRCDY |
| Ga0310888_106877361 | 3300031538 | Soil | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQL |
| Ga0307476_106530741 | 3300031715 | Hardwood Forest Soil | MNKAVKKAAEWWSDPQREVPGTQWLEIPGALQNMNRRATGDPAIDWINHSASLLANFKPPVKALSLGCGFGIIERVLRRQ |
| Ga0310904_105914891 | 3300031854 | Soil | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGL |
| Ga0308175_1019539052 | 3300031938 | Soil | MSESTKKAAEFWSDPDRDVPGTQWLHIPGAVENMNLRATGDPEMDWITHSAALLAKFKKPIKVLSLGCGFGVIERVLRSSDS |
| Ga0310912_106867141 | 3300031941 | Soil | MSEAIKKAAEWWSDPESEAPETQWVRVPGVEQNMNRRATGDPAIDWINHSASLLTSFAKPIKALSIGCGFGIIERRLRRNDFCQIIHGVDVAEN |
| Ga0310903_106248471 | 3300032000 | Soil | MSESTERAAEWWSDPGRDVPGTQWPQTPGASENMNRRATGDPEMDWITHSAGLLAKFKKPIKALSLGCGFGVIERVLRRNDYCQLIHG |
| Ga0307471_1010044041 | 3300032180 | Hardwood Forest Soil | MSEPTKKAAEWWSDPGSEAPETQWVRVPGVDENMNRRATGDPEMDWITHSAGLLAKFPKP |
| Ga0307471_1039661313 | 3300032180 | Hardwood Forest Soil | MSEVIRKAADWWSDPQSEAPETQWVRVPGVNENMNRRATGDPAIDWINHSAVLLSRFAKPIKALSVGCGFG |
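
For downstream use, rows of the member table can be parsed into records and their sequence lengths computed. This is a minimal sketch, not a tool provided by the source database: the `FamilyMember` class and `parse_member_row` function are hypothetical names, and the trailing `*` on some sequences is assumed to be a translation stop marker that should not count toward the length. The three sample rows are copied from the table above.

```python
from dataclasses import dataclass


@dataclass
class FamilyMember:
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str  # amino-acid sequence with any trailing '*' removed


def parse_member_row(row: str) -> FamilyMember:
    """Parse one '| Protein ID | Sample Taxon ID | Habitat | Sequence |' row."""
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    if len(cells) != 4:
        raise ValueError(f"expected 4 cells, got {len(cells)}: {row!r}")
    protein_id, taxon_id, habitat, sequence = cells
    # Assumption: a trailing '*' marks a stop codon and is not a residue.
    return FamilyMember(protein_id, taxon_id, habitat, sequence.rstrip("*"))


rows = [
    "| Ga0062591_1015970991 | 3300004643 | Soil | MSESTDKAAEFWSDPGRDVPGTQWLQIPGAIQNMNRRATGDPDMDW |",
    "| Ga0126308_103890232 | 3300010040 | Serpentine Soil | MSESTKKAADFWSDPGRDVPGTQWLHIPGAVENMNLRATGDPEMDWIT |",
    "| Ga0137413_100450403 | 3300012924 | Vadose Zone Soil | MSEAIRKAAEWWSDPESEAPGTQWVQVPGVKESINRRATGDPAIDWIDHSASFLASFTKPINVLSVGCGFGTIERLLRRRDNCQQVNRVDIAGAVIEATTKTAEAERLEGLT* |",
]

members = [parse_member_row(row) for row in rows]
lengths = {m.protein_id: len(m.sequence) for m in members}
mean_length = sum(lengths.values()) / len(lengths)

for protein_id, length in lengths.items():
    print(protein_id, length)
print(f"mean length of sample rows: {mean_length:.1f}")
```

Applied to all rows, the same loop would give the family-wide length distribution to compare against the reported average of 73 residues.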