| Basic Information | |
|---|---|
| Family ID | F100615 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 102 |
| Average Sequence Length | 116 residues |
| Representative Sequence | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKIRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Number of Associated Samples | 87 |
| Number of Associated Scaffolds | 102 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Bacteria |
| % of genes with valid RBS motifs | 72.55 % |
| % of genes near scaffold ends (potentially truncated) | 30.39 % |
| % of genes from short scaffolds (< 2000 bps) | 67.65 % |
| Associated GOLD sequencing projects | 86 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.58 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Bacteria (100.000 % of family members) |
| NCBI Taxonomy ID | 2 |
| Taxonomy | All Organisms → cellular organisms → Bacteria |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (8.823 % of family members) |
| Environment Ontology (ENVO) | Unclassified (20.588 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (28.431 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 11.19% β-sheet: 18.18% Coil/Unstructured: 70.63% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.58 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 102 Family Scaffolds |
|---|---|---|
| PF00293 | NUDIX | 40.20 |
| PF04055 | Radical_SAM | 11.76 |
| PF00535 | Glycos_transf_2 | 2.94 |
| PF12724 | Flavodoxin_5 | 1.96 |
| PF02397 | Bac_transf | 0.98 |
| PF12697 | Abhydrolase_6 | 0.98 |
| PF13394 | Fer4_14 | 0.98 |
| PF10531 | SLBB | 0.98 |
| PF03460 | NIR_SIR_ferr | 0.98 |
| PF01979 | Amidohydro_1 | 0.98 |
| PF04185 | Phosphoesterase | 0.98 |
| PF01370 | Epimerase | 0.98 |
| PF13654 | AAA_32 | 0.98 |
| PF04932 | Wzy_C | 0.98 |
| PF10405 | BHD_3 | 0.98 |
| COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds |
|---|---|---|---|
| COG2148 | Sugar transferase involved in LPS biosynthesis (colanic, teichoic acid) | Cell wall/membrane/envelope biogenesis [M] | 0.98 |
| COG3307 | O-antigen ligase | Cell wall/membrane/envelope biogenesis [M] | 0.98 |
| COG3511 | Phospholipase C | Cell wall/membrane/envelope biogenesis [M] | 0.98 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| All Organisms | root | All Organisms | 100.00 % |
| Unclassified | root | N/A | 0.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300000364|INPhiseqgaiiFebDRAFT_101637485 | All Organisms → cellular organisms → Bacteria | 2709 | Open in IMG/M |
| 3300000364|INPhiseqgaiiFebDRAFT_105886408 | All Organisms → cellular organisms → Bacteria | 2522 | Open in IMG/M |
| 3300000789|JGI1027J11758_12820138 | All Organisms → cellular organisms → Bacteria | 2709 | Open in IMG/M |
| 3300000956|JGI10216J12902_106441223 | All Organisms → cellular organisms → Bacteria | 3196 | Open in IMG/M |
| 3300002231|KVRMV2_100014625 | All Organisms → cellular organisms → Bacteria | 4639 | Open in IMG/M |
| 3300004800|Ga0058861_11014121 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 787 | Open in IMG/M |
| 3300005172|Ga0066683_10370898 | All Organisms → cellular organisms → Bacteria | 887 | Open in IMG/M |
| 3300005180|Ga0066685_10340975 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1040 | Open in IMG/M |
| 3300005330|Ga0070690_101421760 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 558 | Open in IMG/M |
| 3300005332|Ga0066388_103555546 | All Organisms → cellular organisms → Bacteria | 796 | Open in IMG/M |
| 3300005435|Ga0070714_100239815 | All Organisms → cellular organisms → Bacteria | 1673 | Open in IMG/M |
| 3300005445|Ga0070708_100065441 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3260 | Open in IMG/M |
| 3300005467|Ga0070706_100407825 | All Organisms → cellular organisms → Bacteria | 1265 | Open in IMG/M |
| 3300005471|Ga0070698_101989682 | All Organisms → cellular organisms → Bacteria | 534 | Open in IMG/M |
| 3300005518|Ga0070699_100869554 | All Organisms → cellular organisms → Bacteria | 825 | Open in IMG/M |
| 3300005536|Ga0070697_100440439 | All Organisms → cellular organisms → Bacteria | 1134 | Open in IMG/M |
| 3300005549|Ga0070704_101763988 | All Organisms → cellular organisms → Bacteria | 572 | Open in IMG/M |
| 3300005558|Ga0066698_10466701 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 860 | Open in IMG/M |
| 3300005617|Ga0068859_102057188 | All Organisms → cellular organisms → Bacteria | 630 | Open in IMG/M |
| 3300005713|Ga0066905_100047842 | All Organisms → cellular organisms → Bacteria | 2593 | Open in IMG/M |
| 3300005764|Ga0066903_100221206 | All Organisms → cellular organisms → Bacteria | 2870 | Open in IMG/M |
| 3300006031|Ga0066651_10547439 | All Organisms → cellular organisms → Bacteria | 613 | Open in IMG/M |
| 3300006844|Ga0075428_100914965 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 930 | Open in IMG/M |
| 3300006845|Ga0075421_101917699 | All Organisms → cellular organisms → Bacteria | 634 | Open in IMG/M |
| 3300006847|Ga0075431_101064366 | All Organisms → cellular organisms → Bacteria | 774 | Open in IMG/M |
| 3300006847|Ga0075431_101560109 | All Organisms → cellular organisms → Bacteria | 618 | Open in IMG/M |
| 3300006876|Ga0079217_10258708 | All Organisms → cellular organisms → Bacteria | 935 | Open in IMG/M |
| 3300006904|Ga0075424_101426275 | All Organisms → cellular organisms → Bacteria | 735 | Open in IMG/M |
| 3300009012|Ga0066710_100014778 | All Organisms → cellular organisms → Bacteria | 8249 | Open in IMG/M |
| 3300009012|Ga0066710_100820450 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1427 | Open in IMG/M |
| 3300009038|Ga0099829_11770419 | All Organisms → cellular organisms → Bacteria | 506 | Open in IMG/M |
| 3300009089|Ga0099828_10495673 | All Organisms → cellular organisms → Bacteria | 1101 | Open in IMG/M |
| 3300009090|Ga0099827_11250510 | All Organisms → cellular organisms → Bacteria | 646 | Open in IMG/M |
| 3300009137|Ga0066709_100004896 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10847 | Open in IMG/M |
| 3300009137|Ga0066709_100078804 | All Organisms → cellular organisms → Bacteria | 3917 | Open in IMG/M |
| 3300009137|Ga0066709_102120310 | All Organisms → cellular organisms → Bacteria | 777 | Open in IMG/M |
| 3300009162|Ga0075423_10071043 | All Organisms → cellular organisms → Bacteria | 3614 | Open in IMG/M |
| 3300009444|Ga0114945_10036971 | All Organisms → cellular organisms → Bacteria | 2639 | Open in IMG/M |
| 3300009691|Ga0114944_1063564 | All Organisms → cellular organisms → Bacteria | 1369 | Open in IMG/M |
| 3300009691|Ga0114944_1494880 | All Organisms → cellular organisms → Bacteria | 520 | Open in IMG/M |
| 3300009807|Ga0105061_1093203 | All Organisms → cellular organisms → Bacteria | 518 | Open in IMG/M |
| 3300009873|Ga0131077_10008319 | All Organisms → cellular organisms → Bacteria | 21624 | Open in IMG/M |
| 3300009873|Ga0131077_10011265 | All Organisms → cellular organisms → Bacteria | 17509 | Open in IMG/M |
| 3300010029|Ga0105074_1059673 | All Organisms → cellular organisms → Bacteria | 683 | Open in IMG/M |
| 3300010043|Ga0126380_11026793 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 697 | Open in IMG/M |
| 3300010043|Ga0126380_12269553 | All Organisms → cellular organisms → Bacteria | 503 | Open in IMG/M |
| 3300010046|Ga0126384_10729234 | All Organisms → cellular organisms → Bacteria | 881 | Open in IMG/M |
| 3300010362|Ga0126377_12176910 | All Organisms → cellular organisms → Bacteria | 631 | Open in IMG/M |
| 3300010397|Ga0134124_10955018 | All Organisms → cellular organisms → Bacteria | 867 | Open in IMG/M |
| 3300010398|Ga0126383_13324467 | All Organisms → cellular organisms → Bacteria | 525 | Open in IMG/M |
| 3300010938|Ga0137716_10144108 | All Organisms → cellular organisms → Bacteria | 1613 | Open in IMG/M |
| 3300012204|Ga0137374_10250193 | All Organisms → cellular organisms → Bacteria | 1486 | Open in IMG/M |
| 3300012353|Ga0137367_10005197 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 10546 | Open in IMG/M |
| 3300012532|Ga0137373_10004742 | All Organisms → cellular organisms → Bacteria | 14847 | Open in IMG/M |
| 3300012929|Ga0137404_10001264 | All Organisms → cellular organisms → Bacteria | 16537 | Open in IMG/M |
| 3300012930|Ga0137407_10048485 | All Organisms → cellular organisms → Bacteria | 3434 | Open in IMG/M |
| 3300012931|Ga0153915_10175423 | All Organisms → cellular organisms → Bacteria | 2340 | Open in IMG/M |
| 3300013306|Ga0163162_12661768 | All Organisms → cellular organisms → Bacteria | 576 | Open in IMG/M |
| 3300015371|Ga0132258_10106442 | All Organisms → cellular organisms → Bacteria | 6626 | Open in IMG/M |
| 3300017659|Ga0134083_10543687 | All Organisms → cellular organisms → Bacteria | 525 | Open in IMG/M |
| 3300017997|Ga0184610_1074744 | All Organisms → cellular organisms → Bacteria | 1041 | Open in IMG/M |
| 3300018056|Ga0184623_10080767 | All Organisms → cellular organisms → Bacteria | 1498 | Open in IMG/M |
| 3300018063|Ga0184637_10038233 | All Organisms → cellular organisms → Bacteria | 2914 | Open in IMG/M |
| 3300018063|Ga0184637_10360739 | All Organisms → cellular organisms → Bacteria | 872 | Open in IMG/M |
| 3300018079|Ga0184627_10029962 | All Organisms → cellular organisms → Bacteria | 2739 | Open in IMG/M |
| 3300018079|Ga0184627_10484246 | All Organisms → cellular organisms → Bacteria | 640 | Open in IMG/M |
| 3300018433|Ga0066667_10410071 | All Organisms → cellular organisms → Bacteria | 1098 | Open in IMG/M |
| 3300018482|Ga0066669_10234004 | All Organisms → cellular organisms → Bacteria | 1438 | Open in IMG/M |
| 3300019249|Ga0184648_1037331 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 713 | Open in IMG/M |
| 3300019259|Ga0184646_1513719 | All Organisms → cellular organisms → Bacteria | 1144 | Open in IMG/M |
| 3300020063|Ga0180118_1240066 | All Organisms → cellular organisms → Bacteria | 661 | Open in IMG/M |
| 3300020170|Ga0179594_10024406 | All Organisms → cellular organisms → Bacteria | 1871 | Open in IMG/M |
| 3300020214|Ga0194132_10114828 | All Organisms → cellular organisms → Bacteria | 1683 | Open in IMG/M |
| 3300021081|Ga0210379_10406867 | All Organisms → cellular organisms → Bacteria | 601 | Open in IMG/M |
| 3300022563|Ga0212128_10694342 | All Organisms → cellular organisms → Bacteria | 611 | Open in IMG/M |
| 3300025922|Ga0207646_10038746 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4291 | Open in IMG/M |
| 3300025930|Ga0207701_10111887 | All Organisms → cellular organisms → Bacteria | 2440 | Open in IMG/M |
| 3300027815|Ga0209726_10212860 | All Organisms → cellular organisms → Bacteria | 1008 | Open in IMG/M |
| 3300027819|Ga0209514_10174877 | All Organisms → cellular organisms → Bacteria | 1111 | Open in IMG/M |
| 3300031720|Ga0307469_10432039 | All Organisms → cellular organisms → Bacteria | 1134 | Open in IMG/M |
| 3300031740|Ga0307468_100175840 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1414 | Open in IMG/M |
| 3300031747|Ga0318502_10980900 | All Organisms → cellular organisms → Bacteria | 515 | Open in IMG/M |
| 3300031820|Ga0307473_10790354 | All Organisms → cellular organisms → Bacteria | 676 | Open in IMG/M |
| 3300031834|Ga0315290_10176172 | All Organisms → cellular organisms → Bacteria | 1851 | Open in IMG/M |
| 3300031911|Ga0307412_10144841 | All Organisms → cellular organisms → Bacteria | 1745 | Open in IMG/M |
| 3300031938|Ga0308175_100003223 | All Organisms → cellular organisms → Bacteria | 11282 | Open in IMG/M |
| 3300031938|Ga0308175_100003862 | All Organisms → cellular organisms → Bacteria | 10430 | Open in IMG/M |
| 3300031938|Ga0308175_100094549 | All Organisms → cellular organisms → Bacteria | 2735 | Open in IMG/M |
| 3300031939|Ga0308174_10104121 | All Organisms → cellular organisms → Bacteria | 2041 | Open in IMG/M |
| 3300031949|Ga0214473_10090612 | All Organisms → cellular organisms → Bacteria | 3572 | Open in IMG/M |
| 3300031949|Ga0214473_10200389 | All Organisms → cellular organisms → Bacteria | 2311 | Open in IMG/M |
| 3300031949|Ga0214473_12128469 | All Organisms → cellular organisms → Bacteria | 543 | Open in IMG/M |
| 3300031997|Ga0315278_11642580 | All Organisms → cellular organisms → Bacteria | 613 | Open in IMG/M |
| 3300032261|Ga0306920_100967839 | All Organisms → cellular organisms → Bacteria | 1241 | Open in IMG/M |
| 3300032770|Ga0335085_10185747 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2562 | Open in IMG/M |
| 3300032892|Ga0335081_10557216 | All Organisms → cellular organisms → Bacteria | 1426 | Open in IMG/M |
| 3300033004|Ga0335084_10123267 | All Organisms → cellular organisms → Bacteria | 2695 | Open in IMG/M |
| 3300033407|Ga0214472_11508244 | All Organisms → cellular organisms → Bacteria | 575 | Open in IMG/M |
| 3300033417|Ga0214471_10505746 | All Organisms → cellular organisms → Bacteria | 966 | Open in IMG/M |
| 3300033417|Ga0214471_10791185 | All Organisms → cellular organisms → Bacteria | 740 | Open in IMG/M |
| 3300033487|Ga0316630_10211240 | All Organisms → cellular organisms → Bacteria | 1433 | Open in IMG/M |
| 3300034661|Ga0314782_217270 | All Organisms → cellular organisms → Bacteria | 503 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.82% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.84% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.86% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.86% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 5.88% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 5.88% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.88% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.90% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 4.90% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 3.92% |
| Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 3.92% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 3.92% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.94% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.94% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.96% |
| Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 1.96% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.96% |
| Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 1.96% |
| Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.98% |
| Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.98% |
| Marine Sediment | Environmental → Aquatic → Marine → Hydrothermal Vents → Sediment → Marine Sediment | 0.98% |
| Hot Spring Fe-Si Sediment | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Neutral → Hot Spring Fe-Si Sediment | 0.98% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.98% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.98% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.98% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere | 0.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.98% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.98% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.98% |
| Host-Associated | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated | 0.98% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.98% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.98% |
| Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.98% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300000789 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental | Open in IMG/M |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental | Open in IMG/M |
| 3300002231 | Marine sediment microbial communities from Santorini caldera mats, Greece - red mat | Environmental | Open in IMG/M |
| 3300004800 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - soil CB-1 (Metagenome Metatranscriptome) | Host-Associated | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental | Open in IMG/M |
| 3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental | Open in IMG/M |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental | Open in IMG/M |
| 3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental | Open in IMG/M |
| 3300006844 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 | Host-Associated | Open in IMG/M |
| 3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated | Open in IMG/M |
| 3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated | Open in IMG/M |
| 3300006876 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 | Environmental | Open in IMG/M |
| 3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental | Open in IMG/M |
| 3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental | Open in IMG/M |
| 3300009807 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_0_10 | Environmental | Open in IMG/M |
| 3300009873 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Wenshan plant | Engineered | Open in IMG/M |
| 3300010029 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_10_20 | Environmental | Open in IMG/M |
| 3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental | Open in IMG/M |
| 3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental | Open in IMG/M |
| 3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
| 3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300010938 | Sediment microbial community from Chocolate Pots hot springs, Yellowstone National Park, Wyoming, USA. Combined Assembly of Gp0156111, Gp0156114, Gp0156117 | Environmental | Open in IMG/M |
| 3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental | Open in IMG/M |
| 3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated | Open in IMG/M |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental | Open in IMG/M |
| 3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental | Open in IMG/M |
| 3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental | Open in IMG/M |
| 3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300020063 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300020214 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015054 Kigoma Offshore 80m | Environmental | Open in IMG/M |
| 3300021081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo | Environmental | Open in IMG/M |
| 3300022563 | OV2_combined assembly | Environmental | Open in IMG/M |
| 3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025930 | Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes) | Environmental | Open in IMG/M |
| 3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental | Open in IMG/M |
| 3300027819 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW37 contaminated, 5.8 m (SPAdes) | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031747 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f22 | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| 3300031834 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_0 | Environmental | Open in IMG/M |
| 3300031911 | Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-1 | Host-Associated | Open in IMG/M |
| 3300031938 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1 | Environmental | Open in IMG/M |
| 3300031939 | Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.P.R2 | Environmental | Open in IMG/M |
| 3300031949 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197 | Environmental | Open in IMG/M |
| 3300031997 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0 | Environmental | Open in IMG/M |
| 3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental | Open in IMG/M |
| 3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental | Open in IMG/M |
| 3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental | Open in IMG/M |
| 3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental | Open in IMG/M |
| 3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental | Open in IMG/M |
| 3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental | Open in IMG/M |
| 3300033487 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_May_M1_C1_D6_A | Environmental | Open in IMG/M |
| 3300034661 | Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60R3 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| INPhiseqgaiiFebDRAFT_1016374853 | 3300000364 | Soil | MSQTISKSHNCFRCVPHDIQGPFRQQGHEDAPALESLETTFAENGDEVFFASGYSAREIAQKHARETGGQIYINNVSRTIKRRELTVPVAYAVSKSPVYTLRGPDAWHVQVYRVYWPPLGR* |
| INPhiseqgaiiFebDRAFT_1058864082 | 3300000364 | Soil | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHVREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| JGI1027J11758_128201383 | 3300000789 | Soil | MSQTISKSHNCFRCVPHDIQGPFRQQGHEDAPALESLETTFAENGDEVFFASGYSAREIAQKHARETGGQIYINNVSRTIKRRELTVPVAYAVSKSPVYTLRGADAWHVQVYRVYWPPLGR* |
| JGI10216J12902_1064412233 | 3300000956 | Soil | MSAGTQQTHRCYKCAVHDVQGPFRQQGLDTAPALDSLETTYAEGGDEIFFTSGYSAREIAQKRVREAGGRLYVNNISRSIKRRDLSYPVAYAVAKSPVFTLRAPDQWHDKVYRAYWPLL* |
| KVRMV2_1000146253 | 3300002231 | Marine Sediment | MGQATNQGHTCYRCVPHDVQGPFRQLGDANAPELESAETTFSEDGDEIYFASGYSAREIAKTHAEESGGTVYVNNISRKIKRHDLNVSVAYAVAKSPIYTLRPPDDFHSQVYRAYWPPL* |
| Ga0058861_110141211 | 3300004800 | Host-Associated | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADVGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKIRRRDLTAPVAYAVSKSPVYTLRAPDE |
| Ga0066683_103708982 | 3300005172 | Soil | MATKHSCHRCAVHEVQGPFLQQGVDGAPELESLDTTFAEDGQEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0066685_103409752 | 3300005180 | Soil | MSAATQPHRCYRCRPHEVQGPFRQQGDDSAPALDSIETTFSEEGDEVFFTSGYSAREIAQKHARETGGRVYINNISRTIKRPDLTVSVAYGVAKGAVYTLRAPDSLHDKVYRAYWPPLE* |
| Ga0070690_1014217602 | 3300005330 | Switchgrass Rhizosphere | MSKHNCYRCAVHDVEGPFRQPGAEDAPELETLETTSSDGAEEPFFTSGYSAREIAQRHVREHGGQIYVNNISRKIKRKDLTVSVAYAVAKTP |
| Ga0066388_1035555462 | 3300005332 | Tropical Forest Soil | VHEVQGPFLQQGVDGAPELESLDTTFAEDGQEVFFATGYSAREIAQRHSREKGGQIYVNNISRKLKRRDLTAAVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0070714_1002398154 | 3300005435 | Agricultural Soil | MNPTVTPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSEQGDEIFYASGYSAREVAQKHALESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRPPDAVHDKVYRAYWPSL* |
| Ga0070708_1000654414 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHTCYRCAVHELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0070706_1004078252 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHSCYRCAVHEVQGPFLQQGVDGAPELETLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0070698_1019896822 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHTCYRCAVHELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKV |
| Ga0070699_1008695542 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHTCYRCAVHELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPVL* |
| Ga0070697_1004404391 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | ELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0070704_1017639881 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MSAAAPHKCYRCRPHEVQGPFRQLGDENAPPLDSIETTFSDSGDEVFFTSGYSAREIAQKHARTTGGQVYVNNISRTIKRPDLTVSVAYGVAKSPVYTLRPPDGAHDK |
| Ga0066698_104667012 | 3300005558 | Soil | MSRHSCHRCVTHDVQDGFRQPGAEDGPELETLETTFADGGEEVFFASGYSAREIAQRHVREHGGQIYVNNISRKIKRRDLTVSVAYAVAKTAVYTLRGPDEHHTRVYRAYWPP |
| Ga0068859_1020571881 | 3300005617 | Switchgrass Rhizosphere | MSKHNCYRCAVHDVEGPFRQPGAEDAPELETLETTSSDGAEEPFFTSGYSAREIAQRHVREHGGQIYVNNISRKIKRKDLTVSVAYAVAKTPVYTLRAPDENHPKVYRAY |
| Ga0066905_1000478424 | 3300005713 | Tropical Forest Soil | MATKHSCYRCAVHEVQRPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGAQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0066903_1002212063 | 3300005764 | Tropical Forest Soil | MASKHSCYRCAVHEIQGPFLQQGADGSPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0066651_105474392 | 3300006031 | Soil | IPTERGASQHCGGEGAGRVVMATKHSCHRCAVHEVQGPFLQQGVDGAPELESLDTTFAEDGQEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0075428_1009149652 | 3300006844 | Populus Rhizosphere | MSAATGPGHKCFRCRPHEVQGPFRHQGDETAPALESIQTTFSEAGDEVFLTSGYSAREIAQKHARETGGSVYVNNISRSIKRPDLTVSVAYGVAKGHIYTLRPPDSFHSGVYRAYWPPL |
| Ga0075421_1019176992 | 3300006845 | Populus Rhizosphere | YIMSQTTSQRHNCFRCVPHDIQGPFRQQGHENAPALESLETTFAENDAEVFFASGYSAREIAQKHARENGGQIYINNVSRTIKRRELTVPVAYAVSKSPVYTLRGPDARHDQVYRAYWPPLER* |
| Ga0075431_1010643662 | 3300006847 | Populus Rhizosphere | MSKHACHRCVVHDVQGPFRPDGGDDAPELETLETTFADGGEEIFFTSGYSAREIAQRHVREHGGVIYVNNVSRKIKRRDLTVSVGYAVAKTAIYTLRAPDEHHAKVYRAYWPPL* |
| Ga0075431_1015601092 | 3300006847 | Populus Rhizosphere | MSAAAPHKCYRCRPHEVQGPFRQLGDENAPPLDSIETTFSDSGDEVFFTSGYSAREIAQKHARTTGGQVYVNNISRTIKRPDLTVSVAYGVAKSPVYTLRPPDGAHDKVYRAYWPPLE* |
| Ga0079217_102587082 | 3300006876 | Agricultural Soil | MNPAVTQRHTCHRCEPHDVQGPFRQQGHEDAPVLESIDTTYSEKGDDIFYASGYSAREVAQKRALESGGRVYVNNISRNIKRRDLSNTLAPVFYAVAKSAVYTLRAPDAVHDKVYRAFWPAL* |
| Ga0075424_1014262752 | 3300006904 | Populus Rhizosphere | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYSLRAPDEWHDKVYRAYWPPL* |
| Ga0066710_1000147783 | 3300009012 | Grasslands Soil | MSRHSCHRCVTHDVQDGFRQPGAEDGPELETLETTFADGGEEVFFASGYSAREIAQRHVREHGGQIYVNNISRKIKRRDLTVSVAYAVAKTAVYTLRGPDEHHTRVYRAYWPPL |
| Ga0066710_1008204501 | 3300009012 | Grasslands Soil | HSCHRCVTHDVQGPFRQPGAEGAPELETLETTFSDGGEEVFFTSGYSAREIAQRHVREHGGQIYVNNISRKIKRRDLTVSVAYAVAKTPVYTLRGQDEHHDKVYRAYWPPL |
| Ga0099829_117704192 | 3300009038 | Vadose Zone Soil | GGKTMSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAQEAGWQVYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDAWHDKVYRAYWPPLS* |
| Ga0099828_104956731 | 3300009089 | Vadose Zone Soil | TCFRCVPHEIQGPFRQQGYEDAPALESLETTFAENGDEIFFASGYSAREIAQKHAREAGGQIYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDVWHDKVYRAYWPPLS* |
| Ga0099827_112505102 | 3300009090 | Vadose Zone Soil | MSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAQEAGWQVYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDTWHDKVYRAYWPPLS |
| Ga0066709_1000048964 | 3300009137 | Grasslands Soil | MATKHSCHRCAVHEVQGPFLQQGVDGATELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0066709_1000788043 | 3300009137 | Grasslands Soil | MSRHSCHRCVTHDVQDGFRQPGAEDGPELETLETTFADGGEEVFFASGYSAREIAQRHVREHGGQIYVNNISRKIKRRDLTVSVAYAVAKTAVYTLRGPDEHHTRVYRAYWPPL* |
| Ga0066709_1021203102 | 3300009137 | Grasslands Soil | QTISKSHNCFRCVPHDIQGPFRQQGHEDAPALESLETTFAENDAEVFFASGYSAREIAQKHARENGGQIYINNISRTIKRRELTVPVAYAVSKSPVYTLRGPDAWHDQVTRAYWPPLER* |
| Ga0075423_100710432 | 3300009162 | Populus Rhizosphere | VQGPFRQQGDENAPPLDSIETTFSDSGDEVFFTSGYSAREIAQKHARTTGGQVYVNNISRTIKRPDLTVSVAYGVAKGPVYTLRSPDAAHDKVYRAYWPPLE* |
| Ga0114945_100369713 | 3300009444 | Thermal Springs | VPHEVQEPVRQQEGEAGPELESIDSTFSDEGDEVFFTSGYSAREIAQKHARECGGQVYVNNISRRIRRHDLTVPVAYAVARGPVYTLRPPDGHHYRIYRSYWPPLKGDAS* |
| Ga0114944_10635643 | 3300009691 | Thermal Springs | MSPATSSKHNCFRCVPHEIQGPFRQQGHEDAPALESIETTFSDNRDELFFASGYSAREIAQKHAKDTGGQVYINNISRNIKRHELTVPVAYAVSKSPVYTLRSPDAWHDKVYRMYWPPLDK* |
| Ga0114944_14948801 | 3300009691 | Thermal Springs | VQEPFREQGSEDAPALESSETTFSENGDEVFFTSGYSAREIAQKRAGETGGQVYVNNVSRNIKRRDSTLPVAYAVSKSAVYTLRGPDEWHDKIYRAYWPLASRLGRSCRATHQDHGGCT |
| Ga0105061_10932031 | 3300009807 | Groundwater Sand | MTAGTGTGHRCFRCAAHDVQGPFRVQGQEADTPLLDSIETTFADDKSAIFFASGYSAREIAQKRAQETGLCVYVNNVSRNVRRKELTVPVAYAVAASPVYTLKGPDRWHDKVYRAYWPPL |
| Ga0131077_100083193 | 3300009873 | Wastewater | MNQTMTQRHTCYRCRPHEVQEPFRQQGNDDAPSLDSIETTASEESDEIFFASGYSAREIAQKHARESGGRVYVNNISRNIKRRDLSNTLAPVFYAVAASPVYTLRPPDALHDKVYRAYWPPLQG* |
| Ga0131077_100112653 | 3300009873 | Wastewater | MNQIMTQRHTCYRCRPHEIQEPFRQQGNDEAPPLDSIETTAAEDRSEIFFASGYSAREIAQKHARESGGQVYVNNISRAIKRRDLSNTLAPVFYAVAASPVYTLRPPDALHDKVYRAYWPPLQG* |
| Ga0105074_10596732 | 3300010029 | Groundwater Sand | RCFRCAAHDVQGPFRVQGQEADTPLLDSIETTFADDKSAIFFASGYSAREIAQKRAQETGLCVYVNNVSRNVRRKELTVPVAYAVAASPVYTLKGPDRWHDKVYRAYWPPL* |
| Ga0126380_110267932 | 3300010043 | Tropical Forest Soil | MATKHSCYRCAVHEVQRPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHSREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHD |
| Ga0126380_122695532 | 3300010043 | Tropical Forest Soil | VDGAPELESLDTTFAEDGQEIFFATGYSAREIAQRHSREKGGQIYVNNISRKLKRRDLTAAVAYSVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0126384_107292342 | 3300010046 | Tropical Forest Soil | MATKHSCYRCAVHEVQRPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGAQIYVNNISRKLKRRDLTAAVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0126377_121769102 | 3300010362 | Tropical Forest Soil | MAIKHSCYRCTVHEVQGPFLQQGVDGAPELESLDTTFAENGEEIFFATGYSAREIAQRHTREKGGQIYVNNISRKLKRRDLTAAVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0134124_109550182 | 3300010397 | Terrestrial Soil | MSKHNCYRCAVHDVEGPFRQPGAEDAPELETLETTSSDGAEEPFFTSGYSAREIAQRHVREHGGQIYVNNISRKIKRKDLTVSVAYAVAKTPVYTLRAPDENHPKVYRAYWPPL* |
| Ga0126383_133244672 | 3300010398 | Tropical Forest Soil | MATKHSCYRCAVHEVQRPFLQQGVDGAPELESLDTTFAEDGGEIFFATGYSAREIAQRHAREKGAQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0137716_101441082 | 3300010938 | Hot Spring Fe-Si Sediment | MDQAKGSAHRCYKCRSHDVQGPFRFQGSGEGPELESLETTFSENGDDIFFTSGYSAREIAQRHQRENGGQIYVNSISRHIKRRELTVPVAYAVAKSPVYTLR |
| Ga0137374_102501933 | 3300012204 | Vadose Zone Soil | MATKHSCYRCAVHEVQGPFLQQGVDGVPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0137367_100051978 | 3300012353 | Vadose Zone Soil | MAAKHSCYRCAVHEVQGPFLQQGVDGVPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0137373_1000474212 | 3300012532 | Vadose Zone Soil | MATKHSCYRCAVHEVQGPFLQQGVDGVPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKCPVYTLRAPDEWHDKVYRAYWPPL* |
| Ga0137404_1000126413 | 3300012929 | Vadose Zone Soil | VQGPFRQQGHEDAPAWESIETTFAENGDEVFFASGYSAREIAQKHARETGGQVYVNTISRNIKRRELSVPVSYAASRSSVYTLRGPDEWHDKVYRAYWPPL* |
| Ga0137407_100484853 | 3300012930 | Vadose Zone Soil | MSQATRQRHPCFCCAPHEVQGPFRQQGHEDAPAWESIETTFAENGDEVFFASGYSAREIAQKHARETGGQVYVNTISRNIKRRELSVPVSYAASRSSVYTLRGPDEWHDKVYRAYWPPL* |
| Ga0153915_101754232 | 3300012931 | Freshwater Wetlands | VHENQGPYRQQGVDDAPELESLETTHSDSGEEIFFTSGYSAREIAQRHQREKGGQIYVNNVSRKIKRRDLTVSVAYAVSKSPVYTLKGPDERHDKVYRVYWPPL* |
| Ga0163162_126617681 | 3300013306 | Switchgrass Rhizosphere | MSQTTSQKHNCFRCVPHDIQGPFRQQGHEDAPALESLETTFAENDDEVFFASGYSAREIAQKHARENGGQIYVNNVSRAIKRRELTVAVAYAVSKSPVYTLRGPDAWHDNVYRTYWPPLER* |
| Ga0132258_1010644210 | 3300015371 | Arabidopsis Rhizosphere | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGAQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDRVYRAYWPPL* |
| Ga0134083_105436871 | 3300017659 | Grasslands Soil | VETMSQAPPQHACYRCVAHERQEPFREQGQPEAPALESIETTFCDTGGEVFFTSGYSAREIAQKRARETGGRVYVNTVSRKIKRPELTQPVAYAVANGPVYTLRGPDEWHDKVYRVYWPP |
| Ga0184610_10747442 | 3300017997 | Groundwater Sediment | MSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAREAGGQIYVNNISRNIKRRDLTVSVFYAVSKSPVYTLRVPDVWHDKVYRTYWPPLS |
| Ga0184623_100807672 | 3300018056 | Groundwater Sediment | MNQATKGRHNCYRCMVHEIQTPFRQQGDDKAPALESIETTFSDNGDEVFCTSGYSAREIAQKHAGETGGQIYVNNVSRRIKRPDLSYPVAYAVSKSPVYTLRGPDEWHEKVYRAYWPPL |
| Ga0184637_100382332 | 3300018063 | Groundwater Sediment | MSQATSKRHTCFRCVSHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAREAGGQIYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDVWHDKVYRAYWPPLS |
| Ga0184637_103607392 | 3300018063 | Groundwater Sediment | RHACFRCVPHEHQGPFRNAQSPDAPPLESIETTFAEGGEGVFFTSGYSAREIAQKHARETGGRVYVNSVSRNIKRPDLSYPVAYAVAESPVYTLRGPDEWHDKVFRIYWPPLS |
| Ga0184627_100299623 | 3300018079 | Groundwater Sediment | MSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAREAGGQIYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDTWHDKVYRAYWPPLS |
| Ga0184627_104842461 | 3300018079 | Groundwater Sediment | MAQTTGRHACFRCVPHEHQGPFRNAQSPDAPPLESIETTFAEGGEEVFFTSGYSAREIAQKHARETGGRVYVNSVSRNIKRPDLSYPVAYAVAESPVYTLRGPDEWHDKVFRIYWPPLS |
| Ga0066667_104100712 | 3300018433 | Grasslands Soil | MATKHSCHRCAVHEVQGPFLQQGVDGAPELESLDTTFAEDGQEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0066669_102340041 | 3300018482 | Grasslands Soil | QGPFLQQGVDGATELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0184648_10373312 | 3300019249 | Groundwater Sediment | MSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAQEAGWQVYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDVWHDKVY |
| Ga0184646_15137193 | 3300019259 | Groundwater Sediment | QQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAREAGGQIYVNNISRNIKRRDLTVSVFYAVSKSPVYTLRVPDVWHDKVYRTYWPPLS |
| Ga0180118_12400662 | 3300020063 | Groundwater Sediment | MVQVTGRRHTCFRCTPHDVQGPFRQQGDQEAPELESIETTFSESGDEVFFASGYSAREIAQKHARETDGQVYVNNVSRQIKRRDLNVPVSYGVSKTPVYTVRGPDEWHDKVYRVYWPPL |
| Ga0179594_100244063 | 3300020170 | Vadose Zone Soil | MSQATRQRHPCFCCAPHEVQGPFLQQGHEDAPAWESIETTFAENGDEVFFASGYSAREIAQKHARETGGQVYVNTISRNIKRRELSVPVSYAASRSSIYTLRGPDEWHDKVYRAYWPPL |
| Ga0194132_101148282 | 3300020214 | Freshwater Lake | MSAATGQGHKCYRCAPHDVQGPFRHQGHEDAPELETIETTFSDKGGEVFFTSGYSAREIAQKHARETGGAVYVNNISRKIKRRDLTVSVAYGVADSAVYTLRPPDGAHDKVYRAYWPPLT |
| Ga0210379_104068671 | 3300021081 | Groundwater Sediment | AGTNARIQCTEGGGKTMSQATSKRHTCFRCVPHEVQGPFRQQGYEDAPALESLETTFADNGDEIFFASGYSAREIAQKHAQEAGWQVYVNNISRNIKRRDLTVPVFYAVSKSPVYTLRVPDTWHDKVYRAYWPPLS |
| Ga0212128_106943421 | 3300022563 | Thermal Springs | MSPVTSQRHTCYRCAVHDVQEPFREQGSEDAPALESGETTFSENGDEVFFTSGYSAREIAQKRAGETGGQVYVNNVSRNIKRRDSTLPVAYAVSKSAVYTLRGPDEWHDKIYRAYWPLASRLGRSCRATHQDHGGCT |
| Ga0207646_100387464 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHTCYRCAVHELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0207701_101118872 | 3300025930 | Corn, Switchgrass And Miscanthus Rhizosphere | MATKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKIRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0209726_102128602 | 3300027815 | Groundwater | QATTKRHSCYRCVPHESQGPFRQQGHEKAPALESIETTFSDKGEELYFTSGYSAREIAQAHARETGGQIYVNNISRNIKRPNLTVPVAYAVSKSPVYTLKGTDEWHDKVYRAYWPPL |
| Ga0209514_101748771 | 3300027819 | Groundwater | MNQATTKRHSCYRCVPHESQGPFRQQGHEKAPALESIETTFSDKGEELYFTSGYSAREIAQAHARETGGQIYVNNISRNIKRPNLTVPVAYAVSKSPVYTLKGTDEWHDKVYRAYWPPL |
| Ga0307469_104320392 | 3300031720 | Hardwood Forest Soil | MATNHSCYRCAVHEVQGPFLQQGVDGAPELESLGTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0307468_1001758402 | 3300031740 | Hardwood Forest Soil | MATNHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0318502_109809002 | 3300031747 | Soil | MAAKHSCYRCTVHEVQGPFLQQGVDGAPELESLDTTFADEGAEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDESHDKVYRAYWPRL |
| Ga0307473_107903542 | 3300031820 | Hardwood Forest Soil | MATKHTCYRCAVHELQGPFLQQGVDGAPELESLDTTFAEDGEEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| Ga0315290_101761721 | 3300031834 | Sediment | MTSTARHACFRCAVHEVQGPFRQQGVEDAPELESLETTQSDSGEDIFFTSGYSAREIAQRHQREKGGQIWVNNVSRKIKRRELTVSVAYAVSKSPVYTLKGPDEGHDKVYR |
| Ga0307412_101448411 | 3300031911 | Rhizosphere | MNSAVTPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSEQGDEIFYASGYSAREVAQKHALESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRAPDAVHDKVYRAYWPSL |
| Ga0308175_10000322311 | 3300031938 | Soil | MNQTATPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSEKGDEIFYASGYSAREVAQKHALESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRPPDAVHDKVYRAYWPSL |
| Ga0308175_1000038629 | 3300031938 | Soil | MNPTATPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSEQGDEIFYASGYSAREVAQKHAAESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRPPDAVHDKVYRAYWPSL |
| Ga0308175_1000945494 | 3300031938 | Soil | MNQTATPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSENGDEIFYASGYSAREVAQKHALESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRAPDPVHDKVYRAYWPSL |
| Ga0308174_101041214 | 3300031939 | Soil | MNPTATPRHTCHRCQPHDVQGPFRQQGHEDAPALESIDTTYSEQGDEIFYASGYSAREVAQKHAAESGGRVYVNNISRNIKRRDLSSTLAPVFYAVAKSAVYTLRPPDAV |
| Ga0214473_100906123 | 3300031949 | Soil | VSPATQPRHACFRCAPHNVQGPFREQGDEGAPELESIDTTFSDNGDEVFFASGYSAREIAQKHGRETGGAVYVNNVSRKIKRRDLTVAVAYAVAKSPVYTLRAPDQCHAGVYRAYWPPL |
| Ga0214473_102003892 | 3300031949 | Soil | MTTATKHKCYRCAVHDVQGPFRQQGVEDAPELESLETTRSDGGEEVFFTSGYSAREIAQRHQREKGGQIYVNNISRKIKRRELTVSVAYAVAKKPVYTLMGPDQWHDKVYRAYWPPL |
| Ga0214473_121284691 | 3300031949 | Soil | MTTATKHKCYRCVAHDVQGPFSQQGVENAPELESLETTRSDGSEEIFFTSGYSAREIAQRHQRENGGQVYVNNISRKIKRRELTVSVAYAVAKKPVYTLMGPDQWHDKVYRAYWPPL |
| Ga0315278_116425801 | 3300031997 | Sediment | MTSTARHACFRCAVHEVQGPFRQQGVEDAPELESLETTQSDSGEDIFFTSGYSAREIAQRHQREKGGQIWVNNVSRKIKRRELTVSVAYAVSKSPVYTLKGPDEGHDKVYRAYWPPL |
| Ga0306920_1009678392 | 3300032261 | Soil | MAAKHSCYRCAVHEVQGPFLQQGVDGAPELESLDTTFADEGAEIFFATGYSAREIAQRHAREKGGQIYVNNISRKLKRRDLTAPVAYAVSKSPVYTLRAPDESHDKVYRAYWPPL |
| Ga0335085_101857473 | 3300032770 | Soil | MDQAKVTRHSCYRCRPHDVQGPFRQSGVGEGPELDSIETTFADGGDEVFFTSGYSAREIAQTRARENGGQVYVNNVSRNVKRRDLSMPVAYAVAKSPVYTLRGPDEWHDQIYRAYWPTL |
| Ga0335081_105572163 | 3300032892 | Soil | PAHSSCEPAGARVGSMGHDMTQMTATRHTCYRCAPHDEQGPFRHQGAGDGPELESIDTTRADAGDEVFFTSGYSAREIAQKRARETGGQVYVNNISRNIKRRDLSVPVAYAVAKSPVYTLRAPDAAHAQVYRAYWPPL |
| Ga0335084_101232672 | 3300033004 | Soil | MTHDQPNKSTPHTCHRCRPHDVQGPFRLRGTGDGPELESIETTFAEAGDEAFFTSGYSAREIAQKRARETGGQVYVNNISRNIKRRELSVPVAYAVAKSPVYTLRAPDEWHDEVYRAYWPTL |
| Ga0214472_115082441 | 3300033407 | Soil | MNQAAKSRHTCYRCAPHGVQGPFRQQGHADASALESIETTYSESGDEIFFTSGYSAREIAQVHAKETGDQIYVNNVSRNIKRPDLSYPVAYGVSKGPVYTLRPPDEWHDKVYRAYWPPL |
| Ga0214471_105057461 | 3300033417 | Soil | RCAVHDVQGPFRQHGKGDGPELESLDTTHAENGAEIFFTSGYSAREIAQRHQREHGGQVYVNNVSRNVKRRDLSTPVAYAVSKSPLYTLKGPDEWHDAVYRAYWPPL |
| Ga0214471_107911852 | 3300033417 | Soil | MNQATKSRHTCYRCAPHEVQGPFRQQGHADAPALESIETTYSESGDEIFFTSGYSAREIAQVHAKETGDQIYVNNISRNIKRPDLSYPVAYGVSKGSVYTLRPADEWHDKVYRAYWPPL |
| Ga0316630_102112402 | 3300033487 | Soil | MKQTQATRHTCHRCQPHDVQGPFRPPGTGGEGPELESIDTTFADAGDEVFFTSGYSAREIAQKHARESGGQVYVNNISRNIKRRGMSVPVSYAVAKSPVYTLRGADAAHDQVYRAYWPAL |
| Ga0314782_217270_134_481 | 3300034661 | Soil | MATKHSCYRCAVHEVQEPFLQQGVDGAPELESLDTTFADGGEEIFFATGYSAREIAQRHAREKGAQIYVNNISRKLRRRDLTAPVAYAVSKSPVYTLRAPDEWHDKVYRAYWPPL |
| ⦗Top⦘ |