| Basic Information | |
|---|---|
| Family ID | F102946 |
| Family Type | Metagenome |
| Number of Sequences | 101 |
| Average Sequence Length | 80 residues |
| Representative Sequence | PVGLSNRPPWLDTVLPIAIAGFAILPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTK |
| Number of Associated Samples | 82 |
| Number of Associated Scaffolds | 101 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 76 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.45 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (33.663 % of family members) |
| Environment Ontology (ENVO) | Unclassified (57.426 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (69.307 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Transmembrane (alpha-helical) | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 51.52% β-sheet: 2.02% Coil/Unstructured: 46.46% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.45 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 101 Family Scaffolds |
|---|---|---|
| PF00933 | Glyco_hydro_3 | 45.54 |
| PF13472 | Lipase_GDSL_2 | 12.87 |
| PF02776 | TPP_enzyme_N | 2.97 |
| PF13784 | Fic_N | 2.97 |
| PF13098 | Thioredoxin_2 | 1.98 |
| PF00180 | Iso_dh | 1.98 |
| PF02683 | DsbD | 0.99 |
| PF01642 | MM_CoA_mutase | 0.99 |
| PF13620 | CarboxypepD_reg | 0.99 |
| PF07291 | MauE | 0.99 |
| PF02452 | PemK_toxin | 0.99 |
| PF04952 | AstE_AspA | 0.99 |
| PF13556 | HTH_30 | 0.99 |
| COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds |
|---|---|---|---|
| COG1472 | Periplasmic beta-glucosidase and related glycosidases | Carbohydrate transport and metabolism [G] | 45.54 |
| COG1884 | Methylmalonyl-CoA mutase, N-terminal domain/subunit | Lipid transport and metabolism [I] | 0.99 |
| COG2337 | mRNA-degrading endonuclease MazF, toxin component of the MazEF toxin-antitoxin module | Defense mechanisms [V] | 0.99 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 100.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 33.66% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 16.83% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 15.84% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.88% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 7.92% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.94% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.94% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.99% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.99% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
| 3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental | Open in IMG/M |
| 3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental | Open in IMG/M |
| 3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental | Open in IMG/M |
| 3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental | Open in IMG/M |
| 3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental | Open in IMG/M |
| 3300005566 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 | Environmental | Open in IMG/M |
| 3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental | Open in IMG/M |
| 3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental | Open in IMG/M |
| 3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental | Open in IMG/M |
| 3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental | Open in IMG/M |
| 3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated | Open in IMG/M |
| 3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
| 3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
| 3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300021418 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2 | Environmental | Open in IMG/M |
| 3300026305 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142 (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026314 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes) | Environmental | Open in IMG/M |
| 3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental | Open in IMG/M |
| 3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental | Open in IMG/M |
| 3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
| 3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
| 3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental | Open in IMG/M |
| 3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028716 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198 | Environmental | Open in IMG/M |
| 3300028717 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_158 | Environmental | Open in IMG/M |
| 3300028719 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182 | Environmental | Open in IMG/M |
| 3300028771 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369 | Environmental | Open in IMG/M |
| 3300028791 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144 | Environmental | Open in IMG/M |
| 3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental | Open in IMG/M |
| 3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental | Open in IMG/M |
| 3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental | Open in IMG/M |
| 3300028884 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195 | Environmental | Open in IMG/M |
| 3300028885 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185 | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI25382J43887_101466521 | 3300002908 | Grasslands Soil | FGRPVGLSNRRPWLDTVLPIAIAALAIVPAALYPFLGTVSIAVAAAILVLFFAGMSLSLGLGPYSNVKVALVTK* |
| Ga0066674_100618281 | 3300005166 | Soil | QRFGRPVGISNRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK* |
| Ga0066683_100703433 | 3300005172 | Soil | GIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPIAIATFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0066680_101734763 | 3300005174 | Soil | SSALGAAFGVVSFVAGWIVARRFGHPVGSSDARPWLDTVLPIAIAAFAIAPAAAYPFVGQISIAVTAAVFVLFFAGMSLSLGAGPYSNEKVALSLKAESRS* |
| Ga0066673_103732351 | 3300005175 | Soil | GISNRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK* |
| Ga0066679_110289831 | 3300005176 | Soil | ALFTAGWFAAQRFGRPVGLTNNRPWLDTVLPVAIAAFAVLPAALYPFLGSISVAVGAAIIVLFFAGMSLSLGLGPYSNVKVALVAR* |
| Ga0066688_101833381 | 3300005178 | Soil | WFAAQRFGRPAGLNNERPWLDTVLPIAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSPR* |
| Ga0066684_101109031 | 3300005179 | Soil | SVLAGVAGLMAFGAGWLVARRFAAPIGLGNSRPWLDTALPIALAAFAIAPAALYPLIGEVSFAATAAVFVLFFAGVSLSLGSGPYANVKVALAR* |
| Ga0066676_105958252 | 3300005186 | Soil | PAGLNNERPWLDTVLPIAIATFAVLPAALYPFLGNVSLGIAAAIIVLFFAGISLSLGLGPYANVKVALVTR* |
| Ga0070708_1003914941 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | AIAAFALLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTKA* |
| Ga0070708_1007817052 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | IAIAAFALLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALVAR* |
| Ga0070708_1007916571 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | IAAFALLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGAR* |
| Ga0066689_104087462 | 3300005447 | Soil | GRPVGTSDERPWLDTVLPIAIAAFAIAPAAAYPFLGQISIAVTAAVFVLFFAGMSLSLGAGPYANEKVALSLKLEPRS* |
| Ga0066682_100820511 | 3300005450 | Soil | WFAAQRFGRPAGLNNERPWLDTVLPIAIATFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0066687_108555421 | 3300005454 | Soil | WLDTVLPIAIAGFAILPAALFPFIGTVSIVVAAAIVVLFFAGMSLSLGFGPYSNVKVALSTKS* |
| Ga0070707_1006351824 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VGASDARPWLDTVLPIAIAAFAIAPAAAYPFIGQISIAVTAAVFVLFFAGMSLSLGAGPYSNEKVALSLKAESRS* |
| Ga0070707_1017351612 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | FLGIALFAAGWFAAQRFGRPVGLTNKPPWLDTVLPVAIAAFAVLPAALYPFLGSISVAVGAAIIVLFFAGMSLSLGIGPYANVKVALVAR* |
| Ga0070697_1006338241 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | WLDTVLPIAIAVFAILPAALYPFLGTLSIAVGAAIVVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0066697_104808972 | 3300005540 | Soil | GLLLFTAGWFAAQRFGRPVGISNRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK* |
| Ga0066692_107247751 | 3300005555 | Soil | VATVFLGFVAFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIALFALLPAALYPLMGTVSIAVAAAIVVLFFAGMSVSLGLGPYANVKVALSTKS* |
| Ga0066700_101233691 | 3300005559 | Soil | FVTEGGALSDLAGIVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFLGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR* |
| Ga0066700_103004931 | 3300005559 | Soil | PAGLNNERPWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSLR* |
| Ga0066700_104135242 | 3300005559 | Soil | DTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLIFAGMSLSLGLGSYANVKVALSLR |
| Ga0066700_105165511 | 3300005559 | Soil | VGSSDARPWLDTVLPIAIAAFAIAPAAAYPFVGQISIAVTAAVFVLFFAGMSLSLGAGPYSNEKVALSLKAESRS* |
| Ga0066670_106810641 | 3300005560 | Soil | VTEGGALSDLAGIVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFVGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR* |
| Ga0066693_103475771 | 3300005566 | Soil | FGRPVGISNRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK* |
| Ga0066703_100663261 | 3300005568 | Soil | ALFTAGWLAAQRFGRPVGLTNKPPWLDTVLPVAIAAFALLPAALYPFIGSVPVAVGAAIIVLFFAGMSLSLGLGPYANVKVALARQ* |
| Ga0066691_102493722 | 3300005586 | Soil | DERPWLDTVLPIAIAAFAIAPAAAYPFLGQISIAVTAAVFVLFFAGMSLSLGAGPYANEKVALSLKLEPRS* |
| Ga0066706_108402541 | 3300005598 | Soil | GFVAFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIALFAFLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS* |
| Ga0066651_108042401 | 3300006031 | Soil | TQAGSVAPVILGFVTFAAGWFVAQRFGRPVGLSNRPPWLDTVLPVAIAGFAILPAALYPFIGTVSIVVAAAIVVLFFAGMSLSLGLGPYSNVKVALSTKS* |
| Ga0066656_106936572 | 3300006034 | Soil | GIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLIFAGMSLSLGLGSYANVKVALSPR* |
| Ga0075417_101210291 | 3300006049 | Populus Rhizosphere | VIETDGTLVTAALGAVLLAAGWFVARRFGRPVGDSKRAPWLDTVLPIAIAGFALLPAALFPFIGKVSIATAAAIVVLFFAGISLSLGFGPYKNVKVALVR* |
| Ga0066658_108072241 | 3300006794 | Soil | PWLVAVLPVAIAAFAVLPAALYPFIGSVSVAVGAAIIVLFFAGMSLSLGLGPYANVKVALARQ* |
| Ga0066665_101837211 | 3300006796 | Soil | GSVATVFLGFVAFTAGWFAAQRFGRPVGLSNRPPWLDTVLPVAIALFALLPAALFPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS* |
| Ga0075433_101055303 | 3300006852 | Populus Rhizosphere | VLVGFVTFTAGWFVAKRFGRPIGLSNRAPWLDTVLPIAIAAFAVLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALGAS* |
| Ga0075425_1027094442 | 3300006854 | Populus Rhizosphere | LSNRPPWLDTVLPIAIAGFALLPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALVTK* |
| Ga0075426_115431202 | 3300006903 | Populus Rhizosphere | TVLPIAIAAFAVLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALGAS* |
| Ga0066710_1001106021 | 3300009012 | Grasslands Soil | GTAFGLVFFIAGWVVARRFGRPVGSSDARPWLDTVLPIAIAAFAIAPAAAYPFIGQISIAVTAAVFVLFFAGMSLSLGAGPYANEKVTLTLKAESRS |
| Ga0066710_1030473061 | 3300009012 | Grasslands Soil | TFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIAAFALLPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGVGPFANVKVGLATK |
| Ga0066710_1031819572 | 3300009012 | Grasslands Soil | SVVTGFLGIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPIAIAAFAVLPAALYPFLGTVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALVTR |
| Ga0066709_1002329652 | 3300009137 | Grasslands Soil | MALFTAGWLAAQRFGRPVGLTNKPPWLDTVLPVAIAAFALLPAALYPFIGSVSVAVGAAIIVLFFAGMSLSLGLGPYANVKVALARQ* |
| Ga0066709_1010867751 | 3300009137 | Grasslands Soil | RPWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSLR* |
| Ga0066709_1034810782 | 3300009137 | Grasslands Soil | IVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFLGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR* |
| Ga0075423_100525752 | 3300009162 | Populus Rhizosphere | LVGFVTFTAGWFVAKRFGRPIGLSNRAPWLDTVLPIAIAAFAVLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALGAS* |
| Ga0134111_103415831 | 3300010329 | Grasslands Soil | TVLPIAIALFALLPAALFPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS |
| Ga0134080_100784381 | 3300010333 | Grasslands Soil | VLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLIFAGMSLSLGLGPYANVKVALVTR* |
| Ga0134080_102051571 | 3300010333 | Grasslands Soil | NERPWLDTVLPIAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSLR* |
| Ga0137393_108637083 | 3300011271 | Vadose Zone Soil | AAHRFGQPIGLSNRPPWLDTVLPIAIAVFAILPAALYPFLGTLSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0137393_114591081 | 3300011271 | Vadose Zone Soil | PIGLSNRAPWLDTVLPIAIAAFAFLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTKA* |
| Ga0137388_105490231 | 3300012189 | Vadose Zone Soil | GLVAFTAGWFAAQRFGQPIGLSNRPPWLDTVLPIAIAVFAILPAALYPFLGTLSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALVTR* |
| Ga0137382_104936592 | 3300012200 | Vadose Zone Soil | TAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIALFALLPAALYPLMGTVSIAVAAAIVVLFFAGMSVSLGIGPYANVKVALSTKS* |
| Ga0137399_101432403 | 3300012203 | Vadose Zone Soil | GFLTFTAGWFAAQRFGRPVGLSNRRPWLDAVLPIAIAAFAILPAALFPFLGTVSIAIAAAIVVLFFAGMSLSLGLGPYSNVKVALGTGS* |
| Ga0137381_104078623 | 3300012207 | Vadose Zone Soil | VLPIAIALFALLPAALFPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0137376_108002951 | 3300012208 | Vadose Zone Soil | GLVMFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIAAFAILPAALYPFLGTVSIAVAAAILVLFFAGMSLSLGVGPYSNVKVALATK* |
| Ga0137376_109329121 | 3300012208 | Vadose Zone Soil | RFGRPIGLSNRRPWLDTVLPIAIAAFAVLPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTGS* |
| Ga0137377_106853572 | 3300012211 | Vadose Zone Soil | FGRPAGLNNERPWLDTVLPIAIATFAVLPGALYPFLGNVSLGIAAAIIVLFFAGISLSLGLGPYANVKVALVTR* |
| Ga0137371_102244501 | 3300012356 | Vadose Zone Soil | VGLSNRPPWLDTVLPIAIALFALLPAALYPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0137396_102565521 | 3300012918 | Vadose Zone Soil | RRFGRPVGTSDERPWLDTVLPIAIAAFAVAPAAAYPFLGQISIAVTAAVFVLFFAGMSLSLGAGPYANEKVALSLKLEPRS* |
| Ga0137394_109760892 | 3300012922 | Vadose Zone Soil | SNRLPWVDTALPIAIAGFAILPATLYPFLGTVSIAVAAAIVVLFFAGVSLSLGVGPYSNVKVSLVTK* |
| Ga0137419_104225172 | 3300012925 | Vadose Zone Soil | LPIAIAAFAILPAALFPFLGTVSIAIAAAIVVLFFAGMSLSLGLGPYSNVKVALGTGS* |
| Ga0137407_101795593 | 3300012930 | Vadose Zone Soil | GRPVGLSNRRPWLDAVLPIAIAAFAILPAALFPFLGTVSIAIAAAIVVLFFAGVSLSLGLGPYSNVKVALGTRT* |
| Ga0134077_103013361 | 3300012972 | Grasslands Soil | PTAFLGLVMFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIAGFAILPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTK* |
| Ga0134076_106183751 | 3300012976 | Grasslands Soil | PVGLSNRPPWLDTVLPIAIAGFAILPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTK* |
| Ga0134076_106185342 | 3300012976 | Grasslands Soil | LSNRRPWLDTALPIAIAAFAFLPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTR* |
| Ga0137418_1000161216 | 3300015241 | Vadose Zone Soil | MLAHPNPLYRQSFLAVLPIAIAAFAILPAALFPFLGTVSIAIAAAIVVLFFAGMSLSLGLGPYSNVKVALGTGT* |
| Ga0137409_105803872 | 3300015245 | Vadose Zone Soil | GRPVGLSNRRPWLDAVLPIAIAAFAILPAALFPFLGTVSIAIAAAIVVLFFAGMSLSLGLGPYSNVKVALGTGT* |
| Ga0134085_104506032 | 3300015359 | Grasslands Soil | QEGAVGTVFLGLLLFTAGWFAAQRFGRPVGISNRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK* |
| Ga0134083_105603612 | 3300017659 | Grasslands Soil | ATVFLGFVAFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIALFALLPAALYPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS |
| Ga0066655_106678393 | 3300018431 | Grasslands Soil | FVAGWIVARRFGHPVGSSDARPWLDTVLPIAIAAFAIAPAAAYPFVGQISIAVTAAVFVLFFAGMSLSLGAGPYSNEKVALSLKAESRS |
| Ga0066667_103883742 | 3300018433 | Grasslands Soil | SVGTGFLGIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSLR |
| Ga0066667_109843272 | 3300018433 | Grasslands Soil | SVGTGFLGIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPIAIATFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALVTR |
| Ga0066667_116254022 | 3300018433 | Grasslands Soil | FWREGSVGTVFLGMALFAAGWLAAQRFGRPVGLTNKPPWLDTVLPVAIAAFAVLPAALYPFIGSVSVAVGAAIIVLFFAGMSLSLGLGPYANVKVALARQ |
| Ga0066662_104733412 | 3300018468 | Grasslands Soil | RPWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGSYANVKVALSLR |
| Ga0066669_101068921 | 3300018482 | Grasslands Soil | IVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFVGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR |
| Ga0066669_104053581 | 3300018482 | Grasslands Soil | FGRPVGLSNRPPWLDTVLPIAIAGFAFLPAALYPFIGTVSIVAAAAIVVLFFAGMSLSLGLGPYSNVKVALSTKS |
| Ga0066669_109754642 | 3300018482 | Grasslands Soil | IVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFLGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR |
| Ga0193695_11349301 | 3300021418 | Soil | EGSAPTAFLGLGMFTAGWFAAQRFGRPVGLSNRPPWLDTVLPIAIAGFAILPAALYPFLGTVSIAVAAAILVLFFAGMSLSLGLGPYSNVKVALVTR |
| Ga0209688_10644592 | 3300026305 | Soil | MMGRPGGIAIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK |
| Ga0209761_13036271 | 3300026313 | Grasslands Soil | GLSNRLPWLDTVLPIAIAAFAILPAALYPFLGTVSIATAAAIVVLFFAGMSLSLGFGPYSNVKVALASR |
| Ga0209268_10645731 | 3300026314 | Soil | VTGFLGIALFTAGWFAAQRFGRPAGLNNERPWLDTVLPIAIATFAVLPAALYPFLGNVSLGIAAAIIVLFFAGMSLSLGLGPYANVKVALVTR |
| Ga0209154_12013702 | 3300026317 | Soil | DTVLPIAIAAFALLPAALYPFLGTVSIAVAAAIVVLFFAGMSVSLGVGPFANVKVGLATK |
| Ga0209472_10551622 | 3300026323 | Soil | NRPPWLDTVLPIAIAAFAVLPAALYPFLGNVSIAVAAAIIVLFFAGMSLSLGLGAYANVKVALGLK |
| Ga0209802_12038822 | 3300026328 | Soil | GWFVARRFGEPVGLSNRLPWLDTVLPIAIAAFAILPAALYPFLGTVSIATAAAIVVLFFAGMSLSLGFGPYSNVKVALASR |
| Ga0209806_13311651 | 3300026529 | Soil | ALFTAGWLAAQRFGRPVGLTNKPPWLDTVLPVAIAAFALLPAALYPFIGSVPVAVGAAIIVLFFAGMSLSLGLGPYANVKVALARQ |
| Ga0209058_11614721 | 3300026536 | Soil | PWLDTVLPMAIAAFAVLPAALYPFLGNVSLGIAAAIIVLIFAGMSLSLGLGSYANVKVALSPR |
| Ga0209161_104381111 | 3300026548 | Soil | ALFALLPAALFPLMGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS |
| Ga0209474_106926971 | 3300026550 | Soil | AWCAYVFLTEGGALSDLAGIVFFAAGWVVARRFGRPVGLSGARPWLDTVLPIALAAFAVAPAALYPFVGQIAIAPTAAVFVLFFAGMSFSLGLGPYANVKVALAR |
| Ga0209814_104192062 | 3300027873 | Populus Rhizosphere | MAVRVLVAKRFGRPIGLSNRAPWLDTVLPIAIAAFAVLPAALYPFMGTVSIAVAAAIVVLFFAGMSLSLGLGPYA |
| Ga0137415_111542511 | 3300028536 | Vadose Zone Soil | AQRFGRPVGLSNRPPWVDTVLPIAIAGFAILPAALYPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALSTKS |
| Ga0307311_100869731 | 3300028716 | Soil | FSQEGSVPTVFLGFVTFSAGWFVAQRFGRPVGLSNRRPWLDTVLPIAIAGFAVLPAALYPFTGMVSIAVAAAIVVLFFAGMSLSLGLGPFANVKVALIGD |
| Ga0307298_101954252 | 3300028717 | Soil | SQEGSLPTVFLGFVAFTAGWFAAQRFGRPIGLSNRPPWLDTVLPIAIAAFAILPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTRG |
| Ga0307301_103309551 | 3300028719 | Soil | RRPWLDAVLPIAIAAFAILPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALSAGS |
| Ga0307320_100409541 | 3300028771 | Soil | SFTAGWFAAQRFGRPIGLSNRRPWLDAVLPIAIAAFAILPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALSTRS |
| Ga0307290_101363181 | 3300028791 | Soil | WFVAQRFGRPIGLSNRPPWLDTVLPIAIAGFAFLPAALYPFMGTVSIVAAAAIVVLFFAGMSLSLGLGPYSNVKVALSTRS |
| Ga0307296_101095662 | 3300028819 | Soil | TAGWFAAQRFGRPIGLSNRPPWLDTVLPIAIAAFAILPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTRG |
| Ga0307296_106251022 | 3300028819 | Soil | VFLGFVTFGAGWFVAQRFGRPVGLSNRRPWLDTVLPIAIVGFAVLPAALYPFTGTVSIAVAAAIVVLFFAGMSLSLGLGPFANVKVALIGE |
| Ga0307312_101482703 | 3300028828 | Soil | GLSNRPPWLDTVLPIAIAAFAILPAALFPFLGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALGTRG |
| Ga0307278_100850092 | 3300028878 | Soil | FLGFATFTAGWFVAQRFGRPIGLSNRAPWLDTVLPIAIAGFAVLPAALYPFIGTVSIAVAAAIVVLFFAGMSLSLGLGPYSNVKVALSTRS |
| Ga0307308_104412561 | 3300028884 | Soil | GLSNRPPWLDTVLPIAIAGFAFLPAALYPFMGTVSIVAAAAIVVLFFAGMSLSLGLGPYSNVKVALSTKS |
| Ga0307304_105268012 | 3300028885 | Soil | SVPTVFLGFVTFSAGWFVAQRFGRPVGLSNRRPWLDTVLPIAIAGFAVLPAALYPFTGMVSIAVAAAIVVLFFAGMSLSLGLGPFANVKVALIGD |
| Ga0307473_104142022 | 3300031820 | Hardwood Forest Soil | TAGWFAAQRFGQPVGLSNRRPWLDTVLPIAIAVFAILPAALYPFLGTLSIAVAAAIVVLFFAGMSLSLGLGPYANVKVALVTR |
| ⦗Top⦘ |