| Basic Information | |
|---|---|
| Family ID | F094191 |
| Family Type | Metagenome |
| Number of Sequences | 106 |
| Average Sequence Length | 165 residues |
| Representative Sequence | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP |
| Number of Associated Samples | 99 |
| Number of Associated Scaffolds | 106 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 94 |
| AlphaFold2 3D model prediction | No |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (17.924 % of family members) |
| Environment Ontology (ENVO) | Unclassified (26.415 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (42.453 % of family members) |
| Predicted Topology & Secondary Structure | |
|---|---|
| Classification | Globular |
| Signal Peptide | No |
| Secondary Structure distribution | α-helix: 68.15%, β-sheet: 0.00%, Coil/Unstructured: 31.85% |
| Pfam ID | Name | % Frequency in 106 Family Scaffolds |
|---|---|---|
| PF03949 | Malic_M | 25.47 |
| PF05494 | MlaC | 5.66 |
| PF04909 | Amidohydro_2 | 4.72 |
| PF02518 | HATPase_c | 3.77 |
| PF12697 | Abhydrolase_6 | 1.89 |
| PF13291 | ACT_4 | 0.94 |
| PF01609 | DDE_Tnp_1 | 0.94 |
| PF02129 | Peptidase_S15 | 0.94 |
| COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds |
|---|---|---|---|
| COG0281 | Malic enzyme | Energy production and conversion [C] | 25.47 |
| COG0686 | Alanine dehydrogenase (includes sporulation protein SpoVN) | Amino acid transport and metabolism [E] | 25.47 |
| COG2854 | Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter Mla | Cell wall/membrane/envelope biogenesis [M] | 5.66 |
| COG3039 | Transposase and inactivated derivatives, IS5 family | Mobilome: prophages, transposons [X] | 0.94 |
| COG3293 | Transposase | Mobilome: prophages, transposons [X] | 0.94 |
| COG3385 | IS4 transposase InsG | Mobilome: prophages, transposons [X] | 0.94 |
| COG5421 | Transposase | Mobilome: prophages, transposons [X] | 0.94 |
| COG5433 | Predicted transposase YbfD/YdcC associated with H repeats | Mobilome: prophages, transposons [X] | 0.94 |
| COG5659 | SRSO17 transposase | Mobilome: prophages, transposons [X] | 0.94 |
| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 100.00 % |
Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Habitat | Taxonomy | Distribution |
|---|---|---|
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 17.92% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 16.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.38% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 9.43% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 7.55% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 7.55% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.72% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.72% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.77% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.83% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.89% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.89% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.89% |
| Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 1.89% |
| Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.89% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.94% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.94% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.94% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.94% |
| Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.94% |
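
The Distribution percentages in the habitat table can be converted back into approximate member counts. The sketch below assumes each percentage is a share of the 106 family sequences listed under Basic Information; that denominator is inferred from the page, and the names `FAMILY_SIZE`, `habitat_percentages`, and `percent_to_count` are illustrative only.

```python
# Assumption: each Distribution value is a percentage of the 106 family members
# ("Number of Sequences" in the Basic Information table).
FAMILY_SIZE = 106

# Distribution column of the habitat table, in row order.
habitat_percentages = [
    17.92, 16.98, 10.38, 9.43, 7.55, 7.55, 4.72, 4.72, 3.77, 2.83,
    1.89, 1.89, 1.89, 1.89, 1.89,
    0.94, 0.94, 0.94, 0.94, 0.94,
]


def percent_to_count(percent, total=FAMILY_SIZE):
    """Convert a rounded percentage back to the nearest whole member count."""
    return round(percent * total / 100)


habitat_counts = [percent_to_count(p) for p in habitat_percentages]
total_members = sum(habitat_counts)

print(habitat_counts)
print(total_members)
```

Under this assumption the per-habitat counts add up to the full family size, which suggests each member is assigned to exactly one habitat row.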
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental | Open in IMG/M |
| 3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental | Open in IMG/M |
| 3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental | Open in IMG/M |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M |
| 3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental | Open in IMG/M |
| 3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental | Open in IMG/M |
| 3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental | Open in IMG/M |
| 3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated | Open in IMG/M |
| 3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated | Open in IMG/M |
| 3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental | Open in IMG/M |
| 3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
| 3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental | Open in IMG/M |
| 3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental | Open in IMG/M |
| 3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental | Open in IMG/M |
| 3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental | Open in IMG/M |
| 3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental | Open in IMG/M |
| 3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental | Open in IMG/M |
| 3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental | Open in IMG/M |
| 3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental | Open in IMG/M |
| 3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental | Open in IMG/M |
| 3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental | Open in IMG/M |
| 3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental | Open in IMG/M |
| 3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental | Open in IMG/M |
| 3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental | Open in IMG/M |
| 3300021081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo | Environmental | Open in IMG/M |
| 3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental | Open in IMG/M |
| 3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental | Open in IMG/M |
| 3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental | Open in IMG/M |
| 3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental | Open in IMG/M |
| 3300021404 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O | Environmental | Open in IMG/M |
| 3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental | Open in IMG/M |
| 3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental | Open in IMG/M |
| 3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental | Open in IMG/M |
| 3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental | Open in IMG/M |
| 3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental | Open in IMG/M |
| 3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental | Open in IMG/M |
| 3300026494 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-A | Environmental | Open in IMG/M |
| 3300026497 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B | Environmental | Open in IMG/M |
| 3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental | Open in IMG/M |
| 3300026499 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-B | Environmental | Open in IMG/M |
| 3300026507 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-B | Environmental | Open in IMG/M |
| 3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental | Open in IMG/M |
| 3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental | Open in IMG/M |
| 3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
| 3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental | Open in IMG/M |
| 3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental | Open in IMG/M |
| 3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental | Open in IMG/M |
| 3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental | Open in IMG/M |
| 3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental | Open in IMG/M |
| 3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental | Open in IMG/M |
| 3300028796 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141 | Environmental | Open in IMG/M |
| 3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental | Open in IMG/M |
| 3300028881 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_116 | Environmental | Open in IMG/M |
| 3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental | Open in IMG/M |
| 3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental | Open in IMG/M |
| 3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental | Open in IMG/M |
| 3300031716 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3 | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental | Open in IMG/M |
| 3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
| 3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental | Open in IMG/M |
| 3300033502 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fraction | Environmental | Open in IMG/M |
Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
| Protein ID | Sample Taxon ID | Habitat | Sequence |
|---|---|---|---|
| JGI25612J43240_10150231 | 3300002886 | Grasslands Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSLRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLAR |
| Ga0066680_103645042 | 3300005174 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPR |
| Ga0066685_111216911 | 3300005180 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHA |
| Ga0066388_1003496841 | 3300005332 | Tropical Forest Soil | MKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGINVNQAVFLERWFFGAPAATATPASLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRTDQSTCQPAATWIVTANSRWIVLADPATHAVLLPRLTSPPPESGDRLTWWRPLARADVAGLGITG |
| Ga0066681_104799931 | 3300005451 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKHAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRAGAGPTSFEVARTDPRRRSSRSSSPWT |
| Ga0070698_1005039681 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVTAVAERALAADDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADASVSRHAAILVGRFNPAAVNTYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTVTREWIVMA |
| Ga0070695_1005798661 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKALAALAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLADPAS |
| Ga0066704_105893541 | 3300005557 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPAPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAGPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVVTVAREWIVMADPISQPPLVARLAG |
| Ga0066708_101152821 | 3300005576 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHD |
| Ga0075024_1008287561 | 3300006047 | Watersheds | RTIVDMKRALAALLGVLIVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPAINAYLTRELQATPRAGAGPASYEVVRTNPTTCQPDATW |
| Ga0070712_1015126242 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKALAALAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTREL |
| Ga0066653_101085102 | 3300006791 | Soil | MIVMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYAL |
| Ga0075425_1020424911 | 3300006854 | Populus Rhizosphere | MKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP |
| Ga0075434_1007459232 | 3300006871 | Populus Rhizosphere | MIGMKKVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAEGALVTEDLLLLGGINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGSGP |
| Ga0075424_1008877571 | 3300006904 | Populus Rhizosphere | MIGMKKVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAEGALVTEDLLLLGGINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGSGPASFEVARTDPATCQPGATWVVTATAEWIVL |
| Ga0075424_1014926572 | 3300006904 | Populus Rhizosphere | MKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHIRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP |
| Ga0079219_116228941 | 3300006954 | Agricultural Soil | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGPRHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDP |
| Ga0079219_120152821 | 3300006954 | Agricultural Soil | RPLLRPSEVTAVMERVLATDDLLLMGAVNVKQAVFLEKWFLGVPRATPVVDPPAPAPSVADRTLLDHLRAAGVDPRHDVDGALYALYPTDGPAARHAAVLVGRFNPATVNAYLTRELGATPRPGPGPASYQVTRIDPATCQPGAAWLVTVSREWIVMADPVSHPMLLARVASPPAGSPERLAW |
| Ga0066710_1013735261 | 3300009012 | Grasslands Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFEVARTDPATCQPGATW |
| Ga0099829_102326452 | 3300009038 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAIAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQVVPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTC |
| Ga0099829_103186942 | 3300009038 | Vadose Zone Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRLGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLVAR |
| Ga0099829_108972192 | 3300009038 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTC |
| Ga0099830_100974432 | 3300009088 | Vadose Zone Soil | MKKVLAVVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDP |
| Ga0099827_101273553 | 3300009090 | Vadose Zone Soil | MIGMKRVVAAVLCVLVALTIALYLLVVRPLLRPPDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHALYALYPASAEGARHAVILVGRFNPATINAYLTRELKATLRAGAGPASFEV |
| Ga0114129_106149091 | 3300009147 | Populus Rhizosphere | MKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHIRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLA |
| Ga0126373_112624352 | 3300010048 | Tropical Forest Soil | MKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGIKVNQAVFLERWFFGAPAAAATPASLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRT |
| Ga0126376_122611101 | 3300010359 | Tropical Forest Soil | EDVAVAVDDHGGTLPYPAEITSGKPRLSLTIEAMKRALVALLVVLVVLGATLYFFVVRPLLRPSEMTAVAENALATDDLLLLGGINVNQAVFLERWFFGAPAATATPVSLPAAADRSLIEHLRAAGVDPRHDVDYVLYALYPAAEATRHAVVLVGRFNPNAVNGYLTRELAATPRAPAGPASYAVVRTDPSTCQP |
| Ga0137389_101268593 | 3300012096 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLQAKPRAGAGSASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADP |
| Ga0137380_107265082 | 3300012206 | Vadose Zone Soil | MKKVLAAVVAVLVVLGAGLYFFVARLLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPAPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAGPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASF |
| Ga0137378_113756471 | 3300012210 | Vadose Zone Soil | MIGMKRVVAAVLCALVALTIALYLFVVRPLLRSSDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQGTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHALYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRAGAGPASFEVARTDPSTCQPGATWVVTATAEWIVLADPSSHPALLARFA |
| Ga0137375_104772791 | 3300012360 | Vadose Zone Soil | MKRVLAALLCGLIVLGAALYLFVVRPLLRPSELTAIAESALATEDLLVLGGINVRQAVFLERWFQGTPRVPPAQTVSTPTAVADRTFLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTALNAYLTRDLQAKPRAGAGPASYEVVRTDSTTCQPAAPWIVTVAPEWIVLADPASHTPLLSR |
| Ga0137361_107674381 | 3300012362 | Vadose Zone Soil | MKKVLAAVLAVLVMLGAGVYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRALLDHLRAAGVDPRHDVDHVLYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTATREWVVMADPISQPTLIARLIGAPAATPE |
| Ga0137390_112169211 | 3300012363 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGSASYE |
| Ga0137358_101290722 | 3300012582 | Vadose Zone Soil | MKKVLAAAIALLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDLDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPR |
| Ga0137404_106069161 | 3300012929 | Vadose Zone Soil | MKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTVPAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPAENKEQLGWWR |
| Ga0137403_1000622012 | 3300015264 | Vadose Zone Soil | MKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTAPAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPA* |
| Ga0134085_100933491 | 3300015359 | Grasslands Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHA |
| Ga0134069_12984801 | 3300017654 | Grasslands Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLT |
| Ga0187821_101528031 | 3300017936 | Freshwater Sediment | MRKALVGLVAALVVLVAALYFFVARPLLRPSEVTAVMERALATDDLLLMAAVNVKQAVFLEKWFLGAPRATPVADPPAPASSVADRTLLDHLRAAGVDPRHDVDGALYALYPTDGPAARHAAVLVGRFNPATVNAYLTRELGATPRPGPGPASYQVTRIDPATCQPGAA |
| Ga0184608_105292451 | 3300018028 | Groundwater Sediment | GSIHRNRTTNHFGLNSVSRTIVGMKRALAALLCVLVVLGVALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPTAVADRTLLEHLRVAGVDARQDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATP |
| Ga0184621_102691891 | 3300018054 | Groundwater Sediment | MKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVVENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVATPAVADRSLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLRGRFNPTAINTYLTRDLHATPRAGVGPASYEVVRTDSTTCQPGAPWIVTVASEWIVLADPASHI |
| Ga0184619_103746771 | 3300018061 | Groundwater Sediment | MKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVAENAVATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPTAVADRSLFEHLRGAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGA |
| Ga0184632_100512761 | 3300018075 | Groundwater Sediment | LVVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFLGTPRGPTGHAVPTPAVVDRTLIEHLRVAGVDARHDVDYALYAVYPAAAEATRHALVLLGGFNPTAINAYLTRDLQATPRAGAGPASYPASYEVVRTDPTTCQPGAPSTTMAGPWRWLGPISPLPPGRSNG |
| Ga0184632_104681701 | 3300018075 | Groundwater Sediment | TAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAIYPAAAEATRHAVVLLGRFSPTAINAYLTRDLHATPRAGAGPASYEVVRPDSTTCQPGATWVVTVAPEWIVLADPASHTTLLPRLASPLPENKEQLGW |
| Ga0184609_101385631 | 3300018076 | Groundwater Sediment | MKRALAALLCVLVVLGASLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPAGQAVPTPAVADRTLFEHLRVAGVDARHDVDHALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVIRTDSTTCRPGATWIVTVAPEWIVLADPASHTALLPRLASP |
| Ga0184609_104330801 | 3300018076 | Groundwater Sediment | MKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVVENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAASTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPEWIALADPASHP |
| Ga0184612_104005081 | 3300018078 | Groundwater Sediment | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAVPPPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVLLLGRFNPTAINAYLTRDLHATPRAGAGPASFEVLRTDSTT |
| Ga0066667_106480542 | 3300018433 | Grasslands Soil | MIGMKRVVAAVLCALVALTIALYLLVIRPLLRPSDVTAVAEGALVTEDLLLLGSVNVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPR |
| Ga0193727_10263372 | 3300019886 | Soil | MRKVLAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLSRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHTTLLPRLASPPPENREELGWWRSLARADVAS |
| Ga0193743_10530021 | 3300019889 | Soil | MKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWLLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYPASYEVIRTDSTTCQPGAPWIVTVAPEWIVLADPASHTTLLPRLASPPPENKEQLGWWRPLARAD |
| Ga0193731_10401582 | 3300020001 | Soil | MKKVLAAVLAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYGLYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVV |
| Ga0193717_11824301 | 3300020060 | Soil | MKRALAALLGVLVVGGAALYLFVVRPLLRPADMTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPPISTVPTATTSAVADRTLFDHLRAAGVSTRHDVDHALYAVYPAAAESTRHAVVLLGRFNPAAINAYLTRELQAAPRAGAGPASYEVVRTDPTTCR |
| Ga0179594_100718151 | 3300020170 | Vadose Zone Soil | MKRALAALLCVLVVLGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFMGTPRVPTVQAVSAPAMADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAEPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASHPTLLPRLASPPAENKEQLGWWRALARADVASVGIMAPDRLETG |
| Ga0210407_106109472 | 3300020579 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTAAAESAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRP |
| Ga0210403_108260382 | 3300020580 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRIDPTTCRPSAAWLVTAAPEWIVLADPASHAILLPRFAGAS |
| Ga0210401_106875351 | 3300020583 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLMRELQATPHEGSGPASFEVTR |
| Ga0210379_103887162 | 3300021081 | Groundwater Sediment | MKRALAALLCVLVMLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPAGQAMPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTALNAYLTRELRATPRAGVGPASYEVVR |
| Ga0210404_102322101 | 3300021088 | Soil | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSGPASFDVTRVDPTTCRP |
| Ga0210400_102013731 | 3300021170 | Soil | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTAAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTHELGAVPRPGSGPASFDVTRVDPTTCRPGAAWVVTVSPEWIVLADPA |
| Ga0210408_111985511 | 3300021178 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTAAAESAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPH |
| Ga0193719_103929151 | 3300021344 | Soil | MTKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDPATCQPGASWVV |
| Ga0210389_109115991 | 3300021404 | Soil | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLKRWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSGP |
| Ga0210394_105952681 | 3300021420 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHERS |
| Ga0210409_111062561 | 3300021559 | Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTSTDPT |
| Ga0207684_114716251 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MKRALAALLGVLVVLGATLYLFVVRPLLRPSDVTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPRVSTVQAVPTPAVADRTLVDHLRAAGVDARQDVDYALYAVYPAAAETTRHAVVLLGRFNPTAINAYLTRELRAPPRAGA |
| Ga0207663_107034541 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLT |
| Ga0209266_11214032 | 3300026327 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFE |
| Ga0257179_10496001 | 3300026371 | Soil | LVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLVAR |
| Ga0257167_10818291 | 3300026376 | Soil | MKRALAALLCVLVVLGAVLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLQAKPRAGAGSASY |
| Ga0257177_10051671 | 3300026480 | Soil | MKRALAALLCVLVVLGATLYLFVVRPLLWPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPEWIVLADPASHPTLLLRLASPPAENKDQLAWWRA |
| Ga0257159_10025341 | 3300026494 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFLGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQPGASWVVTVAREWIVMADPISQPTLV |
| Ga0257164_10272201 | 3300026497 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAAPRPGP |
| Ga0257156_10065041 | 3300026498 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAA |
| Ga0257181_10606341 | 3300026499 | Soil | MKKVLAAAIALLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAIFLEKWFIGSPRVTPVAATPPPAVVDRSLLDHLRAAGVDPRHDVDHALYALYPAEPPVSRHAAILVGRFNPAAINAYLTRELAATPRPGPGPASFDVARIDPATCQP |
| Ga0257165_10047581 | 3300026507 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILIGRFNPAAVNAYLARELAAAPRPGPGPASFDVIRIDPATCQPGASWVVTVTRE |
| Ga0257168_10131112 | 3300026514 | Soil | MKKVLAAVVAVLVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLARELAATPRPGPGPASFDVIRIDPATCQPGASWVVTV |
| Ga0209376_13593971 | 3300026540 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVSEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVG |
| Ga0209805_10499172 | 3300026542 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAEGALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDHVLYALYPTSAEAARHAVILVGRFNPAAINTYLTRELKATPRTGAGPTSFEVARTDPATCQPGATWVVTATAEWIVLADPSSHPALLARFA |
| Ga0209577_102449741 | 3300026552 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQ |
| Ga0209217_10255631 | 3300027651 | Forest Soil | MKKALAVLAAALVVLGAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPAGAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRP |
| Ga0209689_12046122 | 3300027748 | Soil | MIGMKRVVAAVLCALVALTIALYLLVVRPLLRPSDVTAVAESALVTEDLLLLGSINVKQAAFLEKWLLGAPQVTAVRGEPALPVADRTLFDHLRAAGVDARHDLDQVLYALYPTSAEAARHAVILVGRFN |
| Ga0209180_103804402 | 3300027846 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAIAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTGQVVPTPAVADRTLFEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP |
| Ga0209701_102082172 | 3300027862 | Vadose Zone Soil | MKRVLAALLGVLVVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVPTPAVADRTLIEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPMAINAYLTRELQATPRAGAGPASYEVVRTDPTTCQPGATWIVTVAPEWI |
| Ga0209283_109289051 | 3300027875 | Vadose Zone Soil | MIGMKRVVAAVLSALVALTIALYLFVVRPLLRPSDVTVVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPAADRTLFDHLRAAGVDARRDVDHALYALYPASGEAARHAVIL |
| Ga0209068_100059181 | 3300027894 | Watersheds | MKKVLAAVVAALVVLGAGLYFFVARPLLRPSEVTAVAERALATDDLLLLAAINVKQAVFLEKWFLGSPRATPVAATPPPSVADRSLLDHLRAAGVNPRHDVDHALYALYPAEAPVSRHAAILVGRFNPAAINAYLTRELAATPR |
| Ga0209488_104593161 | 3300027903 | Vadose Zone Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTVQAVSTPAVAERTLLEHLRAAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAGAGPASYEVVRP |
| Ga0209583_102751122 | 3300027910 | Watersheds | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGVNVKQAAFLERWLLGRPPATTGVAAPAPGAADRTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLT |
| Ga0209069_107361041 | 3300027915 | Watersheds | RTIVDMKRALAALLGVLIVLGATLYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPAINAYLTRELQATPRAGAGPASYEVVRTNPTTCQPDATWIVTVAPEWIVLADPASHTALL |
| Ga0307504_100309692 | 3300028792 | Soil | MKRALAALLCVLAVLGATLYLFVVRPLLRPSDMTAVAESALATEDLLLLGGINVKQAVFLERWFFGTPRISTVQAVPTPAVADRTLIDHLRVAGVDARHDVDYALYAVYPAAETTRHAVVLLGRFNPPALNAYLTRELRATPRAGTGPASY |
| Ga0307287_101488731 | 3300028796 | Soil | MKRALAALLCVLVVVGAALYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWFLGTPRVPTAQAASTPAVADRTLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINTYLTRDLHATPRAGAGPASYEVVRTDSTTCQPGAPWIVTVAPGWIVLADPASH |
| Ga0307302_106750721 | 3300028814 | Soil | MKRALAALLCVLVVLGATLYLFVVRPLLRPSDLTAVAENALATEDLLLLGGINVKQAVFLERWLLGTPRVPTVQAVATPAVADRSLLEHLRVAGVDARHDVDYALYAVYPAAAEATRHAVVLLGRFNPTAINAYLTRDLHATPRAG |
| Ga0307277_103261522 | 3300028881 | Soil | MTKVLAAAIAVLVVLGAGLYFFVAGPLLRPSEVTAVAERALATDDLLLLAGINVKQAVFLEKWLLGSPRATPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADAAGSRHAAILVGRFNPAAVNAYLTRELAATPRPGPGPASFDVTRIDP |
| Ga0308309_115890511 | 3300028906 | Soil | PRTIGSMKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRPSAAWLVTAAPEWIVL |
| (restricted) Ga0255310_102251941 | 3300031197 | Sandy Soil | LETGRQETDAGRGTNPFGLMTPSRTMIGMKRVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLTRELKATPRAGS |
| (restricted) Ga0255312_11195971 | 3300031248 | Sandy Soil | MIDMKRVVAAVLCALVALTIALYLFVVRPLLRPSDVTAVAESALATEDLLLLGGINVKQAAFLEKWFLGAPQVAAVRGEPAPPVADRTLFDHLRAAGVDARHDVDHALYALYPASGEAARHAVILVGRFNPAAINAYLARELK |
| Ga0310813_105406321 | 3300031716 | Soil | MKKVLVAVVAALVVLVAALYFFVARPLLRPSEVTAVMERALATDDLLLVAAVNVKQAVFLEKWLIGTPRATPVADTPAPAVADRTVLDHLRAAGVDPRHDVDGALYALYPADGPAARHAAVLVGRFNPAAVNAYLTRELAATPRPGPGPA |
| Ga0307469_113173152 | 3300031720 | Hardwood Forest Soil | MKRALAALLGVLVLLGAALYLFVVRPLLRPVELTAAAESALATEDLLLLGGINVKQAVFLERWFLGSPRVSTGTAPPPAVADRALLDHLRAAGVDPRHDVDQALYAVYPAAESTRHAVVLIGRFNPTAIDAYLRREL |
| Ga0307468_1010212831 | 3300031740 | Hardwood Forest Soil | MKKALAVLAAAVVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGTRHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTCRPGAAWVVTVS |
| Ga0307473_100858982 | 3300031820 | Hardwood Forest Soil | MKKVLAAAIAVLVVLGAGLYFFVARPLLRPSEVSAVAERALAADDLLLLAGINVKQAVFLEKWLLGSPRVTPVAATPPPAVADRSLLDHLRAAGVDPRHDVDHALYALYPADASVSRHAAILVGRFNPAAVNTYLAR |
| Ga0307479_104193581 | 3300031962 | Hardwood Forest Soil | MKRAIVALCCALVVLGAAVYFFVARPLLRPSEVTAVAERAVATDDLVLLGGLNVKQAAFLERWLLGRPPATTGVAARAPGAAERTLLDHLRAAGVDARHDVDYALYALYPAAGETTRHAVVLIGRFNPGAVNAYLTRELQATPHEGSGPASFEVTRTDPTTCRPSAAWLVTAAPEWIVLADSASHAILLPRFAGASTESPEKLAWWRGLARADVASLGIPGLD |
| Ga0307470_117114381 | 3300032174 | Hardwood Forest Soil | MKKALAVLAAALVVLGGAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDV |
| Ga0307471_1006664762 | 3300032180 | Hardwood Forest Soil | VKRALAALLGVLVVLGATLYLFVVRPLLRPSDVTAAAESALATEDLLLLGGINVKQAVFLERWFLGTPRVSTVQAVPTPAVADRTLVDHLRAAGVDARQDVDYALYAVYPAGAETTRHAVVLLGRFNPTAINAYLTRELRATPRAGAGPASYEVVRTDPTTCQPGATWVVTVAPAWIVLADPASHTALLPRLASPPSE |
| Ga0307471_1037952461 | 3300032180 | Hardwood Forest Soil | QQIASVPRSRSRTIGGMKKALAVLAAALVVLVAAVYFFVARPLLRPSEVTAVAEHALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASFDVTRVDPTTC |
| Ga0307472_1003258341 | 3300032205 | Hardwood Forest Soil | MKKALAVLAAALVVLGVAVYFFVARPLLRPSEVTAVAERALAADDLVLLAAINVKQAVFLERWFLGSPRATTVAGAPLPAGPDRSLLDHLRAAGVDPRHDVDHALYANYPADAGGARHAAILVGRFNPAAINAYLTRELGAVPRPGSSPASF |
| Ga0326726_113362711 | 3300033433 | Peat Soil | MKKVLVAVVATLMVLVAALYFFVARPLLRPSEVTAVMERVLATDDLLLMAAVNVKQAVFLEKWFFGAPRATPVADTPAPAVADRTFLDHLRAAGVDPRHDVDGALYALYPTDGAAARHATILVGRFNPATVNAYLTRELGATPRPGPGRASYQVTRTDPATCQPGATWLVTVSREWIVMADPVSHPMLLARVASPPV |
| Ga0326731_10060511 | 3300033502 | Peat Soil | MKKVLVAVVATLVVLVAALYFFVARPLLRPSEVTAVMERVLATDDLLLMAAVNVKQAVFLEKWFFGAPRATPVADTPAPAVADRTFLDHLRAAGVDPRHDVDGALYALYPTDGPAARHATVLVGRFNPATVKAYLTRELG |