| Basic Information | |
|---|---|
| Family ID | F074687 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 119 |
| Average Sequence Length | 77 residues |
| Representative Sequence | RKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASSD |
| Number of Associated Samples | 78 |
| Number of Associated Scaffolds | 119 |

| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 5.88 % |
| % of genes near scaffold ends (potentially truncated) | 88.24 % |
| % of genes from short scaffolds (< 2000 bp) | 79.83 % |
| Associated GOLD sequencing projects | 65 |
| AlphaFold2 3D model prediction | No |

| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (85.714 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |

| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (42.857 % of family members) |
| Environment Ontology (ENVO) | Unclassified (68.067 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (70.588 % of family members) |

| Predicted Topology & Secondary Structure | |
|---|---|
| Classification | Globular |
| Signal Peptide | No |
| Secondary Structure distribution | α-helix: 56.00%, β-sheet: 0.00%, Coil/Unstructured: 44.00% |
| Pfam ID | Name | % Frequency in 119 Family Scaffolds |
|---|---|---|
| PF07690 | MFS_1 | 26.05 |
| PF02867 | Ribonuc_red_lgC | 4.20 |
| PF01638 | HxlR | 3.36 |
| PF12637 | TSCPD | 2.52 |
| PF00583 | Acetyltransf_1 | 1.68 |
| PF06745 | ATPase | 1.68 |
| PF14947 | HTH_45 | 0.84 |
| PF08471 | Ribonuc_red_2_N | 0.84 |
| PF13673 | Acetyltransf_10 | 0.84 |
| PF01596 | Methyltransf_3 | 0.84 |
| PF01209 | Ubie_methyltran | 0.84 |

| COG ID | Name | Functional Category | % Frequency in 119 Family Scaffolds |
|---|---|---|---|
| COG0209 | Ribonucleotide reductase alpha subunit | Nucleotide transport and metabolism [F] | 5.04 |
| COG1733 | DNA-binding transcriptional regulator, HxlR family | Transcription [K] | 3.36 |
| COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 0.84 |
| COG2227 | 2-polyprenyl-3-methyl-5-hydroxy-6-methoxy-1,4-benzoquinol methylase | Coenzyme transport and metabolism [H] | 0.84 |
| COG2518 | Protein-L-isoaspartate O-methyltransferase | Posttranslational modification, protein turnover, chaperones [O] | 0.84 |
| COG4122 | tRNA 5-hydroxyU34 O-methylase TrmR/YrrM | Translation, ribosomal structure and biogenesis [J] | 0.84 |
| COG4123 | tRNA1(Val) A37 N6-methylase TrmN6 | Translation, ribosomal structure and biogenesis [J] | 0.84 |

| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 85.71 % |
| All Organisms | root | All Organisms | 14.29 % |

| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 42.86% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 26.89% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 20.17% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.04% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.52% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.84% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.84% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.84% |
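
As a rough cross-check on the habitat table above, the percentages can be converted back to member counts. The sketch below assumes each percentage is a share of the family's 119 sequences (the "Number of Sequences" value). The labels, variable names, and the `percent_to_count` helper are illustrative only and do not come from the database.

```python
# Assumption: every percentage in the habitat table is a fraction of the
# 119 family members listed under "Number of Sequences".
FAMILY_SIZE = 119

# (label, percent of family members), copied from the habitat table.
# Labels are shortened by hand to keep the rows distinguishable.
habitat_percent = [
    ("Soil (Grasslands)", 42.86),
    ("Vadose Zone Soil", 26.89),
    ("Grasslands Soil (Grasslands)", 20.17),
    ("Grasslands Soil (Unclassified)", 5.04),
    ("Hardwood Forest Soil", 2.52),
    ("Soil (Unclassified)", 0.84),
    ("Soil (Forest Soil)", 0.84),
    ("Forest Soil (Loam)", 0.84),
]


def percent_to_count(percent, total=FAMILY_SIZE):
    """Convert a rounded percentage back to the nearest whole member count."""
    return round(percent / 100 * total)


counts = [(label, percent_to_count(pct)) for label, pct in habitat_percent]
total_count = sum(count for _, count in counts)

for label, count in counts:
    print(f"{label}: {count}")
print(f"Total members accounted for: {total_count}")
```

If the assumption holds, the per-habitat counts should add up to the family size, which is a quick way to spot a row that was lost or duplicated during extraction.
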

| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental | Open in IMG/M |
| 3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental | Open in IMG/M |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental | Open in IMG/M |
| 3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental | Open in IMG/M |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M |
| 3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental | Open in IMG/M |
| 3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M |
| 3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental | Open in IMG/M |
| 3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental | Open in IMG/M |
| 3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M |
| 3300012386 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental | Open in IMG/M |
| 3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental | Open in IMG/M |
| 3300026295 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026307 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes) | Environmental | Open in IMG/M |
| 3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental | Open in IMG/M |
| 3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
| 3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental | Open in IMG/M |
| 3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental | Open in IMG/M |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental | Open in IMG/M |
| 3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
| 3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental | Open in IMG/M |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental | Open in IMG/M |
| 3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
| 3300026548 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes) | Environmental | Open in IMG/M |
| 3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental | Open in IMG/M |
| 3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental | Open in IMG/M |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |

| Protein ID | Sample Taxon ID | Habitat | Sequence |
|---|---|---|---|
| JGI12053J15887_105089023 | 3300001661 | Forest Soil | LRKKVGFWVSVVIGMARQTDQLFGATEYVCVAHKGLKMVTVPVSTRRSLGLSLDRSADPDKIIPKIIARFDLRTGV* |
| JGI25385J37094_100668652 | 3300002558 | Grasslands Soil | LSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| JGI25385J37094_101383481 | 3300002558 | Grasslands Soil | TARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASSD* |
| JGI25383J37093_100041794 | 3300002560 | Grasslands Soil | SIVTEMARQTDELFGRTESVLITHKGLKMVTIPXSARRSLGLSLDRSAXTEKIILKIMTKFDLAPSD* |
| JGI25383J37093_101340041 | 3300002560 | Grasslands Soil | EMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKVMTKFDLGSSD* |
| JGI25383J37093_101447561 | 3300002560 | Grasslands Soil | RTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRPADTEKIIQKIMTKFDLLSRD* |
| JGI25384J37096_100702172 | 3300002561 | Grasslands Soil | NVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| JGI25384J37096_100988912 | 3300002561 | Grasslands Soil | KKIGFWVSIITEMARQTDNLFGSTESICVTHKGLKMVTVPLSARRSLGLSLDRSADSEKIILKITAKFSLGPTV* |
| JGI25384J37096_101536981 | 3300002561 | Grasslands Soil | ETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRPADTEKIIQKIMTKFDLLSRD* |
| JGI25382J43887_101537742 | 3300002908 | Grasslands Soil | TKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGSTEYVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIITKFGLGSSE* |
| JGI25382J43887_101664391 | 3300002908 | Grasslands Soil | NVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLAPTD* |
| JGI25382J43887_103818181 | 3300002908 | Grasslands Soil | LFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASSD* |
| JGI25390J43892_100010831 | 3300002911 | Grasslands Soil | NVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD* |
| JGI25390J43892_100015031 | 3300002911 | Grasslands Soil | LSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKVMTKFDLGSSD* |
| JGI25390J43892_100165962 | 3300002911 | Grasslands Soil | LSRSIDLRKKIGFWVSIITEMARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRAADTEKLILKIMTKFDLASSN* |
| Ga0066674_100723251 | 3300005166 | Soil | RQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0066674_103708981 | 3300005166 | Soil | WSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0066674_103978021 | 3300005166 | Soil | RQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD |
| Ga0066672_105862502 | 3300005167 | Soil | EVTGYAETPRTKNVLSQSIDLRKKIGFWSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV* |
| Ga0066683_100727681 | 3300005172 | Soil | ETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKVMTKFDLGSSD* |
| Ga0066680_100834792 | 3300005174 | Soil | YAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLASTD* |
| Ga0066680_103751382 | 3300005174 | Soil | SDPAIRVAAFIEGAEVTGYAETARTKNVLSQSIDLRKKIGFWVSIITEMARQTDELFGSTEYVLMTHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLVSSN* |
| Ga0066679_101479221 | 3300005176 | Soil | WSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSDWTSSNQV* |
| Ga0066679_104639012 | 3300005176 | Soil | WSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV* |
| Ga0066688_101150292 | 3300005178 | Soil | VLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLSLSLDRSADTEKIILKIMTKFDLASTD* |
| Ga0066676_102191133 | 3300005186 | Soil | SIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKVMTKFDLGSSD* |
| Ga0066689_106289041 | 3300005447 | Soil | WSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0066689_107219242 | 3300005447 | Soil | GRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV* |
| Ga0066682_100339073 | 3300005450 | Soil | WSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD* |
| Ga0066682_102206091 | 3300005450 | Soil | ELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0066701_101685101 | 3300005552 | Soil | ESDPSIRVAAFIEGAEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0066692_104320071 | 3300005555 | Soil | WSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0066704_108992222 | 3300005557 | Soil | GFWSSIVTEMGRQTDELFGNTEYVLVTHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV* |
| Ga0066705_105130601 | 3300005569 | Soil | QSGDLQKKIGFWVSVVTEMARQMDQLFGTTESVCVTHKGLRMVTVPVSGRRSLGLSLDRSADPDKMILKIMTKFDLASSN* |
| Ga0066705_108958271 | 3300005569 | Soil | EMARQMDQLFGTTESVCVTHKGLRMVTVPVSGRRSLGLSLDRSADADKTILKIMTKFDLLSRD* |
| Ga0066691_100666642 | 3300005586 | Soil | VTGYAETPRTKNVLSQSIDLRKKIGFWSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSDWTSSNQV* |
| Ga0066691_106552582 | 3300005586 | Soil | DLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0066653_107803422 | 3300006791 | Soil | RKRIGFWSSIVTEMARQTDELFGRTECVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0066665_107062681 | 3300006796 | Soil | IGFWVSVVTEMARQMDQLFGTTESVCVTHKGLRMVTVPISGRRSLGLSLDRSADPDKTVLKIMTKFDLGSSN* |
| Ga0066659_106853612 | 3300006797 | Soil | ARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD* |
| Ga0066659_110663581 | 3300006797 | Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIIIKFDLASTD* |
| Ga0066659_111546931 | 3300006797 | Soil | ARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0066660_101413052 | 3300006800 | Soil | FWSSIVTEMGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV* |
| Ga0099793_100168261 | 3300007258 | Vadose Zone Soil | GFWVSIITEMARQTDNLFGSTESICVTHKGLKMVTVPLSARRSLGLSLDRSADTEKIILKITAKFSLGPTV* |
| Ga0066710_1003969592 | 3300009012 | Grasslands Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0066710_1048287482 | 3300009012 | Grasslands Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD |
| Ga0099829_100518154 | 3300009038 | Vadose Zone Soil | RQTDQMFGSTEYVCFTHKGLKMVSVPLSTRRSLGLSLDRSADPDKIIPKIMAKFNLRTGV |
| Ga0066709_1043645271 | 3300009137 | Grasslands Soil | SIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKRVTIPLSARRSLGLSLDRSADTEKIILKVMTKFDLGSSD* |
| Ga0134088_100222203 | 3300010304 | Grasslands Soil | MARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRAADTEKLILKIMTKFDLASSN* |
| Ga0134086_103881111 | 3300010323 | Grasslands Soil | IVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0134080_100533461 | 3300010333 | Grasslands Soil | SIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLVRSADTEKIILKIMTKFDLASTD* |
| Ga0134071_100710904 | 3300010336 | Grasslands Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0137392_107366011 | 3300011269 | Vadose Zone Soil | SDLRKQVGFWVSIVTQMARQIDQMFGSTEYVCFTHKGLEMVSVPLSARRSLGLSLDRSADPDKIIPKIMARFDFGPSV* |
| Ga0137391_106293151 | 3300011270 | Vadose Zone Soil | SFAEAARTKNAFGQSSDLRKQVGFWVSIVTQMARQTDQIFGSTEYVCFTHKGLKMVSVPLSARRSLGLSLDRSADPDKIIPKIMAKFDFGPSV* |
| Ga0137393_105573182 | 3300011271 | Vadose Zone Soil | IVTQMARQTDQIFGSTEYVCFTHKGLKMVSVPLSARRSLGLSLDRSADPDKIIPKIMAKFDFGPSV* |
| Ga0137388_118445961 | 3300012189 | Vadose Zone Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSERRSLGLSLDRSADTEKIILKIMTKFDLASSN* |
| Ga0137364_100561451 | 3300012198 | Vadose Zone Soil | QTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD* |
| Ga0137364_112742951 | 3300012198 | Vadose Zone Soil | TDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0137383_106374541 | 3300012199 | Vadose Zone Soil | EVTGYAETVRTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRTADTEKIILKIMTKFDLASSN* |
| Ga0137383_110056291 | 3300012199 | Vadose Zone Soil | EVTGYAETVRTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTH* |
| Ga0137383_112611751 | 3300012199 | Vadose Zone Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0137380_102421942 | 3300012206 | Vadose Zone Soil | VLSQSSDLRKKVGFWASIVTEMDRQTDNLFGSTEYICITHKGLKMVTIPLSSRRSLGLSLDRSADPDNIILKILRKFDLTSSD* |
| Ga0137380_105495921 | 3300012206 | Vadose Zone Soil | KRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFGLGSSD* |
| Ga0137380_107737801 | 3300012206 | Vadose Zone Soil | EAARTKNAFSQSSDLRKQVGFWSSIVTQMARQTDQIFGSTGYVCFTHKGLKMVSVPLSARRSLGLSLDRSADPDKIIPKIMAKFDFGPSV* |
| Ga0137376_108264531 | 3300012208 | Vadose Zone Soil | RKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASSD* |
| Ga0137379_117557352 | 3300012209 | Vadose Zone Soil | DLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSTD* |
| Ga0137370_107900151 | 3300012285 | Vadose Zone Soil | AFIQGAEVTGYAETVRTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASSN* |
| Ga0137387_100149642 | 3300012349 | Vadose Zone Soil | MDRQTDNLFGSTEYICITHKGLKMVTIPISSRRSLGLSLDRSADPDNIILKILRKFDLTSSD* |
| Ga0137387_110485911 | 3300012349 | Vadose Zone Soil | SDAAIRVAAFIQGAEVTGYAETVRTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRTADTEKIILKIMTKFDLASSN* |
| Ga0137372_1000434517 | 3300012350 | Vadose Zone Soil | MARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD* |
| Ga0137386_102123282 | 3300012351 | Vadose Zone Soil | TVRTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRTADTEKIILKIMTKFDLASSN* |
| Ga0137386_109106242 | 3300012351 | Vadose Zone Soil | DLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKVILKIMTKFGLGSSD* |
| Ga0137386_109482982 | 3300012351 | Vadose Zone Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSTD* |
| Ga0137384_108369291 | 3300012357 | Vadose Zone Soil | GAEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD* |
| Ga0137361_109842372 | 3300012362 | Vadose Zone Soil | KNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRPADTEKIIQKIMTKFDLLSRD* |
| Ga0137361_113921021 | 3300012362 | Vadose Zone Soil | RQTDNLFGSTESICVTHKGLKMVTVPLSARRSLGLSLDRSADTEKIILKITAKFSLGPTV |
| Ga0137390_109815321 | 3300012363 | Vadose Zone Soil | IVTQMARQTDQMFGSTEYVCFTHKGLKMVSVPITSRRSLGLSLDRSADPDKIIMKIMAKFDLGPGV* |
| Ga0134046_10356121 | 3300012386 | Grasslands Soil | MARQTDELFGSTEYVLITHKGLKMVTIPLSARRSLGLSLDRTADTEKIILKIMTKFDLASSN* |
| Ga0137418_100393481 | 3300015241 | Vadose Zone Soil | VLSQSGDLRKKVGFWVSIVTEMARQTDQMFGDTEYVCVAHKGLKMVTVPVSAGRSLGLSLDRSADPDKIILKIMAKLDLRRSV* |
| Ga0134083_100866352 | 3300017659 | Grasslands Soil | ETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRAADTEKLILKIMTKFDLASSN |
| Ga0066667_100108234 | 3300018433 | Grasslands Soil | VLRQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLSLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0066662_119379962 | 3300018468 | Grasslands Soil | FWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0066662_122359051 | 3300018468 | Grasslands Soil | QSGDLQKKIGFWVSVVTEMARQMDQLFGTTESVCVTHKGLRMVTVPVSGRRSLGLSLDRSADPDKMILKIMTKFDLASSN |
| Ga0215015_101419963 | 3300021046 | Soil | MFGNTEYVCFTHKGLKMVSVPLSARKSLGLSLDRSADPDKIVLKIMTKFSLGPTV |
| Ga0210404_102565183 | 3300021088 | Soil | RQTDQLFGGTEYVCVAHKGLKMVTVPVSARRSLGLSLDRSADPDKIIPKIMAKFDLRTSV |
| Ga0209234_10029144 | 3300026295 | Grasslands Soil | MARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLAPTD |
| Ga0209236_11256572 | 3300026298 | Grasslands Soil | AIRVAAFIEGAEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLASSD |
| Ga0209469_10037107 | 3300026307 | Soil | DLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0209239_10855302 | 3300026310 | Grasslands Soil | VLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLAPTD |
| Ga0209761_10354053 | 3300026313 | Grasslands Soil | AFIEGAEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLASSD |
| Ga0209471_11920881 | 3300026318 | Soil | TDELFGNTEYVLVTHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV |
| Ga0209471_12277451 | 3300026318 | Soil | GFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTISLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD |
| Ga0209267_11232471 | 3300026331 | Soil | VLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLSLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0209267_11676222 | 3300026331 | Soil | AEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTQMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIIIKFDLASTD |
| Ga0209803_10391901 | 3300026332 | Soil | RKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLSLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0209803_12177033 | 3300026332 | Soil | RQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD |
| Ga0209377_12582551 | 3300026334 | Soil | DPSIRVAAFIEGAEVTGYAETARTKNVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD |
| Ga0209377_13170401 | 3300026334 | Soil | GFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD |
| Ga0209159_11675351 | 3300026343 | Soil | ARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSS |
| Ga0209690_11841883 | 3300026524 | Soil | LFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLASTD |
| Ga0209806_11595992 | 3300026529 | Soil | SIDLRKKIGFWSSIVTEMGRQTDELFGNTEYVLVTHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV |
| Ga0209806_12114691 | 3300026529 | Soil | MGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKSDLG |
| Ga0209806_12156991 | 3300026529 | Soil | MGRQTDELFGNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSDWTSSNQV |
| Ga0209807_11608262 | 3300026530 | Soil | NVFGQSGDLQKKIGFWVSVVTEMARQMDQLFGTTESVCVTHKGLRMVTVPVSGRRSLGLSLDRSADPDKMILKIMTKFDLASSN |
| Ga0209160_10220081 | 3300026532 | Soil | SDPTIRVAAFIEGAEVTGYAETPRTKNVLSRSIDLRKKIGFWVSIITEMARQTDELFGSTEYVLITHKRLKMVTIPLSARRSLGLSLDRAADTEKLILKIMTKFDLASSN |
| Ga0209056_100532933 | 3300026538 | Soil | VLRQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFDLAPTD |
| Ga0209161_101470902 | 3300026548 | Soil | VLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSGRRSLGLSLDRSADTEKIILKIMTKFGLGSSD |
| Ga0209577_100149423 | 3300026552 | Soil | MARQMDQLFGTTESVCVTHKGLRMVTVPVSGRRSLGLSLDRSADPDKMILKIMTKFDLASSN |
| Ga0209588_11038051 | 3300027671 | Vadose Zone Soil | MARQTDQLFGGTEYVCVAHEGLKMVTVPVSARRSLGLSLDRSADPDKIIPMIKAKFDLRRSV |
| Ga0209689_10777193 | 3300027748 | Soil | MARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLGSSD |
| Ga0209689_11019982 | 3300027748 | Soil | GNTEYVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLAPTD |
| Ga0209689_13309502 | 3300027748 | Soil | GNTEYVLITHKGLKMVTIPLSARRSLGLSLDRLADTEKIILKIMTKFVLGSNDWTSSNQV |
| Ga0209689_13550552 | 3300027748 | Soil | MARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIITKFDLASTD |
| Ga0209701_100682462 | 3300027862 | Vadose Zone Soil | QSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLRSND |
| Ga0209283_100298531 | 3300027875 | Vadose Zone Soil | NVLSQSIDLRKRIGFWSSIVTEMARQTDELFGRTESVLITHKGLKMVTIPLSARRSLGLSLDRSADTEKIILKIMTKFDLRSND |
| Ga0209283_106783842 | 3300027875 | Vadose Zone Soil | FWSSIVTQMARQTDQIFGSTEYVCFTHKGLKMVSVPLSARRSLGLSLDRSADPDKIIPKIMAKFDFGPSV |
| Ga0307479_114986391 | 3300031962 | Hardwood Forest Soil | TDIARQTDNLFGNTESICVTHKGLRMVTVPLSARKSLGLSLDRAADPDKIILKIMTRFSLGSAF |
| Ga0307479_117247942 | 3300031962 | Hardwood Forest Soil | LSQSGDLRKKIGFWVSVVTDMARETDKLFGSTESICVTHKGLRMVTAPLSARRFLGLSLDRSADPDKILQKIMTSFRLGSTL |
| Ga0307472_1001863591 | 3300032205 | Hardwood Forest Soil | EVTGYAESLRTNKVLSQSGDLRKKIGFWVSVVTDMAGETDQLFGSTESICVTHKGLRMVTAPIAARRYLGLSLDRSADPDKILLKIMTNFRLASTA |