| Basic Information | |
|---|---|
| Family ID | F095829 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 105 |
| Average Sequence Length | 72 residues |
| Representative Sequence | MRCKDCQKRSNERATRPRPARRRKVATCERCGAPILPLPDLTLGAGDGTLAQLARFGLAITGDPLLELKREGI |
| Number of Associated Samples | 88 |
| Number of Associated Scaffolds | 105 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Bacteria |
| % of genes with valid RBS motifs | 66.02 % |
| % of genes near scaffold ends (potentially truncated) | 33.33 % |
| % of genes from short scaffolds (< 2000 bps) | 54.29 % |
| Associated GOLD sequencing projects | 79 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.26 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Bacteria (56.190 % of family members) |
| NCBI Taxonomy ID | 2 |
| Taxonomy | All Organisms → cellular organisms → Bacteria |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (33.333 % of family members) |
| Environment Ontology (ENVO) | Unclassified (51.429 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (58.095 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 17.82% β-sheet: 3.96% Coil/Unstructured: 78.22% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.26 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 105 Family Scaffolds |
|---|---|---|
| PF13522 | GATase_6 | 40.00 |
| PF00072 | Response_reg | 12.38 |
| PF00990 | GGDEF | 11.43 |
| PF00310 | GATase_2 | 5.71 |
| PF10531 | SLBB | 4.76 |
| PF02350 | Epimerase_2 | 3.81 |
| PF13614 | AAA_31 | 2.86 |
| PF13807 | GNVR | 1.90 |
| PF13692 | Glyco_trans_1_4 | 1.90 |
| PF01656 | CbiA | 0.95 |
| PF14361 | RsbRD_N | 0.95 |
| PF02954 | HTH_8 | 0.95 |
| COG ID | Name | Functional Category | % Frequency in 105 Family Scaffolds |
|---|---|---|---|
| COG0381 | UDP-N-acetylglucosamine 2-epimerase | Cell wall/membrane/envelope biogenesis [M] | 3.81 |
| COG0707 | UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase | Cell wall/membrane/envelope biogenesis [M] | 3.81 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| All Organisms | root | All Organisms | 56.19 % |
| Unclassified | root | N/A | 43.81 % |
| Visualization |
|---|
| Powered by ApexCharts |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 33.33% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 24.76% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 15.24% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 11.43% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.86% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.86% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.90% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.90% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.90% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.95% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.95% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.95% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.95% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental | Open in IMG/M |
| 3300002557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental | Open in IMG/M |
| 3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental | Open in IMG/M |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
| 3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental | Open in IMG/M |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental | Open in IMG/M |
| 3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental | Open in IMG/M |
| 3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental | Open in IMG/M |
| 3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated | Open in IMG/M |
| 3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental | Open in IMG/M |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental | Open in IMG/M |
| 3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
| 3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012380 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012385 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012388 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_24_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012395 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012403 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012406 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012410 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
| 3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental | Open in IMG/M |
| 3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300021151 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental | Open in IMG/M |
| 3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026300 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026306 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes) | Environmental | Open in IMG/M |
| 3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental | Open in IMG/M |
| 3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental | Open in IMG/M |
| 3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
| 3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental | Open in IMG/M |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental | Open in IMG/M |
| 3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental | Open in IMG/M |
| 3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental | Open in IMG/M |
| 3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental | Open in IMG/M |
| 3300027669 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental | Open in IMG/M |
| 3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI12053J15887_105642772 | 3300001661 | Forest Soil | MRCKDCMKRSTERTKRPRPTRRGKVANCERCGAPIIPLPDLTLGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| JGI25381J37097_10389471 | 3300002557 | Grasslands Soil | SMRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| JGI25385J37094_100332483 | 3300002558 | Grasslands Soil | MRCKDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| JGI25383J37093_100364642 | 3300002560 | Grasslands Soil | MRCKDCKKRTTERPKRPLPTRRRPVATCARCGAPILPLPDLTVGAGDGMLAELARFGLAITGDPLLELKREGI* |
| JGI25382J37095_100220342 | 3300002562 | Grasslands Soil | MRCKDCRKRTNERIQRPRASRRRPAAKCDMCGAPILPLPDQALGASDGTLAQLRRFGLAITGDPLLELKREGI* |
| JGI25382J43887_100178692 | 3300002908 | Grasslands Soil | MRCKDCKKRSNETSKRPQLSRRAKVARCERCGALILPLPDMTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0066674_100033744 | 3300005166 | Soil | MHCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0066674_100094881 | 3300005166 | Soil | PSMRCKACQKRSNETSKRPRLSRRAMVARCERCGALILPLPDVTLGAGDGTLAQLTRFGLAFTGDPLLELKREGG* |
| Ga0066672_100103202 | 3300005167 | Soil | MRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0066683_107273781 | 3300005172 | Soil | RDLRPPSMRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGG* |
| Ga0066680_103905631 | 3300005174 | Soil | MRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREG |
| Ga0066673_101542311 | 3300005175 | Soil | RPPSMRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGG* |
| Ga0066685_100496173 | 3300005180 | Soil | MRCKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAELARFGLAITGDPLLELKREGI* |
| Ga0066685_111218711 | 3300005180 | Soil | PSMRCKACQKRSNETSKRPRLSRRAMVARCERCGALILPLPDVTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0070705_1000011468 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MRCTDCKKRIKERAKRPRPPRKRPVAKCDCCGAPIVPLPDLTLGAADGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0066686_100427442 | 3300005446 | Soil | MRCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0066686_108758942 | 3300005446 | Soil | MRCKDCQKRTTERPKRPLPTRRRPVATCERCGAPMLPLPDLTVGAGDGTLAQLARFGLAITGDPLLE |
| Ga0066689_103602761 | 3300005447 | Soil | GRDLRPPSMRCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0066682_100033176 | 3300005450 | Soil | MRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGG* |
| Ga0066681_108822002 | 3300005451 | Soil | PSMHCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0066701_104441971 | 3300005552 | Soil | KRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0066701_107550741 | 3300005552 | Soil | GRDLRPPSMRCKDCKKRTTERPKRPLPTRRRPVATCARCGAPILPLPDLTVGAGDGMLAELARFGLAITGDPLLELKREGI* |
| Ga0066707_100168642 | 3300005556 | Soil | MRCKACQKRSNETSKRPRLSRRAMVARCERCGALILPLPDVTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0066706_101031323 | 3300005598 | Soil | MRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0066696_109061492 | 3300006032 | Soil | DLRPPSMRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0070712_1017678991 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | PGLKLVVRGRDLRPPSMRCKDCRKRTNERIQRPRNSRRRPAATCDRCGAPILPLPDDALGAGDGTLSHLKRFGLAINGDPLLELKREGI* |
| Ga0066658_100847902 | 3300006794 | Soil | LETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0079221_117947581 | 3300006804 | Agricultural Soil | MRCKDCRKRTNERIQRPHGSRRRRAATCDRCGAPILPLPDDALGAGDGTLAHLKRFGLAINGDPLLELKREGI* |
| Ga0075433_100135735 | 3300006852 | Populus Rhizosphere | MRCKDCEKRNNERAERPRPPRRRPAARCERCGAPIVPLPEVTPGGGDGTLAQLTRFGLAVTGDPLLELKRKGV* |
| Ga0075436_1008984432 | 3300006914 | Populus Rhizosphere | MRCKDCRKRSNERIQRPRKSRRHPAATCDMCGAPILPLPDDALGASDGTLAQLKRFGLAITGDPLLELKREGI* |
| Ga0099793_100738152 | 3300007258 | Vadose Zone Soil | MRCKDCRTRTDERVKRPRTSRRHPAAKCDVCGAPILPLPDLALGAGDGTLAQLKRFGLAITGDPLLELKREGI* |
| Ga0066710_1013698741 | 3300009012 | Grasslands Soil | MRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKR |
| Ga0099827_100008144 | 3300009090 | Vadose Zone Soil | MRCKDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLALPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0099827_101376252 | 3300009090 | Vadose Zone Soil | CQKRSNERGERPRPTRASRVAKCQRCGALILLQPDPMLGAGDGTMAQLTRFGLAITGDPLLELKREGT* |
| Ga0099792_101991922 | 3300009143 | Vadose Zone Soil | MRCKDCQKRSNERGERPRPTRASRVAKCQRCGALILLQPDPMLGAGDGTMAQLTRFGLAITGDPLLELKREGT* |
| Ga0134070_100063063 | 3300010301 | Grasslands Soil | MHCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAELARFGLAITGDPLLELKREGI* |
| Ga0134088_100019524 | 3300010304 | Grasslands Soil | MRCQDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0134080_103587982 | 3300010333 | Grasslands Soil | MHCKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAELARFGLAITGDPLLELKREGI* |
| Ga0137392_102243612 | 3300011269 | Vadose Zone Soil | MRCKDCRKRTDERIKRRHTSRRRLAARCDICGAPIFPLPDEALGASDGTLAQLKRFGLAITGDPLLELKREGT* |
| Ga0137391_101004222 | 3300011270 | Vadose Zone Soil | MRCKDCRKRTDERIKRRHTSRRRLAARCDICGAPIFPLPDEALGASDGTLAQLKRFGLAITGDPLLELKREGI* |
| Ga0137363_101603811 | 3300012202 | Vadose Zone Soil | MRCKDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDTALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0137399_100015492 | 3300012203 | Vadose Zone Soil | MRCRDCRTRTDERVKRPRTSRRHPAAKCDVCGAPILPLPDLALGAGDGTLAQLKRFGLAITGDPLLELKREGI* |
| Ga0137399_100017822 | 3300012203 | Vadose Zone Soil | MRCPDCKKRIKETAKRPRPLRQRPVAKCERCGAPILPLPDLTLGAADGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0137376_100274672 | 3300012208 | Vadose Zone Soil | MSKRPRPARRGTVARCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0137377_116581001 | 3300012211 | Vadose Zone Soil | KLEVQGRDLRPPSMRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0137370_100211883 | 3300012285 | Vadose Zone Soil | MYCKDCEKRSDETSKRPRSARRAKVATCERCGGVIIPLPDVGLAAGDGTLAQLARFGLAITGDPLLELKREGS* |
| Ga0137387_101661262 | 3300012349 | Vadose Zone Soil | MHCKACKKRTNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0137361_102379372 | 3300012362 | Vadose Zone Soil | MRCKDCKKRSNETSKRPRLSRRAKVARCERCGALILPLPDMTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR* |
| Ga0134047_11346654 | 3300012380 | Grasslands Soil | CKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0134023_10064351 | 3300012385 | Grasslands Soil | PSMRCKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0134031_11427752 | 3300012388 | Grasslands Soil | CKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAELARFGLAITGDPLLELKREGI* |
| Ga0134044_12288481 | 3300012395 | Grasslands Soil | MRCKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0134049_13292941 | 3300012403 | Grasslands Soil | RTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0134053_12540901 | 3300012406 | Grasslands Soil | SMRCQDCRIRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0134060_12101921 | 3300012410 | Grasslands Soil | PSMRCQDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0137396_104181052 | 3300012918 | Vadose Zone Soil | MRCKDCKKPTDERAKRPRAPRRRPVAKCERCGAPILPLADLTLGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0137394_109763051 | 3300012922 | Vadose Zone Soil | LGLKLEVQGRDLRPPSMRCKDCKKPTDERAKRPRAPRRRPVAKCERCGAPILPLADLTLGAGDGTLAQLARFGLAITGDPLLELKREGI* |
| Ga0137359_100208491 | 3300012923 | Vadose Zone Soil | IKRRHTSRRRLAARCDMCGAPIFPLPDEALGASDGTLAQLKRFGLAITGDPLLELKREGI |
| Ga0137359_112144821 | 3300012923 | Vadose Zone Soil | KRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLSRFGLAITGDPLLELKREGS* |
| Ga0137419_100064901 | 3300012925 | Vadose Zone Soil | ETAKRPRPLRQRPVAKCVRCGAPILPLPDLTLGAADGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0137404_100194712 | 3300012929 | Vadose Zone Soil | MRCKDCRKRSNEKIKRSRASRRRPAAMCDMCGAPILPLPDQALGANDGTLAELKRFGLAITGDPLLELKREGI* |
| Ga0137407_108056992 | 3300012930 | Vadose Zone Soil | MRCKDCKKRSLETSKRPRPARRGTVARCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGT* |
| Ga0134075_100740052 | 3300014154 | Grasslands Soil | MRCQDCRIRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI* |
| Ga0134078_101458912 | 3300014157 | Grasslands Soil | MRCKDCKKRTTERPKRPLRTRRRPVATCARCGAPVLPLPDLTVGAGDCTLAELARFGLAITGDPLLELKREGI* |
| Ga0137420_10484301 | 3300015054 | Vadose Zone Soil | VKRPRTSRRHPAAKCDVCGAPILPLPDLALGAGDGTLAQLKRFGLAITGDPLLELKREGI |
| Ga0066655_100320652 | 3300018431 | Grasslands Soil | MRCKDCKKRTTERPKRPLPTRRRSVATCARCGAPILPLPDLTVGAGDGTLAELARFGLAITGDPLLELKREGI |
| Ga0066667_100012684 | 3300018433 | Grasslands Soil | MRCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI |
| Ga0066662_100249033 | 3300018468 | Grasslands Soil | MRCKDCKKRTTERPKRPLPTRRRPVATCARCGAPILPLPDLTVGAGDGMLAELARFGLAITGDPLLELKREGI |
| Ga0066662_100264133 | 3300018468 | Grasslands Soil | MRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS |
| Ga0066669_100017704 | 3300018482 | Grasslands Soil | MRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGR |
| Ga0066669_101570102 | 3300018482 | Grasslands Soil | MYCKDCEKRSDETSKRPRSARRAKVATCERCGGVIIPLPDVGLAAGDGTLAQLARFGLAITGDPLLELKREGS |
| Ga0210382_100041965 | 3300021080 | Groundwater Sediment | MLCKDCMRHISERTQRRRPGRRRPVGTCERCGAPILTLLDPLGAGDGTLAQLARFGLAITGDPLLELKREGL |
| Ga0179596_100087872 | 3300021086 | Vadose Zone Soil | MRCKDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLPLPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI |
| Ga0179596_103312521 | 3300021086 | Vadose Zone Soil | MRCKDCQKRSNERGKRPRPTRPSRVVKCQRCGAPILLQPDPMLGAGDGTLAQLTRFGLAITGDPLLELKREGT |
| Ga0179596_103365651 | 3300021086 | Vadose Zone Soil | MRCKDCQKRSNERGERPRPTRASRVAKCQRCGALILLQPDPMLGAGDGTMAQLTRFGLAITGDPLLELKREGT |
| Ga0179584_11746991 | 3300021151 | Vadose Zone Soil | KRINERIKRPRTSRRRPAATCDVCGAPILPLPDQALGVNDGTLAELKRFGLAITGDPLLELKREGI |
| Ga0210408_105755212 | 3300021178 | Soil | MRCKDCRKRTNEGIKRPRASRRRPAAKCDVCGAPILPLPDEALGAGDGTLAQLKRFGLAASGDPLLELKREGI |
| Ga0137417_13649333 | 3300024330 | Vadose Zone Soil | MRCKDCQKRSNERATRPRPARRRKVATCERCGAPILPLPDLTLGAGDGTLAQLARFGLAITGDPLLELKREGI |
| Ga0207684_108404461 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRCKDCRKRTNERIQRPQSSRRTRAATCDRCGAPILPLPDDALGAGDGTLAHLKRFGLAINGDPLLELKREGI |
| Ga0209350_11414491 | 3300026277 | Grasslands Soil | MHCKDCKKRTTERPKRPLRTRRRPVATCERCGAPILPLPDLTVGAGDGTLAQLARFGLAITGDPLLELKREGI |
| Ga0209027_10180201 | 3300026300 | Grasslands Soil | DLRPPSMRCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS |
| Ga0209238_10399662 | 3300026301 | Grasslands Soil | MHWKDCEKRSDETSKRPRSARRAKVATCERCGAVIIPLPDVGLAAGDGTLAQLARFGLAITGDPLLELKREGS |
| Ga0209468_10093963 | 3300026306 | Soil | MRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGG |
| Ga0209239_100405911 | 3300026310 | Grasslands Soil | RCKDCKKRSLETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS |
| Ga0209472_10076466 | 3300026323 | Soil | MRCKACQKRSNETSKRPRLSRRAKVARCERCGALILPLADVTLGAADGTLAQLTRFGLAFTGDPLLELKREGG |
| Ga0209152_100121364 | 3300026325 | Soil | LETSKRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS |
| Ga0209802_10414512 | 3300026328 | Soil | MRCKACQKRSNETSKRPRLSRRAMVARCERCGALILPLPDVTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR |
| Ga0209057_10238043 | 3300026342 | Soil | MTSKRPRLSRRAKVARCERCGALILPLPDMMLGAGDGTLAQLTRFGLAFTGDPLLELKREGG |
| Ga0209690_10107436 | 3300026524 | Soil | KRPRPARRGTVVRCERCGALILPLADVTLGAADGTLAQLSRFGLAITGDPLLELKREGS |
| Ga0209690_10795282 | 3300026524 | Soil | MRCKDCKKRSNETSKRPQLSRRAKVARCERCGALILPLPDMTLGAGDGTLAQLTRFGLAFTGDPLLELKREGR |
| Ga0209157_10292923 | 3300026537 | Soil | MRCKDCQKRTTERPKRPLPTRRRPVATCERCGAPMLPLPDLTVGAGDGTLAQLARFGLAITGDPLLEFKREGI |
| Ga0179587_106109261 | 3300026557 | Vadose Zone Soil | MRCKDCRKRTNERIKRPRASRRRPAATCDMCGAPILPLPDQALGANDGTLAELKRFGLAITGDPLLEL |
| Ga0209076_10254752 | 3300027643 | Vadose Zone Soil | MRCKDCRKRTNERIKRPRASRRRPAATCDMCGAPILPLPDQALGANDGTLAQLKRFGLAITGDPLLELKREGI |
| Ga0208981_11673581 | 3300027669 | Forest Soil | MRCKDCMKRSTERTKRPRPTRRGKVANCERCGAPIIPLPDLTLGAGDGTLAQLTRFGLAITGDPLLELKREGI |
| Ga0209283_101979691 | 3300027875 | Vadose Zone Soil | RCKDCQKRSNERGKRPRPTRPSRVVKCQRCGAPILLQPDPMLGAGDGTLAQLTRFGLAITGDPLLELKREGT |
| Ga0209283_105502242 | 3300027875 | Vadose Zone Soil | MRCKDCRTRANERVKRPRTSRRHPAAKCDRCGGPLLALPDAALGAGDGTLAQLTRFGLAITGDPLLELKREGI |
| Ga0137415_100006017 | 3300028536 | Vadose Zone Soil | MRCPDCKKRIKETAKRPRPLRQRPVAKCERCGAPILPLPDLTLGAADGTLAQLTRFGLAITGDPLLELKREGI |
| Ga0137415_100337602 | 3300028536 | Vadose Zone Soil | MRCRDCRTRTDERVKRPRTSRRHPAAKCDVCGAPILPLPDLALGAGDGTLAQLKRFGLAITGDPLLELKREGI |
| Ga0137415_101011972 | 3300028536 | Vadose Zone Soil | MRCKDCKKPTDERAKRPRAPRRRPVAKCERCGAPILPLADLTLGAGDGTLAQLARFGLAITGDPLLELKREGI |
| Ga0137415_103506762 | 3300028536 | Vadose Zone Soil | MRCKDCRKRTNERIKRPRASRRRPAATCDMCGAPILPLPDQALGANDGTLAQLKRFGLTITGDPLLELKREGI |
| Ga0307296_106270461 | 3300028819 | Soil | MRCKDCMRHISERTQPRRPVRRRPVGTCERCGAPILTLLDLPLGAGDGTLAQLARFGLAITGDPLLELKREGL |
| Ga0307278_100283613 | 3300028878 | Soil | MLCKDCMRHISERTQRRRPGRRRPVGTCERCGAPILTLLDLPLGAGDGTLAQLARFGLAITGDPLLELKREGL |
| Ga0307469_105842162 | 3300031720 | Hardwood Forest Soil | MRCKDCEKRSNERAKRPPRHRRRPVAKCERCGAPILSLPDLTLGAGDGTLAQLTRFGLAITGDPLLELKRE |
| Ga0307469_117710892 | 3300031720 | Hardwood Forest Soil | CKQRIKERAKRPRPPRQRPVAKCDWCGAPILPLPDLALGAADGTLAQLTRFGLAITGDPLLELKREGI |
| Ga0307471_1001046622 | 3300032180 | Hardwood Forest Soil | MRCTDCKKRIEEKAKRPRPPRQRTVAKCEWCGAPILPLPDLTLGAADGTLAQLTRFGLAITGDPLLELKREGI |
| ⦗Top⦘ |