| Basic Information | |
|---|---|
| Family ID | F104900 |
| Family Type | Metagenome |
| Number of Sequences | 100 |
| Average Sequence Length | 69 residues |
| Representative Sequence | AARFDVIAATNAAAAIQDERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG |
| Number of Associated Samples | 88 |
| Number of Associated Scaffolds | 100 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 1.00 % |
| % of genes from short scaffolds (< 2000 bps) | 1.00 % |
| Associated GOLD sequencing projects | 81 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.54 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (99.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (32.000 % of family members) |
| Environment Ontology (ENVO) | Unclassified (38.000 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (45.000 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 43.75% β-sheet: 0.00% Coil/Unstructured: 56.25% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.54 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 100 Family Scaffolds |
|---|---|---|
| PF08487 | VIT | 52.00 |
| PF13768 | VWA_3 | 17.00 |
| PF13519 | VWA_2 | 5.00 |
| PF07883 | Cupin_2 | 3.00 |
| PF13490 | zf-HC2 | 2.00 |
| PF04542 | Sigma70_r2 | 1.00 |
| PF00753 | Lactamase_B | 1.00 |
| COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
|---|---|---|---|
| COG0568 | DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) | Transcription [K] | 1.00 |
| COG1191 | DNA-directed RNA polymerase specialized sigma subunit | Transcription [K] | 1.00 |
| COG1595 | DNA-directed RNA polymerase specialized sigma subunit, sigma24 family | Transcription [K] | 1.00 |
| COG4941 | Predicted RNA polymerase sigma factor, contains C-terminal TPR domain | Transcription [K] | 1.00 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 99.00 % |
| All Organisms | root | All Organisms | 1.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300007265|Ga0099794_10129578 | All Organisms → cellular organisms → Bacteria | 1273 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 32.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 21.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 10.00% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 4.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.00% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.00% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.00% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.00% |
| Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.00% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.00% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.00% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 2.00% |
| Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.00% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.00% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.00% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.00% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.00% |
| Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00% |
| Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.00% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000881 | Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soil | Environmental | Open in IMG/M |
| 3300002557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental | Open in IMG/M |
| 3300004058 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 | Environmental | Open in IMG/M |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
| 3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental | Open in IMG/M |
| 3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental | Open in IMG/M |
| 3300005459 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 | Host-Associated | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental | Open in IMG/M |
| 3300005544 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG | Environmental | Open in IMG/M |
| 3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental | Open in IMG/M |
| 3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental | Open in IMG/M |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental | Open in IMG/M |
| 3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated | Open in IMG/M |
| 3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated | Open in IMG/M |
| 3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental | Open in IMG/M |
| 3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M |
| 3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated | Open in IMG/M |
| 3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental | Open in IMG/M |
| 3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental | Open in IMG/M |
| 3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental | Open in IMG/M |
| 3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental | Open in IMG/M |
| 3300011435 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2 | Environmental | Open in IMG/M |
| 3300012035 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2 | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental | Open in IMG/M |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental | Open in IMG/M |
| 3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental | Open in IMG/M |
| 3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental | Open in IMG/M |
| 3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300024224 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14 | Environmental | Open in IMG/M |
| 3300026089 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026307 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes) | Environmental | Open in IMG/M |
| 3300026323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes) | Environmental | Open in IMG/M |
| 3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental | Open in IMG/M |
| 3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental | Open in IMG/M |
| 3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
| 3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
| 3300027395 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 PM (SPAdes) | Host-Associated | Open in IMG/M |
| 3300027490 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes) | Environmental | Open in IMG/M |
| 3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300028072 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK16 | Environmental | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028814 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183 | Environmental | Open in IMG/M |
| 3300028878 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117 | Environmental | Open in IMG/M |
| 3300031199 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S | Environmental | Open in IMG/M |
| 3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300034178 | Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI10215J12807_10010711 | 3300000881 | Soil | VIAATNPVVAIQDEREFLKWIAEHQTSFPDAYRTIKEANLGLVELSDADAEVLESGPNQCAVG* |
| JGI25381J37097_10221752 | 3300002557 | Grasslands Soil | NVAAAIQDERTFLQWINEHTTSFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK* |
| JGI25384J37096_100715313 | 3300002561 | Grasslands Soil | RFDVVAATNVAAAIQDETAFLHWVADHAPAAPDAYRTIKLANLGLLELSDADAEVLESGPNQCAVPGAA* |
| JGI25384J37096_101557281 | 3300002561 | Grasslands Soil | VIAATNGAAAIQDERAFLRWITEHTTSFPDAYRAIKETNLGLADLSDSEAEMLESGPNQCAVK* |
| JGI25382J43887_102125871 | 3300002908 | Grasslands Soil | AHYAGEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVV* |
| JGI25382J43887_102540651 | 3300002908 | Grasslands Soil | ARFDVVAATNEVAAIQDRATFLRWIEDHTTTPPDSYRTIKLANLGLLELSEVDAELVESGPNQCAVG* |
| JGI25616J43925_102851821 | 3300002917 | Grasslands Soil | IQDQQVFLKWIADHATNFPDGYRTIKETNLGLVGLSDADAEILESGPNQCAVI* |
| Ga0055498_100207892 | 3300004058 | Natural And Restored Wetlands | IAGTNPVVAIQDERAFLKWITDHQTSFPDAYRTIKEINLGLVDVSDADIEVLESGPNQCAIG* |
| Ga0062595_1001419421 | 3300004479 | Soil | EDERRADRVIAARFDVITATNPPAAIQDERVFLQWIAEHQTSFPDAYRTIKEANLGLVDLSDADAEVIESGPNQCAVG* |
| Ga0066674_101975131 | 3300005166 | Soil | AARFDVVAATNVAAAIQDEAAFLRWVADHTPVAPDAYRTIKLANLGLVALSDADAEVLESGPNQCAVPGAA* |
| Ga0066677_102764611 | 3300005171 | Soil | ASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWINAHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK* |
| Ga0066683_101979652 | 3300005172 | Soil | DRAVAARFDVISATNTAAAIQDERVFLKWIADHAMTSPDAYRMIKEANLGLVQLSDEDAEILESGPNQCAVM* |
| Ga0066676_109224051 | 3300005186 | Soil | IQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVV* |
| Ga0070708_1000535161 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | HYASEGERRADRAVAARFDVIAATNAAAVIQDEGVFLKWIADRMTPFPEAYRTIKEANLGLVDPSDSDTEILESGANQCAIG* |
| Ga0066689_100737131 | 3300005447 | Soil | AHYASEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAETLESGPNQCAVV* |
| Ga0066681_100673241 | 3300005451 | Soil | DVVAATNVAAAIQDKTAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA* |
| Ga0068867_1003765092 | 3300005459 | Miscanthus Rhizosphere | RFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV* |
| Ga0070706_1002308572 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | DRAVAARFDVISATNSAAAIQDERVFLKWIADHATAFPDAYRTIKEANLGLTQLSDADAEVVESGPNQCAIV* |
| Ga0070699_1002053911 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | IQDEGVFLKWIADRVTPFPEAYRTIKEANLGLVDPSDSDTEILESGANQCAIG* |
| Ga0070686_1010443011 | 3300005544 | Switchgrass Rhizosphere | ATNPVVAIQDEREFLRWIADHQTSFPDAYRTIKEANLGLVELSDADAEVLESGPNQCAVG |
| Ga0066695_100248251 | 3300005553 | Soil | AIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG* |
| Ga0066707_1000208110 | 3300005556 | Soil | ATNAAAVIQDERVFLNWIADRATPFPEAYRTIKEANLGLVDTSDSDAEALESGPNQCAIG |
| Ga0066698_102656621 | 3300005558 | Soil | AARFDVIAATNAAAAIQDERTFLKWIEDHASVFPDAYRMIKETNLGLMDISDADAEVLESGPNQCAVR* |
| Ga0066700_104169431 | 3300005559 | Soil | HYASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWIKEHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVQ* |
| Ga0066670_101008332 | 3300005560 | Soil | IAATNVAAAIQDERTFLQWINEHTTSFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK* |
| Ga0066699_101874862 | 3300005561 | Soil | DRAVAARFDVISATNTAAAIQDERIFLQWIADHATNFPDAYRTIKEANLGLVELTDADAEVLESGPNQCAIG* |
| Ga0068859_1006940822 | 3300005617 | Switchgrass Rhizosphere | HYSSETERRADRAVAARFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV* |
| Ga0068862_1021938632 | 3300005844 | Switchgrass Rhizosphere | ETERRADRAVAARFDVIAATNPVVAIQDEREFLKWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV* |
| Ga0066656_103572071 | 3300006034 | Soil | DERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG* |
| Ga0066653_101044453 | 3300006791 | Soil | SGRRADRSVAARFDVVAATNVAATIQDETAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA* |
| Ga0075425_1007810272 | 3300006854 | Populus Rhizosphere | VISATNSAAAIQDERVFLKWIADHATTFPDAYRTIKEANLGLAQVSDADAEVVESGPNQCAIG* |
| Ga0075425_1027913922 | 3300006854 | Populus Rhizosphere | AAAIQDQHVFLKWIADHATTFPDAYRTIKETNLGLAQISDEDAELLESGPNQCAVI* |
| Ga0075436_1008684762 | 3300006914 | Populus Rhizosphere | PVARVQDERQFLRWIAEHVTPFPDAYRTIKEANLGLVTLSEADAEIVESGPNQCAIA* |
| Ga0075435_1018133311 | 3300007076 | Populus Rhizosphere | AGESERRADRAIAARFDVISATNTAAAIQDQHVFLKWIADHATSFPEAYRTIKETNLGLVELSDADAEQLESGPNQCAVI* |
| Ga0099793_100193821 | 3300007258 | Vadose Zone Soil | RFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVM* |
| Ga0099793_104422871 | 3300007258 | Vadose Zone Soil | NTAAAIQDERVFLQWIADHATNFPDAYRTIKDTNLGLVDVSEADAEILESGPNQCAVA* |
| Ga0099794_101295781 | 3300007265 | Vadose Zone Soil | ESERRADRSVAARFDVVAATNVAAAIQDETAFLRWVADHAPVAPDAYRTIKLANLGLLEISDADAEVLEAGPNQCAVPGAA* |
| Ga0066710_1001353444 | 3300009012 | Grasslands Soil | RFDVISATNGAAAIQDERQFLQWVKDHVTTFPDAYRTIKEANLGLVRLTEPDIDILESGPNQCAVG |
| Ga0066710_1038924021 | 3300009012 | Grasslands Soil | HYESEAERRADRAVAARFDEVVTTNAAAAIQDEREFLNWVADHFGPIPDAYRTIKEANLGLLDLSDSDAGMLESGPNQCAVR |
| Ga0099829_107488471 | 3300009038 | Vadose Zone Soil | IQDERVFLKWIADRQTAFPDAYRTIKEANLGLVDLSDADAELAESGPNQCAVA* |
| Ga0099827_114799831 | 3300009090 | Vadose Zone Soil | SESERRADRAIAARFDVISATNAAAAIQDERVFLKWIAEHATNFPDAYRTIKETNLGLVQLSDPDAEMLESGPNQCAVI* |
| Ga0066709_1006916131 | 3300009137 | Grasslands Soil | ESERRADRAIAARFDVISATNTAAPIQDQQVFLKWIADHATNFPDAYRTIKETNLGLVGLSDADADILESGPNQCAVI* |
| Ga0099792_102528131 | 3300009143 | Vadose Zone Soil | DVIAATNGAAAIQDERAFLRWIADHQTSFPDAYKTIKEANLGLVDVSDPDAEILESGPNQCAV* |
| Ga0126382_124662721 | 3300010047 | Tropical Forest Soil | DRAVASRFDVILATNHVAAIQEEREFLRWIAEHATTFPDAYRTIKEANLGLVTLSDADAEIVESGPNQCAIA* |
| Ga0126377_106944452 | 3300010362 | Tropical Forest Soil | SGEGERRADRAVAARFDVVAATNAPVAIQDEGTFLQWIADHTTTFPDAYRTIKEVNLGLTEVPDADAEILESGPNQCAIV* |
| Ga0134128_118325011 | 3300010373 | Terrestrial Soil | NEAARIQGEGDFLQWVADHQTTPPDAYRTIKLANLGLVDLTDSDAATLEAGPNQCAVK* |
| Ga0137426_10256952 | 3300011435 | Soil | DRAVAARFDVIAATNPIASIQDELQFLTWIADHQASFPDAYRTIKEANLGLVELSEADAEVLESGPNQCAVG* |
| Ga0137445_10636471 | 3300012035 | Soil | AHRQRRATAAVAARAVAARFDVIAATNPVASIQDEHQFLKSIADHQASFPDAYRTIKEANLGLAELSDADAEVLESGPNQCAVA* |
| Ga0137364_113197311 | 3300012198 | Vadose Zone Soil | AARFDVVAATNAAAAIQDEAAFLRWVAEHTSVAPEAYRTIKLANLGLVTLSDADAEVLESGPNQCAVPGTV* |
| Ga0137382_101794482 | 3300012200 | Vadose Zone Soil | VILATNPVAAIQDERQFLQWIGDHATAFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG* |
| Ga0137399_105610012 | 3300012203 | Vadose Zone Soil | VAAAIQDETAFLRWVADHAPVAPDAYRTIKLANLGLLEISDADAEVLESGPNQCAVPGAA |
| Ga0137374_102261722 | 3300012204 | Vadose Zone Soil | AATNAAAAIQDEREFLGWVADHQMTPPAAYRTIKQANLGLVDVAESDADLLESGPNQCAVR* |
| Ga0137380_101077362 | 3300012206 | Vadose Zone Soil | AHYARELERRADRAIAARFDVIAATNAAAAIQDERAFLQWIAEHTTSFPDAYRTIKETNLALADLSDSEAEMLESGPNQCAVK* |
| Ga0137381_106208322 | 3300012207 | Vadose Zone Soil | RFDVIAATNGAAAIQDERAFLGWITEHTATFPDAYRTIKETNLGLADLSDSEAEMLESGPNQCAVK* |
| Ga0137376_102301071 | 3300012208 | Vadose Zone Soil | SATNTAAAIQDERVFLKWIADHATTFPDAYRTIKEANLGLVQLSDEDAEILESGPNQCAIF* |
| Ga0137376_107284741 | 3300012208 | Vadose Zone Soil | LATNPVAAIQDERQFLQWIGDHATAFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG* |
| Ga0137376_110783412 | 3300012208 | Vadose Zone Soil | AIHDERLFLQWIADHHTTPPAAYRTIKLANLGLIEVSDADAEVLESGPNQCAIG* |
| Ga0137378_103325142 | 3300012210 | Vadose Zone Soil | EGERRADRAVAARFDVITATNAAAAIQDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDSDAEVLESGPNQCAIA* |
| Ga0137369_104968121 | 3300012355 | Vadose Zone Soil | RAIAARFDVITATNAAAAIQDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDADAEVLESGPNQCAIA* |
| Ga0137384_101351342 | 3300012357 | Vadose Zone Soil | HYASEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG* |
| Ga0137384_105082362 | 3300012357 | Vadose Zone Soil | AIAARFDVIAATNGAAAIQDERVFLQWLVDHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG* |
| Ga0137385_110670191 | 3300012359 | Vadose Zone Soil | SEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLVDHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG* |
| Ga0137373_111595441 | 3300012532 | Vadose Zone Soil | EVERRADRAVAARFDVISATNAAAAIQDERAFLRWVADHQTPPPDAYRSIKLANLGLVAVPDSDAEVLESGPSQCAVG* |
| Ga0137358_103215992 | 3300012582 | Vadose Zone Soil | DVISATNTAAAIQDERVFLEWIADHQSSFPDAYRTVKEANLGLVELSDPDAETLESGANQCAVM* |
| Ga0137398_111522842 | 3300012683 | Vadose Zone Soil | ISATNAAAAIQDERVFLKWVADHATTFPDAYRTIKEANLGLAQLSDADAEVVESGPNQCAIV* |
| Ga0137396_110593582 | 3300012918 | Vadose Zone Soil | DRAIAALFDVISATNEATAIQDERVFLRWIADHSTTFPDAYRTIKEANLGLVDVADPDAEILESGPNLCAVM* |
| Ga0137359_117968091 | 3300012923 | Vadose Zone Soil | FDVILATSPAAAIQDERQFLQWIGDHGTTFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG* |
| Ga0137419_117821632 | 3300012925 | Vadose Zone Soil | WAVAARFDVIAATNAAAAIQDERQFLKWIADHSSTFPDAYRTIKEANLGLTDLSEADAEVLESGPNQCAIA* |
| Ga0137419_119366742 | 3300012925 | Vadose Zone Soil | DVISATNTAAAIQDERVFLKWIADHATDFPDAYRTIKEANLGLVDVAEADAEILESGPNQCAIV* |
| Ga0137416_103102221 | 3300012927 | Vadose Zone Soil | DEPAFLRWLSEHTSSFPDAYRTIKATNLGLADLSDSEAETLESGPNQCAVK* |
| Ga0137416_107428391 | 3300012927 | Vadose Zone Soil | RADRAVAARFDVISATNTAAAIQDERVFLQWIADHATAFPEAYRTIKESNLGFVELSDADAEILESGPNQCAVV* |
| Ga0137416_119327431 | 3300012927 | Vadose Zone Soil | VILATNPAAAIQDERQFLQWIGDHATSLPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG* |
| Ga0134076_101494781 | 3300012976 | Grasslands Soil | YASEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG* |
| Ga0134089_100832372 | 3300015358 | Grasslands Soil | DVITATNAAAAIQDERVFLQWIADRQTGFPDAYRTIKETNLGLADLSDSEAEMLESGPNQCTVK* |
| Ga0134083_105694872 | 3300017659 | Grasslands Soil | AARFDVIAATNAAAAIQDERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG |
| Ga0184608_101019162 | 3300018028 | Groundwater Sediment | LATNPVAAIQDKRQFLQWIGDHATTFPDAYKTIKEANLGLVTVSDLDAEILESGPNQCAV |
| Ga0184634_102919912 | 3300018031 | Groundwater Sediment | GEGERRADRAIAARFDVIAATNAAASIQDEAVFLKWIADHQTSFPEAYRTIKEANLGLADLSDADAEVLESGPNQCAVG |
| Ga0184633_101587301 | 3300018077 | Groundwater Sediment | FDVIAATNAAASIQDEAAFLQWIADHRTTFPEAYRRIKEANLGLVDVSEADAEVLESGPNQCAVK |
| Ga0184627_106737472 | 3300018079 | Groundwater Sediment | YAGEGERRADRAIAARFDVIAATNAAASIQDEGVFLKWIADHPTSFPDAYRTIKETNLGLVDLSDADAEVLESGPNQCAVG |
| Ga0066667_100680381 | 3300018433 | Grasslands Soil | QDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDADAEVLESGPNQCAIA |
| Ga0247673_10661922 | 3300024224 | Soil | ERRADRAIAARFDVISATNTAAAIQDQQVFLKWIADHATTFPDAYRTIKETNLGLVQISDEDAELLESGPNQCAVV |
| Ga0207648_120431342 | 3300026089 | Miscanthus Rhizosphere | AARFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV |
| Ga0209469_10515552 | 3300026307 | Soil | ASESERRADRAIAARFDVILATNPVAAIQDERQFLQWIGDHATTFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG |
| Ga0209472_10427383 | 3300026323 | Soil | DVVAATNVAAAIQDKTAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA |
| Ga0209470_13635862 | 3300026324 | Soil | AVAARFDVISATNTAAAIQDESVFLKWIADHQSNFPDAYRTIKEANLGLVELSDADAEILESGANQCAVV |
| Ga0209266_11799991 | 3300026327 | Soil | RRADRAVAARFDVISATNTAAAIQDERVFLKWIADHAMTSPDAYRMIKEANLGLVQLSDEDAEILESGPNQCAVM |
| Ga0209266_12088852 | 3300026327 | Soil | ELERRADRALAARFDVISATNAAAAIQDERLFLQWIADHQATPPDAYRTIKLANLGLVDVSDADAEALESGPNQCAVA |
| Ga0209267_12901342 | 3300026331 | Soil | FDVIAATNVAAAIQDERTFLQWIKEHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVQ |
| Ga0209805_14358731 | 3300026542 | Soil | HYASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWINEHTTAFPDAYRTIKEVNLGLADLSDAESEMLESGPNQCAVK |
| Ga0209996_10726382 | 3300027395 | Arabidopsis Thaliana Rhizosphere | DRVVAARFDVIAATNAVVAIQDERTFLKWIVDHETSFPDAYRTIKEANLGLVELSDADVEILESGPNQCAIA |
| Ga0209899_11112172 | 3300027490 | Groundwater Sand | NPAAAIQDERAFLQWIVDHQTSFPDTYRRIKEANLGLVDVPDADAEVLESGPNQCAVR |
| Ga0209701_104918901 | 3300027862 | Vadose Zone Soil | YVGVSSYSANRTNEAAAIQDERAFLQWIAAHTPIFPDSYRTIKTANLGLVDVGEADAEILEFGPNQCAIR |
| Ga0247675_10262861 | 3300028072 | Soil | TNPVVAIQDEREFLKWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV |
| Ga0137415_113190491 | 3300028536 | Vadose Zone Soil | VILATNPAAAIQDERQFLQWIGDHATSLPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG |
| Ga0307302_100676602 | 3300028814 | Soil | YAGEDERRADRAVAARFDVISATNSAAAIQDERVFLRWVADHATTFPDAYRTIKEANLGLAQLSDADAEVVESGPNQCAIV |
| Ga0307278_104879271 | 3300028878 | Soil | AHYASEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHQSSFPDAYRTIKEANLGLVELSEPDAETLESGANQCAVM |
| Ga0307495_100717861 | 3300031199 | Soil | AHYASESERRADRAVAARFDVIAATNPVASIQDQQQFVTWIADHQARFPDAYRTIKEANLGLVELSDPDADVLESGPNQCAVA |
| Ga0307497_101871591 | 3300031226 | Soil | YASESERRADRAVASRFDVILATNPVAAIQDEREFLRWIANRATTFPDAYRTIKEANLGLVTLSEPDVEILESGPNQCAVA |
| Ga0307471_1028854781 | 3300032180 | Hardwood Forest Soil | VVAATNTAAAIQDERTFLKWIADHTSPFPDAYRTIKEANLGLVDVSDADAEILESGPNQCAIV |
| Ga0364934_0345316_355_564 | 3300034178 | Sediment | IAARFDVIAATNAAAAIQDERTFLGWIADHTTTFPDAYRTIKEANLGLVDPSDADIEVLESGPNQCAVG |
| ⦗Top⦘ |