| Basic Information | |
|---|---|
| Family ID | F049139 |
| Family Type | Metagenome |
| Number of Sequences | 147 |
| Average Sequence Length | 66 residues |
| Representative Sequence | TVHLRCDDPQVGTVTIDGRFLTRLVTARLDAAVVSAVVTVRDGSGEVLYRARDSFEWHPGN |
| Number of Associated Samples | 104 |
| Number of Associated Scaffolds | 147 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 4.76 % |
| % of genes from short scaffolds (< 2000 bps) | 3.40 % |
| Associated GOLD sequencing projects | 91 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.65 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (95.238 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (29.932 % of family members) |
| Environment Ontology (ENVO) | Unclassified (61.224 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (68.707 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 0.00% β-sheet: 44.94% Coil/Unstructured: 55.06% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.65 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 147 Family Scaffolds |
|---|---|---|
| PF01411 | tRNA-synt_2c | 6.12 |
| PF00583 | Acetyltransf_1 | 5.44 |
| PF05163 | DinB | 4.76 |
| PF14534 | DUF4440 | 4.08 |
| PF13476 | AAA_23 | 3.40 |
| PF13302 | Acetyltransf_3 | 3.40 |
| PF02585 | PIG-L | 2.72 |
| PF00291 | PALP | 2.72 |
| PF12867 | DinB_2 | 1.36 |
| PF10041 | DUF2277 | 1.36 |
| PF08734 | GYD | 1.36 |
| PF10503 | Esterase_PHB | 1.36 |
| PF07110 | EthD | 1.36 |
| PF13801 | Metal_resist | 1.36 |
| PF12680 | SnoaL_2 | 1.36 |
| PF13474 | SnoaL_3 | 1.36 |
| PF04237 | YjbR | 0.68 |
| PF06439 | 3keto-disac_hyd | 0.68 |
| PF13924 | Lipocalin_5 | 0.68 |
| PF06283 | ThuA | 0.68 |
| PF13577 | SnoaL_4 | 0.68 |
| PF07311 | Dodecin | 0.68 |
| PF04321 | RmlD_sub_bind | 0.68 |
| PF10882 | bPH_5 | 0.68 |
| PF01872 | RibD_C | 0.68 |
| PF00484 | Pro_CA | 0.68 |
| PF12681 | Glyoxalase_2 | 0.68 |
| PF08592 | Anthrone_oxy | 0.68 |
| PF09413 | DUF2007 | 0.68 |
| PF03173 | CHB_HEX | 0.68 |
| PF12848 | ABC_tran_Xtn | 0.68 |
| PF02224 | Cytidylate_kin | 0.68 |
| PF11528 | DUF3224 | 0.68 |
| PF06127 | Mpo1-like | 0.68 |
| PF08818 | DUF1801 | 0.68 |
| PF13207 | AAA_17 | 0.68 |
| PF07609 | DUF1572 | 0.68 |
| PF06983 | 3-dmu-9_3-mt | 0.68 |
| PF13376 | OmdA | 0.68 |
| PF13561 | adh_short_C2 | 0.68 |
| PF13527 | Acetyltransf_9 | 0.68 |
| COG ID | Name | Functional Category | % Frequency in 147 Family Scaffolds |
|---|---|---|---|
| COG0013 | Alanyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 6.12 |
| COG2318 | Bacillithiol/mycothiol S-transferase BstA/DinB, DinB/YfiT family (unrelated to E. coli DinB) | Secondary metabolites biosynthesis, transport and catabolism [Q] | 4.76 |
| COG2120 | N-acetylglucosaminyl deacetylase, LmbE family | Carbohydrate transport and metabolism [G] | 2.72 |
| COG0451 | Nucleoside-diphosphate-sugar epimerase | Cell wall/membrane/envelope biogenesis [M] | 1.36 |
| COG0702 | Uncharacterized conserved protein YbjT, contains NAD(P)-binding and DUF2867 domains | General function prediction only [R] | 1.36 |
| COG1086 | NDP-sugar epimerase, includes UDP-GlcNAc-inverting 4,6-dehydratase FlaA1 and capsular polysaccharide biosynthesis protein EpsC | Cell wall/membrane/envelope biogenesis [M] | 1.36 |
| COG4274 | Uncharacterized conserved protein, contains GYD domain | Function unknown [S] | 1.36 |
| COG5649 | Uncharacterized conserved protein, DUF1801 domain | Function unknown [S] | 0.68 |
| COG5646 | Iron-binding protein Fra/YdhG, frataxin family (Fe-S cluster biosynthesis) | Posttranslational modification, protein turnover, chaperones [O] | 0.68 |
| COG4813 | Trehalose utilization protein | Carbohydrate transport and metabolism [G] | 0.68 |
| COG4539 | 2-hydroxy fatty acid dioxygenase MPO1 (alpha-oxidation of fatty acids) | Lipid transport and metabolism [I] | 0.68 |
| COG4430 | Uncharacterized conserved protein YdeI, YjbR/CyaY-like superfamily, DUF1801 family | Function unknown [S] | 0.68 |
| COG3865 | Glyoxalase superfamily enzyme, possible 3-demethylubiquinone-9 3-methyltransferase | General function prediction only [R] | 0.68 |
| COG3525 | N-acetyl-beta-hexosaminidase | Carbohydrate transport and metabolism [G] | 0.68 |
| COG3360 | Flavin-binding protein dodecin | General function prediction only [R] | 0.68 |
| COG2764 | Zn-dependent glyoxalase, PhnB family | Energy production and conversion [C] | 0.68 |
| COG2315 | Predicted DNA-binding protein with ‘double-wing’ structural motif, MmcQ/YjbR family | Transcription [K] | 0.68 |
| COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 0.68 |
| COG1091 | dTDP-4-dehydrorhamnose reductase | Cell wall/membrane/envelope biogenesis [M] | 0.68 |
| COG1090 | NAD dependent epimerase/dehydratase family enzyme | General function prediction only [R] | 0.68 |
| COG1089 | GDP-D-mannose dehydratase | Cell wall/membrane/envelope biogenesis [M] | 0.68 |
| COG1088 | dTDP-D-glucose 4,6-dehydratase | Cell wall/membrane/envelope biogenesis [M] | 0.68 |
| COG1087 | UDP-glucose 4-epimerase | Cell wall/membrane/envelope biogenesis [M] | 0.68 |
| COG0288 | Carbonic anhydrase | Inorganic ion transport and metabolism [P] | 0.68 |
| COG0283 | Cytidylate kinase | Nucleotide transport and metabolism [F] | 0.68 |
| COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 0.68 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 95.24 % |
| All Organisms | root | All Organisms | 4.76 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300005180|Ga0066685_10348305 | All Organisms → cellular organisms → Bacteria | 1028 | Open in IMG/M |
| 3300005561|Ga0066699_10165213 | All Organisms → cellular organisms → Bacteria | 1521 | Open in IMG/M |
| 3300006796|Ga0066665_10042439 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3079 | Open in IMG/M |
| 3300026301|Ga0209238_1045928 | All Organisms → cellular organisms → Bacteria | 1580 | Open in IMG/M |
| 3300026333|Ga0209158_1026305 | All Organisms → cellular organisms → Bacteria | 2557 | Open in IMG/M |
| 3300026538|Ga0209056_10146083 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1822 | Open in IMG/M |
| 3300027846|Ga0209180_10336796 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 861 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 29.93% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 21.09% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 16.33% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 14.97% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 4.08% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.40% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.72% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.04% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.36% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.68% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.68% |
| Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.68% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.68% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.68% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.68% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
| 3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005451 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 | Environmental | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental | Open in IMG/M |
| 3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental | Open in IMG/M |
| 3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental | Open in IMG/M |
| 3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M |
| 3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental | Open in IMG/M |
| 3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental | Open in IMG/M |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental | Open in IMG/M |
| 3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental | Open in IMG/M |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental | Open in IMG/M |
| 3300005890 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 | Environmental | Open in IMG/M |
| 3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental | Open in IMG/M |
| 3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated | Open in IMG/M |
| 3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental | Open in IMG/M |
| 3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental | Open in IMG/M |
| 3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
| 3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
| 3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M |
| 3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental | Open in IMG/M |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M |
| 3300015356 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300015358 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300015372 | Soil combined assembly | Host-Associated | Open in IMG/M |
| 3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental | Open in IMG/M |
| 3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental | Open in IMG/M |
| 3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental | Open in IMG/M |
| 3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
| 3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental | Open in IMG/M |
| 3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental | Open in IMG/M |
| 3300026528 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes) | Environmental | Open in IMG/M |
| 3300026530 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes) | Environmental | Open in IMG/M |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental | Open in IMG/M |
| 3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental | Open in IMG/M |
| 3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
| 3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental | Open in IMG/M |
| 3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental | Open in IMG/M |
| 3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental | Open in IMG/M |
| 3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental | Open in IMG/M |
| 3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental | Open in IMG/M |
| 3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
| 3300032157 | Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soil | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI25385J37094_101333882 | 3300002558 | Grasslands Soil | FAQTTVACVEAQISAQSVHLRCDDPQVGKVTIDGRFLTRLATSSLDTPVLSAVVTVRTASGEILYSARDSFRWQPGD* |
| JGI25382J37095_100520083 | 3300002562 | Grasslands Soil | VTFSQTTVACIDAKISAQDLHLRCDDPQLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDGSGEILYRARDSFEWHPGN* |
| JGI25382J37095_100939103 | 3300002562 | Grasslands Soil | HLRCDDPQLGSVTIDGRFLTRFVTTRLDAQVVAAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| JGI25382J43887_100549666 | 3300002908 | Grasslands Soil | KVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGEILYRARDSFEWHPAE* |
| JGI25382J43887_102306841 | 3300002908 | Grasslands Soil | CESPEVGVVTIEGRFLTRVATTLLDTPVVSAIVTVHAPSGEVYSARDSFVWQAGH* |
| JGI25389J43894_10764171 | 3300002916 | Grasslands Soil | DDPQLGMVTTIDGTFHARLATDRLDTTVLSAVVTVRSGSGEVLYRARDSFKWHPVD* |
| Ga0066683_101658794 | 3300005172 | Soil | HLRCDYPQLGTVSIDGKFLTRFATNSLDRAVLSAVVTVRSPSGDVLYSARDSFVWHPSD* |
| Ga0066673_100389035 | 3300005175 | Soil | TEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK* |
| Ga0066684_103843031 | 3300005179 | Soil | DPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYRARDSFKWHPRDSGEPGT |
| Ga0066684_103843041 | 3300005179 | Soil | DPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYQARDSFKWHPRDSGEPGT |
| Ga0066685_103483054 | 3300005180 | Soil | TVACIEAQISAQNVHLRCDDPQIGTVTIDGRFLTRLASNQLDAAVLSAVVTVRTGSGEILYSARDSFRWQQRD* |
| Ga0066676_108640292 | 3300005186 | Soil | GTVSIDGKFLTRLATTRLDRPVVSAVVTVRDPTGEILYRARDSFVWHQAE* |
| Ga0066675_106347811 | 3300005187 | Soil | EQPMACLDVLIRADTVHLRCDDPQVGTVTIDGRFRTRVATHRLDTAVLAAVVTVRSGSGEILYRAQDSFKWHPRDSGEPGT* |
| Ga0066675_109265662 | 3300005187 | Soil | HLRCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYQARDSFKWHPRDSGEPGT* |
| Ga0066675_110614132 | 3300005187 | Soil | RADTVHLRCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYQARDSFKWHPRDSGDPGT* |
| Ga0066686_103728543 | 3300005446 | Soil | IRLRCESPEVGTVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0066686_107193443 | 3300005446 | Soil | VTIDGRFLTRFVTTRLDARVVAAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| Ga0066689_103973591 | 3300005447 | Soil | QISAQNVHLRCDDPQLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDASGEILYRARDSFEWHPGH* |
| Ga0066681_100761611 | 3300005451 | Soil | PQVGTVIIDGRFLTRVATNRLDAAVVSAVVTVRAGSGDVLYRARDSFQWHASDQPQ* |
| Ga0070707_1009695622 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | LRCESPEVGVVTIEGRFLTRVATTRLDAPVVSAVVTVHAPGGEVYSARDSFVWQAGR* |
| Ga0070699_1001695791 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | TAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVMVRDATGDVLYRAQDSFEWRPGN* |
| Ga0073909_106425212 | 3300005526 | Surface Soil | CEGPKGGTITIDGKFLTRRATTRLDVPVLSAVVTARSGSGEIVYSARDRFAWQPGN* |
| Ga0070697_1012470442 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | EAFVTFGQSGSEEQVACMQARISPTAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVTVRDGAGDVLYKAQDSFEWRPGN* |
| Ga0066697_100578111 | 3300005540 | Soil | QPAGCFEALIRADTVHLRCDYPQLGTVTIDGKFLTRFATNSLDAAVLSAVVTVRAASGDILYSARDSFVWHRARLGRGSG* |
| Ga0066697_105434402 | 3300005540 | Soil | CESPEVGVVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0066707_100440926 | 3300005556 | Soil | SAQNVHLRCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD* |
| Ga0066670_100722454 | 3300005560 | Soil | QPAGCFEALIRADTVHLRCDYPQLGTVTIDGKFLTRFATNSLDAAVLSAVVTVRAASGDILYSARDSFVWHPRD* |
| Ga0066670_101601173 | 3300005560 | Soil | RADTVHLRCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYRARDSFKWHPRDSGEPGT* |
| Ga0066699_101652134 | 3300005561 | Soil | AQTSAGIEQPAACFEALIRADTVHLRCDSPQVGTVTIDGKFLTRLATTSLDTPVLSAVVTVRSASGEIVYSARDSFVWHPGQ* |
| Ga0066699_102170381 | 3300005561 | Soil | HLRCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYRARDSFKWHPRDSGEPGT* |
| Ga0066703_101426913 | 3300005568 | Soil | QTTVACLEAQISAQRVHLRCDDPRVGTVIIDGRFLTRLASNRLDAEALSAVVTVRAGSGEILYSARDSFRWHPRD* |
| Ga0066703_108257772 | 3300005568 | Soil | VHLRCDDPQVGTVTIDGRFLTRLASNQLDAEVLSAVVTVRAGSGDILYSARDSFRWRPRE |
| Ga0066706_106423171 | 3300005598 | Soil | EAAISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAAAVTAVVTVRSGSGEILYNARDRFEWHPSN* |
| Ga0066706_110909452 | 3300005598 | Soil | EVGTVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0075285_10217981 | 3300005890 | Rice Paddy Soil | KDRESCSEARITATTVHLRCDFPREGTITIDGRFLTRLVTSRLDAAVVSAVVTVRNGNGDVLYNARDSFVWQQAQ* |
| Ga0066696_111134042 | 3300006032 | Soil | GTDQEMGCDETLIRADTVHLRCDYSRIGIITIDGKFLTRLVTTRFDAPVLAAVVAVRTPSGEILYRARDSFVWHPAE* |
| Ga0066652_1005790382 | 3300006046 | Soil | AVSIHLRCESPEVGVVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0066652_1020512002 | 3300006046 | Soil | CLEATISAVTIHLRCDYPQVGIVTIEGRFLTRLATNRLDTPAVSAVVTVRTGSGEVLYSARDSFVWNPGG* |
| Ga0070712_1003788383 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | CDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVTVRDGAGEVLYKAQDSFEWRAGN* |
| Ga0079222_114637962 | 3300006755 | Agricultural Soil | PAACLEARITATTVHLRCDYPQVGAVTIDGRFLTRVATTQLDASVISAVVTVRSGSGEVLYNARDSFVWHPGE* |
| Ga0066665_100424398 | 3300006796 | Soil | KISAQDLHLRCDDPQLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDATGEILYRARDSFEWHPGN* |
| Ga0066659_100958171 | 3300006797 | Soil | TFARTPSGTEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK* |
| Ga0066659_115703181 | 3300006797 | Soil | AVACFEAVISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAPVVTAVVTVRSGSGEILYNARDRFEWHPSN* |
| Ga0079221_100928011 | 3300006804 | Agricultural Soil | AFITFQTAAGTEQPVACLDAVINASTVHLRCDDQQVGTVTIEGRFLTRFATDRLDAAVLSAVVTVRSGSGEILYNGRDSFQWRPAQ* |
| Ga0079221_108929011 | 3300006804 | Agricultural Soil | TVTVDGRFLTRFVTNRPDAAVVSALVTVRTGSGEVLYSARDQFVWHPVDEASSR* |
| Ga0075426_101709562 | 3300006903 | Populus Rhizosphere | MFTRAGGVEQAMACLETLINADTVHLRCDDPQMGTVTVDGRFRTRVATDRLDTAVLSAVVTVRSGSGEILYRARDSFKWHPADSRRPT* |
| Ga0079218_132894702 | 3300007004 | Agricultural Soil | LPAACFETLIRADTVHLRCDYPQVGTVSIDGKFLTRLATTSLDTAVLSAVVTVRTASGEILYSARDSFVWHPGD* |
| Ga0075435_1019864692 | 3300007076 | Populus Rhizosphere | VTVDGRFLTRFVTNRPDAAVVSALVTVRTGSGEVLYSARDQFVWHPVDEASSR* |
| Ga0099791_105211102 | 3300007255 | Vadose Zone Soil | MQARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| Ga0099793_105088612 | 3300007258 | Vadose Zone Soil | ACMQARISATAFHLRCDDPQLGSVIIDGRFLTRFVTTRLDARVVAAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| Ga0066710_1003789284 | 3300009012 | Grasslands Soil | VEARISADTVHLRCDYPQVGTVTIDGKFLTRVTTQRLDAAVLSAVVTVRAASGDTLYSARDSFVWHAGE |
| Ga0066710_1018433482 | 3300009012 | Grasslands Soil | DDQHVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGEMLYRARDSFQWHPAE |
| Ga0066710_1020965862 | 3300009012 | Grasslands Soil | FARTTVACLEAQISAQSVHLRCDDPQVGTVIIDGRFLTRLASNQLDAEVLSAVVTVRAGSGDILYSARDSFRWRPRE |
| Ga0099830_106730731 | 3300009088 | Vadose Zone Soil | ATITAVSIRLRCESPEVGVVTIEGRFLTRVATTRLDAPVVSAVVTVHAPSGEVYSARDSFVWQPGR* |
| Ga0066709_1014830941 | 3300009137 | Grasslands Soil | FEAAISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPSN* |
| Ga0066709_1014881211 | 3300009137 | Grasslands Soil | QAGTVTIDGKFLTRLATHSLDAAVLSAVVTVRSASGEILYSARDSFVWHPGE* |
| Ga0066709_1029907741 | 3300009137 | Grasslands Soil | TVHLRCDHRVGTVTIDGQCLTRLATHSLDTAVLSAVVTVRSASGEILYSARDSFVWHPGD |
| Ga0134088_101642191 | 3300010304 | Grasslands Soil | QPAGCFEALIRADTVHLRCDYPQLGTVTIDGKFLTRFATTSLDAAVLSAVVTVRAASGDILYSARDSFVWHRARLGRGSG* |
| Ga0134109_100476211 | 3300010320 | Grasslands Soil | SPEVGVVTIEGRVLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0134109_101033971 | 3300010320 | Grasslands Soil | CDDPQLAMVTTIDGRFGTRLATDRLDTAVLSAVVTVRSGSGEILYRARDSFRWHPAHDDRGSQ* |
| Ga0134109_101296291 | 3300010320 | Grasslands Soil | RPEALVTFGQTTVACTDAKISAQDLHLRCDDPQLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDASGEILYRARDSFEWHPGN* |
| Ga0134109_103384062 | 3300010320 | Grasslands Soil | SIDGKFLTRLATTSLDRPVVSAVVTVRDPTGEILYSARDSFVWHQAE* |
| Ga0134067_100162175 | 3300010321 | Grasslands Soil | DDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYRARDSFKWHPRDSGEPGT* |
| Ga0134086_100856981 | 3300010323 | Grasslands Soil | VSIDGKFLTRLATTSLDRPVVSAVVTVRDPTGEILYSARDSFVWHQAE* |
| Ga0134086_101301282 | 3300010323 | Grasslands Soil | GQTTVACNDAKISAQDLHLRCDDRQLGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGETLYRARDSFEWHPAK* |
| Ga0134111_100686504 | 3300010329 | Grasslands Soil | RCESPEVGTVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR* |
| Ga0134063_100958231 | 3300010335 | Grasslands Soil | CDDPQVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD* |
| Ga0134063_102634381 | 3300010335 | Grasslands Soil | PQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK* |
| Ga0134123_106463994 | 3300010403 | Terrestrial Soil | AVRIHLRCESPEVGIVTIEGRFLTRLATNLLDRPVVSAVVTVRAASGEVYSARDSFVWQGR* |
| Ga0137393_116800362 | 3300011271 | Vadose Zone Soil | LRCDDPQVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRAESGEVLYSARDSFVWEPGR* |
| Ga0137364_108824151 | 3300012198 | Vadose Zone Soil | TIVACSEAQISAQSVHLRCDDPQVGTVTIDGRFLTRSASNLLDAAVLSAVVTVRTGSGEILYSARDSFRWQARD* |
| Ga0137364_113711771 | 3300012198 | Vadose Zone Soil | TTVACVEAQITAQNVHLRCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAFVTVRTASGEILYSARDSFRWQPVD* |
| Ga0137363_113608621 | 3300012202 | Vadose Zone Soil | LRPEAFVTFGQTGSEEQVACIQARIGATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| Ga0137399_108844931 | 3300012203 | Vadose Zone Soil | SIHLRCEYPQVGIVTIEGRFLTRLATDLLDRPVVSAVVTVRAQSGEVLYNARDSFVWQPGR* |
| Ga0137399_117260002 | 3300012203 | Vadose Zone Soil | ARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVTAVVTVRDGAGDVLYRAQDSFEWRPSP* |
| Ga0137380_102888174 | 3300012206 | Vadose Zone Soil | ADTVHLRCDYPQVGTVTIDGRFLTRVVTQRLDIPVLSAVVTVRAPSGDTLYSARDSFVWHPSE* |
| Ga0137380_113244791 | 3300012206 | Vadose Zone Soil | VPCFEAQISAQNVHLRCDDQHVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGEILYRARDSFQWHPAE* |
| Ga0137376_114162201 | 3300012208 | Vadose Zone Soil | QITAQNVHLRCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAFVTVRTASGEILYSARDSFRWQPVD* |
| Ga0137376_115086141 | 3300012208 | Vadose Zone Soil | ISVDTIHLRCDDPQVGTVTIDGKFLTRLLTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPSN* |
| Ga0137379_109347562 | 3300012209 | Vadose Zone Soil | HLRCDYPQVGIVTIEGRFLTRLATNRLDTPAVSAVVTVRTGSGEVLYSARDSFVWNPGG* |
| Ga0137379_116765862 | 3300012209 | Vadose Zone Soil | ESPEVGTVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGH* |
| Ga0137378_104964751 | 3300012210 | Vadose Zone Soil | TVHLRCDDPQVGTVTIDGRFLTRLVTARLDAAVVSAVVTVRDGSGEVLYRARDSFEWHPGN* |
| Ga0137377_110990743 | 3300012211 | Vadose Zone Soil | SAQSVHLRCDDPQVGTVTIDGRFLTRSASNRLDAAVLSAVVTVRTGSGEILYSARDSFRWQARD* |
| Ga0137386_102832262 | 3300012351 | Vadose Zone Soil | VEARISADTVHLRCDYPQVGTVTIDGKFLTRVATQRLDAAVLSAVVTVRAASGETLYSARDSFVWHAGE* |
| Ga0137369_104497112 | 3300012355 | Vadose Zone Soil | LEAQISAQSVHLRCDDPQLGTVTIDGKFLTRFVTDRLDAAVLSAVVTVRAGNGETLYRARDSFEWHPRD* |
| Ga0137385_102517054 | 3300012359 | Vadose Zone Soil | CDYPQVGIVTIEGRFLTRLATNRLDTPAVSAVVTVRTGSGEVLYSARDSFVWNPGG* |
| Ga0137360_101746014 | 3300012361 | Vadose Zone Soil | MQARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDARVVAAVVTVRDGAGEVLYKAQDSFEWRPGN* |
| Ga0137390_114101452 | 3300012363 | Vadose Zone Soil | VTIDGRFLTRFVTTRLDAPVVSAVVTVWDGAGEVLYKAQDSFEWRPGN* |
| Ga0137397_108322571 | 3300012685 | Vadose Zone Soil | LDAKISAVSIHLRCEFPQVGSVTIDGRFLTRLATSRLDTPTVSAVVTIRAESGEVLYRARDSFVWEPGR* |
| Ga0137396_100259301 | 3300012918 | Vadose Zone Soil | TAATVHLRCDFPREGTVTIDGRFLTRLATSRLDAAVLSAVVTVRNGNGDVLYNARDAFVWHQAQ* |
| Ga0137396_103900671 | 3300012918 | Vadose Zone Soil | TAATVHLRCDFPREGTVTIDGRFLTRLATSRLDAAVLSAVVSVRNGNGDVLYNARDAFVWHQAQ* |
| Ga0137359_100742345 | 3300012923 | Vadose Zone Soil | AVSIHLRCEYPQVGIVTIEGRFLTRLATDLLDSPVVSAVVTVRAQSGEVLYNARDSFVWQPGR* |
| Ga0137416_102531641 | 3300012927 | Vadose Zone Soil | GSEEQVACIEARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVTAVVTVRDGAGDVLYRAQDSFEWRPSP* |
| Ga0137416_121796182 | 3300012927 | Vadose Zone Soil | CAEARASPDTLHLRCEYPELGTVTIDGRFLTRLATARFDTVVVSAVVTVTAAGRSLHSARYSFVWFEGE* |
| Ga0134077_100308261 | 3300012972 | Grasslands Soil | LRCDYPQVGTVSIDGKFLTRLATTSLDRPVVSAVVTVRDPTGEILYSARDSFVWHPAE* |
| Ga0134110_100507724 | 3300012975 | Grasslands Soil | SGAEQSVVCLEAVIRVDTLHLRCDDPQVGTVTIDGTFLTRLATTRLDAAVLTAVVTVRSGSGEVLYNARDRFEWQPGH* |
| Ga0134076_100744973 | 3300012976 | Grasslands Soil | YPQVGTVSIDGKFLTRLATTSLYRPVVSAVVTVRDPTGEILYSARDSFVWHPAE* |
| Ga0134076_104476522 | 3300012976 | Grasslands Soil | GCVEVLIRTETVHLRCDYPQLGTVSIDGRFLTRLATASLDRAVLSAVVTVRTASGEILYSARDSFVWHPESP* |
| Ga0134076_105613232 | 3300012976 | Grasslands Soil | LAQTTSGAEQAVACFEAAISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPSN* |
| Ga0134075_100537995 | 3300014154 | Grasslands Soil | HLRCDYPQVGTVTIDGRFLTRLATNSLDTAVLSAVVTVRTASGEILYSARDSFVWHPGE* |
| Ga0134073_102104912 | 3300015356 | Grasslands Soil | DETLIRADTVHLRCDYSRIGIITIDGKFLTRLVTTRFDAPVLAAVVAVRTPSGEILYRARDSFVWHPAE* |
| Ga0134089_104578961 | 3300015358 | Grasslands Soil | PQVGTVTIDGRFLTRLATNSLDTAVLSAVVTVRTASGEILYSARDSFVWHPAE* |
| Ga0132256_1036129021 | 3300015372 | Arabidopsis Rhizosphere | GTVTVDGRFLTRFVTNRPDAAVVSALVTVRTGSGEVLYSARDQFVWHPVDEASSR* |
| Ga0134069_10154875 | 3300017654 | Grasslands Soil | FARTPSGTEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK |
| Ga0134074_10126811 | 3300017657 | Grasslands Soil | TVPCFEAQISAQNVHLRCDDQHVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGETLYRARDSFEWHPAK |
| Ga0134083_100939612 | 3300017659 | Grasslands Soil | LRCEDRQVGSVTIDGKFLTRVATNRLDTPVVAAVVTIRSGSGDILYSARDSFAWDSAR |
| Ga0066655_100353346 | 3300018431 | Grasslands Soil | RCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRTGSGEILYSARDSFVWHP |
| Ga0066655_101072481 | 3300018431 | Grasslands Soil | RCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD |
| Ga0066655_105585791 | 3300018431 | Grasslands Soil | QVGTVTIDGRFLTRLATKRLDAPVVSAVVTVRAASGDTLYSARDAFLWHPRD |
| Ga0066667_115548572 | 3300018433 | Grasslands Soil | LPVACLETVIRADSVHLRCDDPQLGMVTTIDGRFRTRLATDRLDTAVLSAVVTVRSGSGDIMYRARDSFKWHPADSSGTGPSTRN |
| Ga0066662_128137002 | 3300018468 | Grasslands Soil | FEAQNSEQNVHHRSDDPHVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGETLYRARDSFEWHPAE |
| Ga0066669_114167391 | 3300018482 | Grasslands Soil | QLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDATGEILYRARDSFEWHPGN |
| Ga0066669_115342621 | 3300018482 | Grasslands Soil | TIDGKFLTRLLTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPGN |
| Ga0209238_10459281 | 3300026301 | Grasslands Soil | DDPQLGMVTTIDGTFHARLATDRLDTTVLSAVVTVRSGSGEVLYRARDSFKWHPVD |
| Ga0209239_13237192 | 3300026310 | Grasslands Soil | VTLAQTTSGAEQAVACFEAAISVDTIHLRCDDPQVGTVTIDGKFLTRLLTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPSN |
| Ga0209761_10468256 | 3300026313 | Grasslands Soil | FVTFGQSGSEEQVACMQARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDARVVAAVVTVRDGAGEVLYKAQDSFEWRPGN |
| Ga0209761_12095924 | 3300026313 | Grasslands Soil | QNVHLRCDDPKVGTVTIDGKFLTRLVTNRLDAAVVSAVVTVRTGSGEILYRARDSFEWHPAE |
| Ga0209761_12492291 | 3300026313 | Grasslands Soil | ACFEATISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAAVVTAVVTVRSGSGEILYNARDRFEWHPSN |
| Ga0209686_10292773 | 3300026315 | Soil | HLRCDYSRVGTITIDGRFLTRLATTHLDARVLSAVVTVRTPSGEILYRARDSFVWHPAE |
| Ga0209155_10957341 | 3300026316 | Soil | SGTEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK |
| Ga0209266_11052624 | 3300026327 | Soil | FAQTTVACVEAQISAQNVHLRCDDPQVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD |
| Ga0209802_12297952 | 3300026328 | Soil | KISAQDLHLRCDDPQLGTVTIDGKFLTRLVTSRLDAAVVSAVVTVRDGSGEILYRARDSFEWHPGN |
| Ga0209158_10263057 | 3300026333 | Soil | ATITAVSIRLRCESPEVGVVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGH |
| Ga0209158_13011132 | 3300026333 | Soil | GTVTIEGRFLTRVATTLLDTPVVSAVVTVHAPSGEVYSARDSFVWQAGR |
| Ga0209158_13627382 | 3300026333 | Soil | IRADTVHLRCDYSRVGTITIDGKFLTRLAATRLDAAVLSAVVAVRTPSGEILYRARDSFEWHPPE |
| Ga0209057_10312601 | 3300026342 | Soil | GCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYRARDSFKWHPRDSGEPGT |
| Ga0209378_10490656 | 3300026528 | Soil | QVGTVTIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD |
| Ga0209807_12524171 | 3300026530 | Soil | DPQLGMVTTIDGRFRTRLATDRLDTAVLSAVVTVRSGSGDIMYRARDSFKWHPADSSGAGPSTRN |
| Ga0209160_13293821 | 3300026532 | Soil | TTSGAEQAVACFEAAISVDTIHLRCDDPQVGTVTIDGKFLTRLVTTRLDAAAVTAVVTVRSGSGEILYNARDRFEWHPSN |
| Ga0209157_11158721 | 3300026537 | Soil | PDAFVAFARTPSGTEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSRLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK |
| Ga0209056_101460836 | 3300026538 | Soil | PQLGIVTIDGKFLTRLVTSRLDAAVVSAVVTVRDATGEILYRARDSFEWHPGN |
| Ga0209156_1000237021 | 3300026547 | Soil | GAEQPMACLDVLIRADTVHLRCDDPQVGTVTIDGRFRTRVATHRLDTAVLAAVVTVRSGSGEILYRAQDSFKWHPRDSGEPGT |
| Ga0209156_102670423 | 3300026547 | Soil | GAEQPMACLDVLIRADTVHLRCDDPQVGTVTIDGRFRTRVATNRLDTAVLAAVVTVRSGSGEILYQARDSFKWHPRDSGDPGT |
| Ga0209156_102937472 | 3300026547 | Soil | TFARTPSGTEESVACLEARIRPETVHLRCDDPQVGTVTVDGRFLTRFATSQLDAAVLTAVVTVRDGSGEVLYRAQDSFQWQPGK |
| Ga0209474_107457301 | 3300026550 | Soil | TIDGRFLTRLASNRLDAAVLSAVVTVRAGSGEVLYSARDSFRWQPVD |
| Ga0209076_11724331 | 3300027643 | Vadose Zone Soil | TLRPEAFVTFGQSGSEEQVACMQARISATAFHLRCDDPQLGSVIIDGRFLTRFVTTRLDARVVAAVVTVRDGAGEVLYKAQDSFEWRPGN |
| Ga0209388_10349353 | 3300027655 | Vadose Zone Soil | EQVACMQARISATAFHLRCDDPQLGSVTIDGRFLTRFVTTRLDAPVVSAVVTVRDGAGEVLYKAQDSFEWRPGN |
| Ga0209178_11123822 | 3300027725 | Agricultural Soil | DDQQVGTVTIDGRFLTRFATSRLDAAVLSAVVTVRSGSGEILYNGRDSFQWRPAQ |
| Ga0209177_101170422 | 3300027775 | Agricultural Soil | VGTVTIDGRFLTRFATDRLDAAVLSAVVTVRSGSGEILYNGRDSFQWRPAQ |
| Ga0209180_103367963 | 3300027846 | Vadose Zone Soil | RLRCNSPQVGTVTIDGRFLTRLATTRLDIPVMSAVVTVRTASGEIVYSARDSFLWEPGR |
| Ga0307479_105697281 | 3300031962 | Hardwood Forest Soil | DAVIRPETLHLRCDDPQLGSVTIDGRFLTRFVTRRLDAPVLSAVVTVRSGSGEILYRARDSFEWHPTE |
| Ga0307479_110781991 | 3300031962 | Hardwood Forest Soil | HLRCDDLQIGTVTIDGRFLTRLVTKRLDAPVVSAVVTVRSGSGDIVYRARDSFEWHPKDP |
| Ga0315912_108769811 | 3300032157 | Soil | SAVHLKCDFPQVGTVTVEGRFLTQLATQRLETAALSAQITVRSGGGETLYRARDSFQWHPAD |
| Ga0307471_1003672891 | 3300032180 | Hardwood Forest Soil | ACLDAVIKPTTIHLRCDDPQMGTVTVDGRFLTRFVTNRPDAAVMAALVTVRTGSGEVLYSARDQFVWRPVDEASSR |
| Ga0307471_1043255081 | 3300032180 | Hardwood Forest Soil | QTAAGTDQPVACLDAVIKPTTVHLRCDDPQMGTVTVDGRFLTRFVTNRPDAAVVSALVTVRTGSGEVLYSARDQFVWHPVDEASGR |
| Ga0307472_1017570712 | 3300032205 | Hardwood Forest Soil | DAVIKPTTVHLRCDDPQMGTVTVDGRFLTRFVTNRPDAAAVSALVTVRTGSGEVLYSARDQFVWHPVDEASSR |
| Ga0214471_100812431 | 3300033417 | Soil | AACFEALITPETIHLRCDYPQVGIVTIDGRFLTRSATDRLDTPVLSAVVTVRNASGEILYSARDSFVWHPGE |
| ⦗Top⦘ |