| Basic Information | |
|---|---|
| Family ID | F088350 |
| Family Type | Metagenome |
| Number of Sequences | 109 |
| Average Sequence Length | 138 residues |
| Representative Sequence | LPRITPYSFFISFFLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGY |
| Number of Associated Samples | 98 |
| Number of Associated Scaffolds | 109 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 30.77 % |
| % of genes near scaffold ends (potentially truncated) | 11.93 % |
| % of genes from short scaffolds (< 2000 bps) | 7.34 % |
| Associated GOLD sequencing projects | 94 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.31 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (88.073 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (23.853 % of family members) |
| Environment Ontology (ENVO) | Unclassified (34.862 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (46.789 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | Yes | Secondary Structure distribution: | α-helix: 18.18% β-sheet: 19.48% Coil/Unstructured: 62.34% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.31 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 109 Family Scaffolds |
|---|---|---|
| PF03808 | Glyco_tran_WecG | 12.84 |
| PF13439 | Glyco_transf_4 | 5.50 |
| PF05711 | TylF | 4.59 |
| PF09721 | Exosortase_EpsH | 3.67 |
| PF01370 | Epimerase | 2.75 |
| PF05016 | ParE_toxin | 1.83 |
| PF11885 | DUF3405 | 0.92 |
| PF05050 | Methyltransf_21 | 0.92 |
| PF13884 | Peptidase_S74 | 0.92 |
| PF01797 | Y1_Tnp | 0.92 |
| PF01850 | PIN | 0.92 |
| COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds |
|---|---|---|---|
| COG1922 | UDP-N-acetyl-D-mannosaminuronic acid transferase, WecB/TagA/CpsF family | Cell wall/membrane/envelope biogenesis [M] | 12.84 |
| COG1943 | REP element-mobilizing transposase RayT | Mobilome: prophages, transposons [X] | 0.92 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 88.07 % |
| All Organisms | root | All Organisms | 11.93 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300005180|Ga0066685_10015101 | All Organisms → cellular organisms → Bacteria | 4482 | Open in IMG/M |
| 3300005713|Ga0066905_100704773 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 867 | Open in IMG/M |
| 3300006046|Ga0066652_100043979 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 3316 | Open in IMG/M |
| 3300012198|Ga0137364_10293447 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1206 | Open in IMG/M |
| 3300012202|Ga0137363_10014471 | All Organisms → cellular organisms → Bacteria | 5149 | Open in IMG/M |
| 3300012210|Ga0137378_10176536 | All Organisms → cellular organisms → Bacteria | 1985 | Open in IMG/M |
| 3300012354|Ga0137366_10645082 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 757 | Open in IMG/M |
| 3300012361|Ga0137360_11276015 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 635 | Open in IMG/M |
| 3300012929|Ga0137404_10015884 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 5330 | Open in IMG/M |
| 3300013296|Ga0157374_12788331 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 516 | Open in IMG/M |
| 3300015372|Ga0132256_100433448 | All Organisms → cellular organisms → Bacteria → PVC group → Lentisphaerae → Lentisphaeria → Lentisphaerales → Lentisphaeraceae → Lentisphaera → Lentisphaera araneosa | 1420 | Open in IMG/M |
| 3300019789|Ga0137408_1030386 | All Organisms → cellular organisms → Bacteria | 5662 | Open in IMG/M |
| 3300021560|Ga0126371_12273359 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 655 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 23.85% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 11.01% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.09% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.26% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.59% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.59% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.67% |
| Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 2.75% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 2.75% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.75% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.83% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.83% |
| Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.83% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.83% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.83% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.83% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.83% |
| Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 1.83% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 1.83% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.92% |
| Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.92% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.92% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.92% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.92% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.92% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.92% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.92% |
| Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.92% |
| Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.92% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002899 | Soil microbial communities from Manhattan, Kansas, USA - Combined assembly of Kansas soil 100-500um Nextera (ASSEMBLY_DATE=20140607) | Environmental | Open in IMG/M |
| 3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental | Open in IMG/M |
| 3300005339 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG | Host-Associated | Open in IMG/M |
| 3300005340 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental | Open in IMG/M |
| 3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental | Open in IMG/M |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
| 3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental | Open in IMG/M |
| 3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300009553 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG | Host-Associated | Open in IMG/M |
| 3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental | Open in IMG/M |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental | Open in IMG/M |
| 3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental | Open in IMG/M |
| 3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental | Open in IMG/M |
| 3300010337 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental | Open in IMG/M |
| 3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
| 3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental | Open in IMG/M |
| 3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental | Open in IMG/M |
| 3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental | Open in IMG/M |
| 3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental | Open in IMG/M |
| 3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental | Open in IMG/M |
| 3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental | Open in IMG/M |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental | Open in IMG/M |
| 3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental | Open in IMG/M |
| 3300012892 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012958 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MG | Environmental | Open in IMG/M |
| 3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental | Open in IMG/M |
| 3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental | Open in IMG/M |
| 3300013296 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaG | Host-Associated | Open in IMG/M |
| 3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated | Open in IMG/M |
| 3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M |
| 3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental | Open in IMG/M |
| 3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated | Open in IMG/M |
| 3300014968 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaG | Host-Associated | Open in IMG/M |
| 3300015200 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S209-509C-1 (version 2) | Environmental | Open in IMG/M |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
| 3300015372 | Soil combined assembly | Host-Associated | Open in IMG/M |
| 3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental | Open in IMG/M |
| 3300018073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300019362 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1 (version 2) | Environmental | Open in IMG/M |
| 3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300020012 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s1 | Environmental | Open in IMG/M |
| 3300020062 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a1 | Environmental | Open in IMG/M |
| 3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental | Open in IMG/M |
| 3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025919 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025926 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300025986 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG (SPAdes) | Host-Associated | Open in IMG/M |
| 3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental | Open in IMG/M |
| 3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental | Open in IMG/M |
| 3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental | Open in IMG/M |
| 3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGIcombinedJ43975_100227671 | 3300002899 | Soil | MTFTRRVKGNSVKTVDAIKPDWKGGLFIGFFLVFLGSTVFAGTSPVTTEAQGNPSPTAAVVQSPAAKQGLNFVVGAHGLDSLSFNGQSLLVSPQSGELQLQKSAFRAVLDALLPGSSSRVATPNKPADAIDLSY |
| Ga0062595_1004110101 | 3300004479 | Soil | MEIMGPISLAFLAATVFADTRLAKAESQQNQTPNSSVVQSPASKQGLNFVVGAQGLDSLSFNGQSLLVSSESGELQPQKSVFRAVLDALVPRSSPRVATPDKKADTVDLSYPWGRISCAYGKQDDR |
| Ga0062595_1023605151 | 3300004479 | Soil | MISGQWQAVAPQIDC*AFQLCSFSHQVPHIIARSSFISFFLALLAGTAFADTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPERGELRLQKSVFRAVLDALLPGSSSRAGTPNKADTIDLSYPLGR |
| Ga0066685_100151011 | 3300005180 | Soil | MSRGLLSDVMDQRSVIRSQTSVVGRHVTVHSSLIGFFLALLAPTVFAGPSPVKTESQRNRTPTPAVVQSPAAKQGLNFVVGARGLESLSFNGQSLLVSPASGELQPEKSVFRAVLDAFLSRSSSRVATPNKPADTVDLSYPWGRVSCAYAKQDDRITMRITVS |
| Ga0066685_103660331 | 3300005180 | Soil | LPHITPHSSLIGFFLALLAATAFGGASPVETESQQDQTSTPEVRQSPAPKQGLNFAVGAHGLDSLSFNGQSFLVSPESGELQPQKSVFRAVLEALFPRSSPGVAKRNESSDTVDLTYPWGRISCVYG |
| Ga0066678_100251681 | 3300005181 | Soil | VPHIIVRSSFIGFSLALLAATVFTDTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLDALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKITMKIEISNTSSE |
| Ga0070660_1007922012 | 3300005339 | Corn Rhizosphere | MKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGYSKQGDRLTMRLEVSNTSS |
| Ga0070689_1001739312 | 3300005340 | Switchgrass Rhizosphere | MDRSGNQNQRTNEGLIRMKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPEGGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGYSKQGDRLTMRLEVSNTSS |
| Ga0066689_100320441 | 3300005447 | Soil | VPHIIVRSSFIGFSLALLAATVFTDTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLDALLPGSSSRAGTPNKKADTIHLSYPWG |
| Ga0070695_1007934952 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MPYQLPHITRHGSFIGFFLALLATTVFAGTTPVKTESQRNRTPTPAVLQSPVSKQGLNFVVGARGLDSLSFNGQSLLASPESGELQPQKSVFRAVLDVLLPRSLSPSATPNKNTDTIDLSYPWGRISCAY |
| Ga0070704_1001824582 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | VSHVARSLFISFFLVFLAGTVFANASSVKTGSQRNGTLTPALVQSPAANQGLSFSVGDHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPTPTPTKNTDTVDLTYPWGRISCAYGKQGNRLTMRLEVSN |
| Ga0066701_104792602 | 3300005552 | Soil | MTFTRRVTGNSVKTVEAIKADWKWGTFIGFFLPLLAATVFADTSPVKTESQQNRTPTPAVLQSPASKQGLNFAVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALVPRSSSPSATPNKNTDTIDLSYPWGRISCAYGKQD |
| Ga0066701_106420471 | 3300005552 | Soil | VPHIIARSSFISFSVALLAATVFADTSPLKKESQRNRTPGPALLQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIHLSYPWGRISCSY |
| Ga0066661_103431082 | 3300005554 | Soil | MTPRSSFTGFLLVLLAATVFACTSPAKAESRPNRMPTPAVLQSDAPKQGLDFVVGAHGLDSLSFNGQSLLVSPETGELRPQKSVFRAVLDAILPRPSPRVATPSKQTNTVDLSCPWGRVSCAYGKQDDRLTMRIEVSNTSEEPLDDFSI |
| Ga0066700_106187871 | 3300005559 | Soil | VPHIIARSSFISFSVALLAATVFADTSPLKKESQRNRTPGPALLQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLDALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKITMKIEISNTSSE |
| Ga0066905_1007047733 | 3300005713 | Tropical Forest Soil | MTFMRRIARGSFIRLFLAISAATVLAGASLIRAEPQRNRTPSPAVLQSPAPKQGLNFVVGARGLDSLSFNGQSLLVSPASGEFQPQKSAFRAVLDAVLPFSSAGVATPNKRPDTIDLSYPWGRVSCVYGKQGDRLKMSIEVSNVSAKTIDQLSLRLMEL |
| Ga0068861_1006267032 | 3300005719 | Switchgrass Rhizosphere | MPYQLPHITPHGSFIGFFLALLATTVFAGTTPVKTESQRNRTPTPAVLQSPVSKQGLNFVVGAHGLDSLSFNGQSLLASPESGELQPQKSVFRAVLDALLPGSSSRVATPNKQADTIDLSYPWG |
| Ga0066903_1019161543 | 3300005764 | Tropical Forest Soil | MKFLLALLAATVFADPGAEKTESQRNPTPTAAARQSPAPKQGLNFVVGARGLDSLSFNGQSLLTSTDHGELQPWKSVFRAVLDAVLPLSSAGVATPNKRPDTIDLSYPWGRISCAYRKQDDRLTMRI |
| Ga0066652_1000439795 | 3300006046 | Soil | LPHITPHSSLIGFCLALLAATAFAGTSPVETVSQRNGTPTPAVVPSPAPKQGLNFVVGAYGLDSLSFDGQSFLLSPESGELQPQKSVFRAVFDALLPRSSSPSATPNKQADTIDLSYPWGRISCAYGKQHDKLTMRIEVSNTSSEPLNEFSVRLME |
| Ga0075435_1008326741 | 3300007076 | Populus Rhizosphere | MAFTNANRSLNGFVISLISFLVSFAPSVFADTRPSKAESQRNGTQTPAVFQSPAPKEGLNFVVGSHGLESLSFNGQSLLRSAQEGELQPQRSGLRAVLDALFSRSSSEVAMTNKQPDTIDLSYRWGRVSCAYGKQENRITMK |
| Ga0099793_100521612 | 3300007258 | Vadose Zone Soil | VPHIIVRSSFIGFSLALLAATVFTDTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRPQKSVFRAVLEALLPGSSSRARTPNKKADTIDLSYPWGRISCSYGKEDDKITMKI* |
| Ga0066710_1001622125 | 3300009012 | Grasslands Soil | MVAGAYRSSDRLLNSLISLFLVSLAATVWADMSPVKTGSQQNRTQTSALVESPPPKQGLNFAVGERGLTSLSFNGQSLLASSENGELQPQKSVIRAMLDALLPRPSSGVAIGKKQPNTIDLTYRWGRISCAYGKQDNV |
| Ga0066710_1018535542 | 3300009012 | Grasslands Soil | LLNFSTLLIPYQLPHITPYSSFIGFFLALLAATAFAGTSPVKTVSQRNGTPTPAVVLSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELQPQKSVFRAVLDALLPRSSSRVATPNKAADTVDLKYPWGCISCAYRKQDDRLTMRIEV |
| Ga0066709_1005294284 | 3300009137 | Grasslands Soil | MVAGAYRSSDRLLNSLISLFLVSLAATVWADMSPVKTGSQQNRTQTSALVESPPPKQGLNFAVGERGLTSLSFNGQSLLASPENGELQPQKSVIRAMLDALLPRPSSGVAIGKKQPNTIDLTYRWGRISCAYGKQDNVITMKIEVSNTGSEPLNGFSLRLMELIFP |
| Ga0075423_130655511 | 3300009162 | Populus Rhizosphere | MTFTNANRSLNGFVISLISFLVSFAPSVFADTRPSKAESQRNGTQTPAVFQSPAPKEGLNFVVGSRGLESLSFNGQSLLRSAQEGELQPQRSGLRAVLDALFSRSSSEVAMTNKQPDTIDLSYRWGRISCAYGKQENRITMKIEVSNTGSEPLHE |
| Ga0105249_100312596 | 3300009553 | Switchgrass Rhizosphere | VSHVARSLFISFFLVFLAGTVFANASSVKTGSQRNGTLTPALVQSPAANQGLSFSVGDHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPSATPTKNTDTVDLTYPWGHISCAYG |
| Ga0105249_106487361 | 3300009553 | Switchgrass Rhizosphere | MWSAVVRSSFIVFSVAVSAATACADPDAEKTEPQRDPTATAAPRLSSASKQGLSFVVGARGLDSLSYNGESLLVSPKTGELQTEKSALRTVLDAILPLSSAGVATANKKPNTIDLNYSWGRVSCAYGKQGDRLTMCIEV |
| Ga0099796_104908891 | 3300010159 | Vadose Zone Soil | VPHIIVRSSFIGFSLALLAATVFADTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIDLSYPWGRISCSYGKEDDKITMK |
| Ga0134070_100674271 | 3300010301 | Grasslands Soil | VKTVEAIKPDWKWGTFIGFFLPFLAATVFADTNPLKTESQGNPTVTPAVVQSPDTKQGLKFVVGSHGLDSLSFNGQSLLVSLESGELQPQKSVFRAVLDALVPRSSPRAATPDIKADTVDLSYPWGRI |
| Ga0134082_103970431 | 3300010303 | Grasslands Soil | VIGYQLSALLAATVFAGTSPVKTESQRNRIPTSAVLQSSAPKQGLDFVVGAHGLDSLSFNGQSLLVSPENGELQPQKSVFRAVLDAIFPRSSSQVATPNKRRDTIDLSYPWGGISCAYGKQDDRLTMRIEV |
| Ga0134088_100289354 | 3300010304 | Grasslands Soil | VKTVEAIKADWKWGTFIGFFLPLLAATVFADTSPVKTESQQNRTPTPAVLQSPASKQGLNFAVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALVPRSSPRVATPDKKADTVDLSYPWGRISCAYGKQDDRLTMRLEVSNTSS |
| Ga0134067_101018251 | 3300010321 | Grasslands Soil | MKFLLALLAATAFADTGAEKTESQRNPTPIAAVRQSSAPKQGLNSVVGARGLDSLSFNGESLLVSLERGELQPQKSAFRTVLDAVLPLSSAGAATPNKRPDMIDLSYPWGRISCAYGKQDDRLTMRIEAFNTSSKAVNEFSVRLME |
| Ga0134064_101138551 | 3300010325 | Grasslands Soil | VIGYQLSALLAATVFAGTSPVKTESQRNRIPTSAVLQSSAPKQGLDFVVGAHGLDSLSFNGQSLLVSPENGELQPQKSVFRAVLDAIFPRSSSQVATPNKKTDTVDLSYPW |
| Ga0134063_107004051 | 3300010335 | Grasslands Soil | LPHITPHSSLIGFCLALLAATAFAGTSPVETVSQRNGTPTPAVVPSPAPKQGLNFVVGAYGLDSLSFDGQSFLLSPESGELQPQKSVFRAVFDALLPRSSSPSATPNKQADTIDLSYPWGRISCAYGKQDDKI |
| Ga0134071_100267384 | 3300010336 | Grasslands Soil | MKTVEAIKADWKWGTFIGFFLPLLAATVFADTSPVKTESQQNRTPTPAVLQSPASKQGLSFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSIFRAVLDALVPRSSSPSATPNKNTDTIDLSYPWG |
| Ga0134062_104382221 | 3300010337 | Grasslands Soil | VIGYQLSALLAATVFAGTSPVKTESQRNRIPTSAVLQSSAPKQGLDFDVGAHGLDSLSFNGQSLLVSPENGELQPQKSVFRAVLDAIFPRSSSQVATPNKKTDTVDLSYPWGRISCAYGKQDDRI |
| Ga0126370_118559001 | 3300010358 | Tropical Forest Soil | MVASDNSSSKGLLSSLISFFLVLLTAAVFADTGPVNTESQHNGVPTPAVFQSPVPKQGLNFVVGSLGLESLSFNGQSLLASPENGELRPEKSFFRAVLDAFLPGSSAGVATPNKKGDIVDLSYPWGHVSCSYGKQGD |
| Ga0126376_124321652 | 3300010359 | Tropical Forest Soil | MKFLLALLAATACADPGAEKTKSQRNPTLTAGARLSPAPKQGLNFVVGNHGLDSLSFNGESLLVSPESGELQPQKSVFRAVLDALLPFSSAGVATPNKPADTIDLSYSWGRISCTYGKQGDKITMRIEVSNTSSE |
| Ga0126378_103480861 | 3300010361 | Tropical Forest Soil | MELRSVEARVRALTAFIGFFLALLATTAFAGTSALKTESQQDRTPSAAVFQSPAPKQGLNFVVGAHGLDSLWFNGQSLLLSPENGELQPQKSVFRAVLEALFPRSSSPIATPNENTDTVDLSYPWG |
| Ga0134066_102562451 | 3300010364 | Grasslands Soil | VKTVEAIKPDWKWGTFIGFFLPFLAATVFADTNPVKTESQGNPTVTPAVVQSPDTKQGLKFVVGSHGLDSLSFNGQSLLVSLESGELQPQKSVFRAVLDALVPRSSPRAATPDIKADTVDLSYPWGRISCAYGKRDDR |
| Ga0126381_1039972841 | 3300010376 | Tropical Forest Soil | MELRSVEARVRALTAFIGFFLALLAATAFAGTSALKTESQQDRTPSAAVFQSPAPKQGLNFVVGAHGLDSLWFNGQSLLLSPENGELQPQKSVFRAVLDALLSPPSSRVATPNQRADTVDMSFPWGRISCAYGKQDDRVTIRIEVS |
| Ga0134126_129163821 | 3300010396 | Terrestrial Soil | VPHTPPHSSFIGFFLPFLAATVFADMSLVKTESQQNRGPSPAVLQSPASKQGLNFVVGAHGLDSLSFNGQSFLVSPESGELQPQKSFFRAVLDALLPRSSSPSATPNKNTDTVDLS |
| Ga0134124_115953141 | 3300010397 | Terrestrial Soil | MPYQLPHITPHGSFIGFFLALLATTVFAGTTPVKTESQRNRTPTPAVLQSPVSKQGLNFVVGAHGLDSLSFNGQSLLASPESGELQPQKSVFRAVLDALLPGSSSRVATPNKQADTIDLSYPWGHVSCAYSKQDDKITMRIEVSNT |
| Ga0134123_109130442 | 3300010403 | Terrestrial Soil | VSHVARSSLINFFLALLAATVFADASPVKTESQQNGTLTPAVVQSPAANQGLRFSMGAHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPTPTPTKNTDTVDLTYPWGRISCAYGKQGNRLTMRLEVSNTSSKPLN |
| Ga0137364_102934472 | 3300012198 | Vadose Zone Soil | MSYQPLPHITAHSSLIGFFLALFAATVFAGTAPLMTESQGNQTPTLAVVPSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELQPQKSVFRAVLDAFLPRSSSRVATPNKAADTVDLKYPWGCISCAYGKQDDRLTMRIDVSNTSSEPLNEFS |
| Ga0137365_102743472 | 3300012201 | Vadose Zone Soil | MSYQPLPHITAHSSLIGFFLALFAATVFAGTAPLMTESQGNQTPTLAVVPSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELQPQKSVFRAVLDAFLPRSSSRVATPNKAADTVDLKYPLGCISCAYRKQDDRLTMRIEVSNTSSEPLN |
| Ga0137365_104024742 | 3300012201 | Vadose Zone Soil | VAVASANRSSNRFVISLISFFLVLLAASVFADTSPVKAESQRNGAPAPAAFQSPAPKQGLNFVVGSRGLESLSFNGQSLLRSAQEGELQPQKSVYRAVLDALLPGPSSGVAMANIQPDTIDLSYRWGRISCAYGKQDNR |
| Ga0137363_100144711 | 3300012202 | Vadose Zone Soil | LTVVTLTSILSRTRERRKVAADAQRERVISDLLSVISYQLSFFLVLFAATVFAGTSPLKAESQRNQAPTPILVQPPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSSQVATPNKQTDTVDLSYPSGRVSCAYGKQGDRLTMSIEVSNASAKTIDQLS |
| Ga0137362_101452764 | 3300012205 | Vadose Zone Soil | MCGWIEVMGPIFVAFFAATVFADTSLLNTESQQNRTPTPAVLQSPDSKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDAFFPRSSSPSATPNKNTDTIDLSYPWGRISCAYGKQDDRLTMR |
| Ga0137362_109952401 | 3300012205 | Vadose Zone Soil | MSYLAVLALAVFAVTSPVKAESQRNRTSTPSVTQSPAPKQGLNFAVGERGLTSLSFNGQSLLVSPESGEFQPQKSVFGAVLDALLPRSPLPVAAPNKTPDTVDITYPWGRLS |
| Ga0137380_100737471 | 3300012206 | Vadose Zone Soil | MAVASAHRSSDRFLSFLISFFLVLLAVTVVADTSPVKTESQRNRAPTPAVLQSPVPKQGLNFVVGSHGLDSLSFNGQSLLVSPENGELRPQKSVFHAVLDALLPRSSPGVATPNKNGDTIDLNYPWGRVSCSYGKQGDKITMRIEV |
| Ga0137379_113669842 | 3300012209 | Vadose Zone Soil | VSNFSTLHIPYQLSLITPHSSLIGFFLALFAATVFAGTAPLMTESQGNQTPTPAVVPSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELQPQKSVFRSVLDAFLPRSSSRVATPNKAADTVDLKYPWGCISCAYGKQDDR |
| Ga0137378_101765363 | 3300012210 | Vadose Zone Soil | VPHIIARSSFISFSVALLAATVFAETSPLKKESQRNRTPGPALLQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKITMKIEISNTSSEPLNELSIRLM |
| Ga0137377_101068541 | 3300012211 | Vadose Zone Soil | VKFSTLLIPYQLPHITPHSSFIGFFLALFAATLFAGTSPVKTESQRNQTPTPAALQSPAPKQGLNFLVGAHGLDSLSFNGQSLLVSPESGDLQPQKSVFRAVLDAILPRPSSQVATTNKPADTVDLSYPWGRVSCAYAKQDDRITMRITVSNA |
| Ga0137386_110858562 | 3300012351 | Vadose Zone Soil | MRYQPLPHITAHSSLIGFFLALFAATVFAGTAPLMTESQGNQTPTPAVVPSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELQPQKSVFRSVLDAFLPRSSSRVATPNKAADTVDLKYPVGCISCAYR* |
| Ga0137366_100358901 | 3300012354 | Vadose Zone Soil | MSYQPLPHITAHSSLIGFFLALFAATVFAGTAPLMTESQGNQTPTLAVVPSPAAKQGLNFVVGARGLDSLSFDGQSFLLSPESGELESQKSVFRAVLDAFLPRSSSRVATPNKAADTVDLKYPWGCISCAYGKQDDRLTMRIEVSNTSSEP |
| Ga0137366_106450821 | 3300012354 | Vadose Zone Soil | MPHITPHSSFIGFCLALLAAIAFAGTSPVQTVSQRNGTPTPAVLQLPAPKQGLNFVVGTRGLDSLSFDGESFLLSPESGELQPQKSVFRAVLDAFLPRSSSRVATPNKQADTINLSYPWGRISGAYGKQDDRLTMRIEVSNTSSEPLNEFSVR |
| Ga0137384_109612541 | 3300012357 | Vadose Zone Soil | VKTVEAIKPDWKWGPFIGFLLPFLAATVFADTSLVKTESQQNRAPTPALLQSPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSPRVADPNKNTDIVDLSYPW |
| Ga0137368_105197951 | 3300012358 | Vadose Zone Soil | MEVMAPIVLAFLAVTVFADTSLVKTELQQNRTPTPAVLQSPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGKLQPQKSVFRAVLDALVPRSSPRVVTPGKKANTVDLSYPWGRLSCAYGKQDDRL |
| Ga0137375_105023591 | 3300012360 | Vadose Zone Soil | MEVMAPIFLAFLAVTVFADTSLVKTELQQNRTPTPAVLQSPASKQGLNFVVGAHGLDSLLFNGQSLLGSRESGELQPQKSVFRAVLDALVPRSSPRVVTPGKKADTVDLSYPWG |
| Ga0137360_112760151 | 3300012361 | Vadose Zone Soil | VISDLLSVISYQLSFFLVLFAATVFAGTSPLKAESQRNQAPTPVLVQPPASKPGLNFVVGAHGLDSLSFNGQSLLVSPASGELQPQKSVFRTVLDALLPRSSSRVATPNKPADTVDLSYPWGRVSCAYAKQDDRITMRITVSNASEKPIDQLSLR |
| Ga0137361_113215441 | 3300012362 | Vadose Zone Soil | VPHIIARSSFISFFLALLAGTAFADTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIHLSYPWGRIS |
| Ga0137358_100718984 | 3300012582 | Vadose Zone Soil | MGPIFVAFFAATVFADTSLLKTESQQNRTPTPAVLQSPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDAFFPRSSSPSATPNKNTDTIDLSYPWGRISCAYGKQ |
| Ga0157294_103043011 | 3300012892 | Soil | LPRITPYSFFISFFLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRI |
| Ga0137359_105730004 | 3300012923 | Vadose Zone Soil | VVGDLLSVISYQLSFSLAVLAATVFTVASPVKTESQRNRIPIPAVLQSPAPKQGLNFVVGAHGLDSLSFNGQSLLASPESGELQPQKSVFRAVLDALLPRSSSPVATPNKPADT |
| Ga0137416_122402021 | 3300012927 | Vadose Zone Soil | MVASANRSSDRFLSSLISFFLVLLAASVVADTSPVKAESQRNGAPAPAAFQSPAPKQGLNFVVGSRGLESLSFNGQSLLCSAQEGELQPQKSVFRAVLDALLPGPSSGVAMANMQPDTIDLGYRWGRISCAYGKQDNRITMRIEVSNTGPEPLNEF |
| Ga0137404_1001588412 | 3300012929 | Vadose Zone Soil | LLKSSILLIFYQVPPVARSLFISFFLALLAATVFADTSPLKTESQQNRTPTPPVVQSPASKQGLSFVVGAHGLDSLSFNGQSLLISPESGELQPQKSLFRAVLDALVSRSSSPSATRNKNSDTIDLSYPWGRISCAYGKQDDRLTMRLE |
| Ga0164299_102601982 | 3300012958 | Soil | VRHIAPRSSFIGFSLAFLAAAVFADTSPVKKELPANSTPTRVLQSPASKQGLNFVVGAHGLDSLSFNGQSLIGSPESGELQPQKSVFRTVLDAIVPRSSPRIATPDKKADTVDVSYPSGRISCAYGKQDDRLTMRLEV |
| Ga0164307_105059461 | 3300012987 | Soil | VIRLRQGCCGQVGYQLSAFLAATVVVGTNSVKAESQQNRTPTPAVLQSPAPKQGLNFVVGPRGLTSLLFNGQSLLASPESGELQPQKSVFRAVLDALVPRSSPPVATPNQHTDTVDVSY |
| Ga0164306_115911431 | 3300012988 | Soil | VRHIAPRSSFIGFSLAFLAAAVFADTSPVKKESPANSTPTRVLQSPASKQGLNFVVGAHGLDSLSFNGQSLIGSPESGELQPQKSVFRTVLDALVPRSSPRIATPDKKADTVDVSYP |
| Ga0157374_127883311 | 3300013296 | Miscanthus Rhizosphere | VRHIAPRSSFIGFSLAFLAAAVFADTSPVKKESPANSTPTRVLQSPASKQGLNFIVGAHGLDSLSFNGKSLLVSPASGELQPEKSVFRTVLDALVPRSSPRIATPDKKADTVDVSYPSGRISCAYGKQDDRLTMRLEVSNTSSKPLNELSLRL |
| Ga0157378_109280012 | 3300013297 | Miscanthus Rhizosphere | VSHVARSLFISFFLVFLAGTVFANASSVKTGSQRNGTLTPALVQSPAANQGLSFSVGDHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPSATPTKNTDTVDLTYPWGHISCAYGKQGNRL |
| Ga0134081_103620881 | 3300014150 | Grasslands Soil | VVGRHVTVHSSLIGFFLALLAATVFAGPSPVTPESQRNRTPTPAVLQSPAPKQGLNFVVGAHGLESLSFNGQSLLVSPASGELQPEKSVFRAVLDAFLSRSSSRVATPNKPADTVDLSYPWGRVSCAYAKQDDRITMRITVSNASEKPIDQLSLR |
| Ga0134075_105411351 | 3300014154 | Grasslands Soil | VKTVEAIKADWKWGTFIGFFLPLLAATVFADTSPVKTESQQNRTPTPAVLQSPASKQGLSFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRVVLDALLPRSSSPSATPNKKTDAVDLSYPWGRISCAYGKQDDRLTMRLE |
| Ga0134079_100122131 | 3300014166 | Grasslands Soil | MKFLLALLAATAFADTGAEKTESQRNPTPIAAVRQSSAPKQGLNSVVGARGLDSLSFNGESLLVSLERGELQPQKSAFRTVLDAVLPLSSAGVATPNKRPDMIDLSYPWGRISCAYGKQDDRLTMRIEAFNTSSKAVNEFSVR* |
| Ga0163163_100678981 | 3300014325 | Switchgrass Rhizosphere | VRHIAPRSSFIGFSLAFLAAAVFADTSPVKKESPANSTPTRVLQSPASKQGLNFVVGAHGLDSLSFNGQSLIGSPESGELQPQKSVFRTVLDALVPRSSPRIATPDKKADTVDVSYPSGRIFCAYGKQDDRLTMRLEVSNTSSNPLNELPL |
| Ga0157379_101948751 | 3300014968 | Switchgrass Rhizosphere | VSHVARSLFISFFLVFLAGTVFANASSVKTGSQRNGTLTPALVQSPAANQGLSFSVGDHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPSATPTKNTDTVDLTYPWGHISCAYGKQGNRLKMRLEVSNTSSKPLNELSLR |
| Ga0173480_103440492 | 3300015200 | Soil | LPRITPYSFFISFFLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGY |
| Ga0132258_113611071 | 3300015371 | Arabidopsis Rhizosphere | LPHITPHSSFIGFLLAFLAATVFADTRLVKTESHQNRTPTATRLQSPASKQGLNFVIGAHGLDSLSFNGQSLLVSPESGELQPQKSRFRAVLDALVPRSSPRVAAPDKKADRVDVSYPWGRI |
| Ga0132256_1004334483 | 3300015372 | Arabidopsis Rhizosphere | VPHIPPPSSVIGFFLAFLAATVFADTSPIEGELQQNRNPAPAALQSPPSKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPTPTPTKNTDTVELTYPWGRISCAYGKQDDRLTMRLEVSNTSSKPLNELSLRL |
| Ga0132256_1029688252 | 3300015372 | Arabidopsis Rhizosphere | LPHITPHSLFIGFFLPFFAAAVSADTSPVKTEPQQNRTPTRAVLQAPASKQGLNFAVGAHGLDSLSFNGQSLLVSPESGDLQPEKSVFRAVLDALLPRSSSRAATPDKKADTVGLS* |
| Ga0184618_104435762 | 3300018071 | Groundwater Sediment | MGPISLAFFAATVFANTSLVRPESQQTRNPTPAALQSPPSKQGLNFVVGAHGLDSLSFNGQSLLVSPERGELQPQKSVFRAVLDALVPRSSPRVATPDKKADTVDVSYPKGRISCAYGKQEDRLTMRLEVSNTSSEPVN |
| Ga0184624_100215601 | 3300018073 | Groundwater Sediment | MTPRSSFVGFFLAFLVATVLAGVSSVKAESQPNPIPSPAVLQSPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSPPVATPDKKADTVELSYPRGRISCAYGK |
| Ga0066662_104232671 | 3300018468 | Grasslands Soil | VPHIIVRSSFIGFSLALLAATVFTDTSPLKKESQRNRTPGPALLQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLDALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKITMKIEISNTS |
| Ga0066669_115509511 | 3300018482 | Grasslands Soil | VPHIIVRSSFISFSLALLAATVFADTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKI |
| Ga0173479_100890121 | 3300019362 | Soil | MDRSGNQNQRTNEGLIRMKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPEGGELQPQKSVFRAVLDALLPRSSARAATPDKKTDTIDLSYSWGRISCGYGKQGDRLTMRLEVSNTSS |
| Ga0137408_10303869 | 3300019789 | Vadose Zone Soil | MKTVEAIKADWKWGTFIGFFLPLLAATVFADTSPVKTESQQNRTPTPAVLQSPASKQGLSFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALVPRSSSPSATPNKNTDTIDLSYPWGRIS |
| Ga0193732_10165581 | 3300020012 | Soil | MGPIFVAFFAATVFADTSLLKTESQQNRTPTPAVLQSPASKQGLNFVVGAHGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALFPRSSSPSATPNKNTDTIDLSYPWGRISCAYGKQDDRLTMRLEVSNTSSEPLGEFSLR |
| Ga0193724_10572611 | 3300020062 | Soil | LLNFSTLLISHITAQSSFIGFCLALLAATVLADSGAEKTELRRNLTSTASGRQSSASKQGLNFAVGERGLTSLSFNGQSLLVSRASGELQPQKSVFRAVLDAFLPRSSFQVATPNKTPDTVDIT |
| Ga0126371_122733591 | 3300021560 | Tropical Forest Soil | MRSELAAVAGYQAHSSSIAFFLVLFSATAFAASSAVKIESQPDRTLTPSVVQSTAPKEHLNFVVGSRGLDSLSFNGETLLVSPERGELQPYKSAFRAVLDAVLPHSSAGVATPNKKPNTIDLSYPWGRVSGVYGKQADRLTLRIEVSNTSSKPVN |
| Ga0207707_103567671 | 3300025912 | Corn Rhizosphere | MDRSGNQNQRTNEGLIRMKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGYSKQGDRLTMKLEVSNTSS |
| Ga0207693_104722541 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPHSSFIGFFLAVLALAVLADTSPVETESQGTQTATLALVQSPASKQGLNFAVGERGLTSLSFNGQSLLVSPASGELQPEKSVFRAVLDAILPRSASQVATPKKKADTVDLSYPWGRVSCAYGKQGDRL |
| Ga0207657_102271511 | 3300025919 | Corn Rhizosphere | VSHVARSSLINFFLALLAATVFADASPVKTESQQNGTLTPAVVQSPAANQGLRFSVGAHGLDSLSFNGQSLLVSSESGELQPAKSVFRAVLDALVPRPSPRMATRGQKADTVDLSYASGEISCAYGKQNDRL |
| Ga0207659_111292241 | 3300025926 | Miscanthus Rhizosphere | MKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSYPWG |
| Ga0207687_110932301 | 3300025927 | Miscanthus Rhizosphere | MKVMRPICLAFLAAAVLADTSPIKGELQHNRTPTPAVVQSPASKQGLNFVVGARGLDSLSFNGQSLLVSPESGELQPQKSVFRAVLDALLPRSSARVATPDKKTDTIDLSFPWGRISCGYSKQGDRLTMRLEVSNTSSEPVNEFSLRLM |
| Ga0207658_106466922 | 3300025986 | Switchgrass Rhizosphere | VSHVARSLFISFFLVFLAATVFADASPVKTESQQNGTLTPAVVQSPAANQGLRFSVGAHGLDSLSFNGQSLLVSPESGELQQQKSVFRALVDALLPRSSSPTATPDKNTDTVDLTYPWGRISCAYGKQGNRLKMRL |
| Ga0209471_13138211 | 3300026318 | Soil | VPHIIVRSSFISFSVALLAATVFADTSPLKKESQRNRTPGPALVQSPASKQGLNFVVGVHGLDSLSFNEQSLLSSPESGELRLQKSVFRAVLEALLPGSSSRAGTPNKKADTIHLSYPWGRISCSYGKEDDKITMKIEISNTSS |
| Ga0209058_11260151 | 3300026536 | Soil | LPHITPHSSLIGFCLALLAATAFAGTSPVETVSQRNGTPTPAVVPSPAPKQGLNFVVGAYGLDSLSFDGQSFLLSPESGELQPQKSVFRAVFDALLPRSSSPSATPNKQADTIDLSYPWGRISCAYGKQ |
| Ga0137415_112376931 | 3300028536 | Vadose Zone Soil | MVASANRSSDRFLSSLISFFLVLLAASVVADTSPVKAESQRNGAPAPAAFQSPAPKQGLNFVVGSRGLESLSFNGQSLLCSAQEGELQPQKSVFRAVLDALLPGPSSGVAMANMQPDTIDLSYRWGRISCAYGKQDNRITMRIEVSNTGPEPLNEF |
| Ga0307305_101832081 | 3300028807 | Soil | MSFFLVLLAASIVADTSSVNTESQHNGASSPAVFQSPVPKQGLNFVVGVHGLDSLSFNEQSLLSSPESGEFRLQKSVFRAVLDALLPGSSSRAGTPNKKADTIDLSYPWGRISCSYGKED |
| Ga0170824_1049512561 | 3300031231 | Forest Soil | MTFTRRFTGNSVKTVEAIKPDWKWGLFIGFFLALLAATVFADTSPVKGESQRDRTPTPAVLQSPAPEQGLHFVVGAHGLDSLSYNGQSILRSAQDGELQPWKSVFRAVLDALLSPSQSPVATPNKQADSIDLSYPW |
| Ga0170818_1001782511 | 3300031474 | Forest Soil | MTFTRRFTGNSVKTVEAIKPDWKWGLFIGFFLALLAATVFADTSPVKGESQRDRTPTPAVLQSPAPEQGLHFVVGAHGLDSLSYNGQSILRSAQDGELQPWKSVFRAVLDALLSPSQSPVATPNKQADSIDLSYPWGRVSCAYGKQGDKLTMR |
| Ga0307469_109221523 | 3300031720 | Hardwood Forest Soil | MVSSRSSDKLRNFFLAVLALAVFADTSAVETESQRTQTATLALVQSPASKQSLNFAVGERGLTSLSFNGQSLLVSPESGELQPQKSVFRAVLDALVPRSPTQVATPNKQTDTVDVSYPWG |
| Ga0307468_1004812983 | 3300031740 | Hardwood Forest Soil | LIGQSEISSQTSAIGYQLSVIGYQFLAFLAATVFADTSPVKTEPQRDRTPTPAVVQSPASKQGLNFVVGAHGLESLSYNGESLLRSAQDGELQPQKSVFRAVLDALVPRSSPQVATPDKKADTVDLSYPWGRISCAYGKQDDRLTMRFEVSNTS |
| Ga0306921_126064781 | 3300031912 | Soil | VIGYLLSALLAATVFADTRPAKAESQQNRTPTPAVLQSPASKQGLNFVVGADGLDSLSFNGQSLLVSPESGELRPQRSVFRAVLDALVPRSSPRVATPDKKADTVDLNYPWGRISCAYGKQKDRLTMRLEVS |
| Ga0307472_1001271993 | 3300032205 | Hardwood Forest Soil | VTTGHHLPASCHRDGRIHWFFLALLAATVFADTSAVKTEPQRDRTPTPAVVQSPASKQGLNFVVGAHSLDSLSFDGQSLLRSAQDGELQPQKSVFRAVLDALVPRFSSPSAAPNKNTDTIDLSYPWGRISCAYGKQHDRLT |
| Ga0307472_1007093723 | 3300032205 | Hardwood Forest Soil | MVSSRSSDKLRNFFVAVLALAVFADTSAVATESQRTQTATLALVQSPASKQSLNFAVGERGLTSLSFNGQSLLVSPESGELQPQKSVFRAVLDALVPRAQTQVATPNKQTDTVDVSYPWGRVSCAYGKQDDRLTLRIEVSNTSS |
| Ga0306920_1027083941 | 3300032261 | Soil | MAAARANRSSDRLLSSLISFFLVLLAATVFADTSPVNRESHKHGVPTPAVFQSPTPKQGLSFVVGAYGLDSLSFNGQSLLASPDSGELRPHKSAFRAVLDALLPGSSSRVVTPDKPADTVDLSYSWGRVSCVYGKQDDRITMR |
| Ga0306920_1042516031 | 3300032261 | Soil | VIGYQLSALLAATVFADTRPAKAESQQNRTPTPAVLQSPASKQGLNFVVGADGLDSLSFNGQSLLVSPESGELRPQRSVFRAVLDALVPRSSPRVATPDKKADTVDLNYPWGRISCA |
| ⦗Top⦘ |