| Basic Information | |
|---|---|
| Family ID | F091693 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 107 |
| Average Sequence Length | 65 residues |
| Representative Sequence | GVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG |
| Number of Associated Samples | 87 |
| Number of Associated Scaffolds | 107 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 12.15 % |
| % of genes from short scaffolds (< 2000 bps) | 11.21 % |
| Associated GOLD sequencing projects | 82 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.52 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (87.850 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (30.841 % of family members) |
| Environment Ontology (ENVO) | Unclassified (71.028 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (75.701 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Transmembrane (alpha-helical) | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 53.85% β-sheet: 4.40% Coil/Unstructured: 41.76% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.52 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 107 Family Scaffolds |
|---|---|---|
| PF13180 | PDZ_2 | 43.93 |
| PF13365 | Trypsin_2 | 21.50 |
| PF00012 | HSP70 | 4.67 |
| PF16576 | HlyD_D23 | 2.80 |
| PF00528 | BPD_transp_1 | 1.87 |
| PF12700 | HlyD_2 | 0.93 |
| PF02780 | Transketolase_C | 0.93 |
| PF12704 | MacB_PCD | 0.93 |
| COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds |
|---|---|---|---|
| COG0443 | Molecular chaperone DnaK (HSP70) | Posttranslational modification, protein turnover, chaperones [O] | 4.67 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 87.85 % |
| All Organisms | root | All Organisms | 12.15 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300005179|Ga0066684_11127612 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 500 | Open in IMG/M |
| 3300006796|Ga0066665_10886580 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 693 | Open in IMG/M |
| 3300010132|Ga0127455_1036256 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 505 | Open in IMG/M |
| 3300012359|Ga0137385_11318065 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 585 | Open in IMG/M |
| 3300012407|Ga0134050_1248998 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 523 | Open in IMG/M |
| 3300012927|Ga0137416_10630999 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 936 | Open in IMG/M |
| 3300012929|Ga0137404_12146844 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 522 | Open in IMG/M |
| 3300017656|Ga0134112_10084195 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1184 | Open in IMG/M |
| 3300026015|Ga0208286_1007850 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 758 | Open in IMG/M |
| 3300026532|Ga0209160_1008716 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae | 7615 | Open in IMG/M |
| 3300026532|Ga0209160_1239252 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 622 | Open in IMG/M |
| 3300027725|Ga0209178_1034260 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium RIFCSPLOWO2_02_FULL_71_11 | 1603 | Open in IMG/M |
| 3300031421|Ga0308194_10021680 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1415 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 30.84% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 27.10% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.02% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 13.08% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.74% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.74% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.87% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.87% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.93% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.93% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.93% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.93% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental | Open in IMG/M |
| 3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental | Open in IMG/M |
| 3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental | Open in IMG/M |
| 3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental | Open in IMG/M |
| 3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental | Open in IMG/M |
| 3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental | Open in IMG/M |
| 3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300006031 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100 | Environmental | Open in IMG/M |
| 3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental | Open in IMG/M |
| 3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated | Open in IMG/M |
| 3300010132 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010142 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010322 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010323 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
| 3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental | Open in IMG/M |
| 3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental | Open in IMG/M |
| 3300012371 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012380 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012393 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012395 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012401 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012404 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_8_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012407 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental | Open in IMG/M |
| 3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental | Open in IMG/M |
| 3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental | Open in IMG/M |
| 3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300025929 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300026015 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401 (SPAdes) | Environmental | Open in IMG/M |
| 3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental | Open in IMG/M |
| 3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026316 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 (SPAdes) | Environmental | Open in IMG/M |
| 3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental | Open in IMG/M |
| 3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental | Open in IMG/M |
| 3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental | Open in IMG/M |
| 3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental | Open in IMG/M |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental | Open in IMG/M |
| 3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
| 3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental | Open in IMG/M |
| 3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental | Open in IMG/M |
| 3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental | Open in IMG/M |
| 3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental | Open in IMG/M |
| 3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental | Open in IMG/M |
| 3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental | Open in IMG/M |
| 3300031152 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_S | Environmental | Open in IMG/M |
| 3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI25383J37093_100649871 | 3300002560 | Grasslands Soil | VLPRWLGTSGLAAGAVAMALALAFREDTKADQAAFVLPVLWQLVTASVMLRPPGTRG* |
| JGI25382J43887_103649591 | 3300002908 | Grasslands Soil | TSGVAAGAVAIALALVFREDTKADQAAFVLPVAWQLVTAIVMLRQVGTRG* |
| JGI25389J43894_10047711 | 3300002916 | Grasslands Soil | DGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG |
| Ga0066683_102028241 | 3300005172 | Soil | GRGVLNGKVLPRWLGAAGVAAGLVGVALALAFREDTKADQAAYVLPVLWQVVTGAVLLRARL* |
| Ga0066690_100505801 | 3300005177 | Soil | TALFGWGLLDGKVLPRWLGGAAVAAGGVAIVLALVFREDTKADQAAFVLPVVWQLVTGVVMLRRV* |
| Ga0066690_100675263 | 3300005177 | Soil | DGKVLPRWLGAGAVAAGGVGIVLALVFREDAKADQAAFVLPVVWQLVTGVVMLRRA* |
| Ga0066684_107483692 | 3300005179 | Soil | LDGKVLPRWLGAGAVAAGGVRIVLALGFREDTKADQAAFVLPVVWQLMTGAVMLRGA* |
| Ga0066684_111276122 | 3300005179 | Soil | DGKVLPRWLGTGGLAAGAVAMAFALVFREDTKADQAAFVLPVLWQAVTAIVMLRPVGTRG |
| Ga0066676_102777261 | 3300005186 | Soil | LFGWGVSDGKVLPRWLGTGALAAGVVAMALAVAFPETTKADQAAFVLPVLWQVVTGTVMLRVARG* |
| Ga0066676_106757072 | 3300005186 | Soil | DGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVASGAMLRRARG* |
| Ga0066675_108236071 | 3300005187 | Soil | VAGFALGLATALFGWGVLDGKVLPRWLGAAAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRG* |
| Ga0070703_100960792 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | AWGLLDGKVLPRWLGTSGLAAGGVAMGLAFVFPETTKADQAAFVLPVLWQVVAASVMLRSLGTRG* |
| Ga0070714_1004212262 | 3300005435 | Agricultural Soil | FGWGVLDGKVLPRWLGTAGVAAGGIAMGLALIFREDTKADQAAFVLPVFWQVLVSTVMLRGAIRG* |
| Ga0066687_105775202 | 3300005454 | Soil | LGLATALFGWGVLDGKVLPRWLGAAALAAGGVGIVLAVVFREDTKADQAAFVLPVVWQLVTAVVMLRA* |
| Ga0070706_1011102811 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | LGVATCLFGWGVLDGKVLPRWLGTAGVAAGGIAMGLALIFREDTKADQAAFVLPVFWQVLVSTVMLRGAIRG* |
| Ga0066695_105419931 | 3300005553 | Soil | VLNGKVLPRWLGAAGVAAGLVGAALALAFREDTKADQAAYVLPVLWQVVTGAVLLRARL* |
| Ga0066692_101948431 | 3300005555 | Soil | AAAVAAGGVAIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRGA* |
| Ga0066698_104099592 | 3300005558 | Soil | GLSTCLYGWGVLDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVASGAMLRGVRG* |
| Ga0066700_109187891 | 3300005559 | Soil | PRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQLVTGSVMLRAVRG* |
| Ga0066651_100660791 | 3300006031 | Soil | VLPRWLGAGALAAGGVGIVLALVFREDAKADQAAFVLPVAWQLVTAVVMLRRA* |
| Ga0066696_102475711 | 3300006032 | Soil | LGAGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTGAVMLRGA* |
| Ga0066652_1003003782 | 3300006046 | Soil | KVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG* |
| Ga0066665_108865802 | 3300006796 | Soil | LFCWGVFDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG* |
| Ga0079221_110622472 | 3300006804 | Agricultural Soil | CLFGWGVVDGKALPHWVGTGALGAGAVGMAFALVFAESTKADQAAFVLPVLWQVVTGVVLLRRGP* |
| Ga0079220_104100122 | 3300006806 | Agricultural Soil | LNGKVLRGWLGRGGVAAGVVGMVLAVVFSEDTKADQAAFVLPVLWQVVAGVAFLRGVLNDSRSSAASAR* |
| Ga0079219_106702341 | 3300006954 | Agricultural Soil | GFALGLATLLFGWGMLDGKVLPRWLGAAAVAAGGVGVVLALVFREDTKADQAAFVLPVVWQLVTGVVLLRSGVRG* |
| Ga0066710_1017357361 | 3300009012 | Grasslands Soil | GVSDGKVLLRWLGTGGLAAGAVAMGLAVAFPETTKADQAAFVLPVLWQVVTGSVMLRVAR |
| Ga0066710_1023535201 | 3300009012 | Grasslands Soil | GKVLPRWLGAAALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTGVVMLRRA |
| Ga0099830_102746652 | 3300009088 | Vadose Zone Soil | FALGLSTCLFAWGVSDGKVLPRWLGTSGLAAGGVAMGLAVVFPETTKADQAAFVLPVLWQVVTGAAMLRVARG* |
| Ga0075423_100614863 | 3300009162 | Populus Rhizosphere | FAWGVYDGKVLPRWLATGGLAAGAVAMVLALVFREDTKADQAAFVLPVLWQAVTGMVLVWDGRRGLGGSASVAP* |
| Ga0127455_10362562 | 3300010132 | Grasslands Soil | STYLFGWGIVNGKVLPRWLGAGAVAAGAVAMALALAFREDTKADQAAFVLPVLWQLVTGIVLLRTVRG* |
| Ga0127483_12314813 | 3300010142 | Grasslands Soil | ALGLSTCLYGWGVLDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG* |
| Ga0134088_101307651 | 3300010304 | Grasslands Soil | IATYLFGRGVLNGKVLPRWLGAAGVAAGLVGMTLALAFGEDTKADQAAYVLPVLWQLVAGVALLRARP* |
| Ga0134088_105729492 | 3300010304 | Grasslands Soil | STCLFGWGVVEGKALPRWLGTSGLAAGAVALFFALVFREDTKADEAAFVLPVLWQVATGAVMLRLAFRR* |
| Ga0134067_104717961 | 3300010321 | Grasslands Soil | AGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVSAVVMLRGA* |
| Ga0134084_101871001 | 3300010322 | Grasslands Soil | RVAGFALGLATALFGWGVLDGKVLPRWLGAAAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRG* |
| Ga0134084_102244342 | 3300010322 | Grasslands Soil | DGKALPRWLGTSGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAGVMLRPLGTRG |
| Ga0134086_101488292 | 3300010323 | Grasslands Soil | RVAGFALGLSTCLYGWGVLDGKVLPRWLATGGMAAGAVAMALALVFREDTKADQAAFVLPVLWQVVASGAMLRGVRG* |
| Ga0134066_101837691 | 3300010364 | Grasslands Soil | ALLDGKVLPRWLGAGALAAGGVGIVLALVFREDAKADQAAFVLPVAWQLVTAVVMLRRA* |
| Ga0137380_100362164 | 3300012206 | Vadose Zone Soil | GGFALGLATVLFGWGVLDGKVLPRWLGAAALAAGGVGIVLALLFREDTKADQAAFVLPVVWQLVTGVVMLRRA* |
| Ga0137381_107421452 | 3300012207 | Vadose Zone Soil | VLPRWLATGGLAAGAVAMALALVFREDAKADQAAFVLPVLWQVVASGAMLRGVRG* |
| Ga0137376_103789372 | 3300012208 | Vadose Zone Soil | GKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVASGAMLRGVRG* |
| Ga0137370_103785092 | 3300012285 | Vadose Zone Soil | LPRWLGTSGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTASVMLRPLGTRG* |
| Ga0137386_102698171 | 3300012351 | Vadose Zone Soil | TVLFGWGVLDGKVLPRWLGAAALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTGVVMLRRA* |
| Ga0137386_108183181 | 3300012351 | Vadose Zone Soil | CLFCWGVFDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG* |
| Ga0137384_102941762 | 3300012357 | Vadose Zone Soil | SDGKVLPRWLGGGGVAAGVVGMALALLFREDTKADQAAFVLPVVWQVATGAVMLRRAWTRG* |
| Ga0137385_100797611 | 3300012359 | Vadose Zone Soil | WLGAAALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRG* |
| Ga0137385_106434972 | 3300012359 | Vadose Zone Soil | WLGAAALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTGVVMLRRA* |
| Ga0137385_113180651 | 3300012359 | Vadose Zone Soil | TQRVAGFALGLSTCLFCWGVFDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTSGAMLRAVRG* |
| Ga0137360_104211612 | 3300012361 | Vadose Zone Soil | RVAGFALGLSTCLFAWGVSDGKVLPRWLGRTGLAAGGVAMGLAVVFPETTKADQAAFVLPVLWQLVAGSLMLRVARG* |
| Ga0134022_10009292 | 3300012371 | Grasslands Soil | GVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG* |
| Ga0134047_11848432 | 3300012380 | Grasslands Soil | AGFALGLSTCLFAWGVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG* |
| Ga0134052_11742362 | 3300012393 | Grasslands Soil | SAYLFGWGIVNGKVLPRWLGAGAVAAGAVAMALALAFREDTKADQAAFVLPVVWQLVTGAVMLRAR* |
| Ga0134044_10836871 | 3300012395 | Grasslands Soil | TGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRQLGLRG* |
| Ga0134057_10810702 | 3300012396 | Grasslands Soil | STYLFGWGIVNGKVLPRWLGAGAVAAGAVAMALALAFREDTKADQAAFVLPVVWQLVTGAVMLRAR* |
| Ga0134055_10376381 | 3300012401 | Grasslands Soil | AAFALGIATYLFGRGVLNGKVLPHWLGAAGVAAGLVGMALALAFGEDTKADQAAYVLPVLWQLVAGAVLWRARP* |
| Ga0134024_12203502 | 3300012404 | Grasslands Soil | LFGWGVSDGKALPRWLGTSGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG* |
| Ga0134050_12489982 | 3300012407 | Grasslands Soil | ATQRVAGFALGLSTCLFAWGVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG* |
| Ga0137416_106309991 | 3300012927 | Vadose Zone Soil | RVAGFALGLSTCLFAWGVSDGKVLPRWIGTGGLSAGAVALALALAFREDTKADQAAFVLPVLWQVVTAGVMLRALGTRG* |
| Ga0137404_121468442 | 3300012929 | Vadose Zone Soil | LSTCLFAWGVSDGKVLPRWLGTSGLAAGGVAMGLAVVFPETTKADQAAFVLPVLWQVVTASVMLRPLGTRG* |
| Ga0134110_100248163 | 3300012975 | Grasslands Soil | RWLGTGGLAAGAVAMAFALVFREDTKADQAAFVLPILWQVVTGGAMLRGGRG* |
| Ga0134110_101282362 | 3300012975 | Grasslands Soil | AALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRRA* |
| Ga0134076_100190423 | 3300012976 | Grasslands Soil | FAWGVSDGKVLPRWLGTSGLAAGAVAMALALAFREDTKADQAAFVLPVLWQVVTASVMFRPLGTRG* |
| Ga0134087_102367421 | 3300012977 | Grasslands Soil | GKVLPRWLGTGGLAAGAVAMAFALVFREDTKADQAAFVLPILWQVVTGGAMLRGGRG* |
| Ga0134087_106870292 | 3300012977 | Grasslands Soil | FGWGVLDGKVLPRWLGAAAVAAAGVGIVLALVFREDTKADQAAFVLPVVWQLVTGVVMLRDV* |
| Ga0134081_103309841 | 3300014150 | Grasslands Soil | LSTCLFAWGVSDGKVLPRWLATWGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTGSVMLRPVRG* |
| Ga0134075_100919642 | 3300014154 | Grasslands Soil | ALGLATCLFGWGVVEGKALPRWLGTSGLAAGAVALFFALVFREDTKADEAAFVLPVLWQVATGAVMLRLAFRR* |
| Ga0134078_101253071 | 3300014157 | Grasslands Soil | AGFALGLATALFGWALLDGKVLPRWLGAGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVSAVVMLRGA* |
| Ga0134085_105576781 | 3300015359 | Grasslands Soil | DGKVLPRWLGTSGVAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTASVMLRPLGTRG |
| Ga0134112_100841952 | 3300017656 | Grasslands Soil | RVAGFALGLSTCLFGWGVSDGKVLLRWLGTGGLAAGAVAMALALVFREDTKADQAAFVLPVAWQLVTAIVMLRPVGTRG |
| Ga0134112_101024692 | 3300017656 | Grasslands Soil | GVSDGKVLPRWLGTSGLAAGGVAMGLTVVFPETTKADQAAFVLPVLWQLVTASVMLRPLGTRG |
| Ga0134112_102668382 | 3300017656 | Grasslands Soil | STCLFGWGVSDGKVLPRWLGTGGLAAGVVAMALALAFREDTKADQAAFVLPVLWQVVAGGVMVRLGLTTVRRSASAAP |
| Ga0066655_102004172 | 3300018431 | Grasslands Soil | CLFAWGVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG |
| Ga0066655_106029502 | 3300018431 | Grasslands Soil | WGVSQGKGLPRWLGTTGLAAGAVAMGLALVFREDTKADQAAFVLPVAWQVVTAAVMLRTTAASRKP |
| Ga0066667_100451321 | 3300018433 | Grasslands Soil | AQRAAAFALGIATYLFGRGVLNGRVLSRWLGAAGVAAGLVGMALALAFGEDTKADQAAYVLPVLWQLVAGAVLWRARP |
| Ga0066667_116012752 | 3300018433 | Grasslands Soil | LFGWGVSQGKVLPRWLGTSGLAAGAVAMGLALVFREDTKADQAAFVLPVAWQVVTAAVMLRTTAASRKP |
| Ga0066662_108819551 | 3300018468 | Grasslands Soil | GKVLPRWLGSAALAAGGVGTMLALVFREDTKADQAAFVLPVVWQLVTGVVMLRRA |
| Ga0066669_119785351 | 3300018482 | Grasslands Soil | TLFGWGILEGKMLPRWLGAAALAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVLMLRRA |
| Ga0179596_106660812 | 3300021086 | Vadose Zone Soil | NGKVLPRWLGMGGVAAGVVAMGLAVAFPETTKADQAAFVLPVLWQLVTGVVLLRARAAGA |
| Ga0207692_109975322 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | ATCLFGWGVLDGKVLPRWLGTAGVAAGGIAMALAVIFREDTKADQAAFVLPVFWQVLVSSAMLRSAITRP |
| Ga0207700_103624212 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | GKVLPRWLGAAAVAAGGVGILLALVFREDTKADQAAFVLPVVWQLVTGVVLLRSEVNLS |
| Ga0207664_102883742 | 3300025929 | Agricultural Soil | FGWGVLDGKVLPRWLGTAGVAAGGIAMGLALIFREDTKADQAAFVLPVFWQVLVSTVMLRGAIRG |
| Ga0208286_10078501 | 3300026015 | Rice Paddy Soil | LGLATCLFGWGVLDGKALPRWLGSGGVAAGAVAMGLALAFPETTKADQAAFVLPVLWQVVTGAVLLCEGLRAPQGPVSTAP |
| Ga0209265_11692331 | 3300026308 | Soil | VAGFALGLATALFGWALLDGKVLPRWLGAGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVSAVVMLRGA |
| Ga0209239_10034331 | 3300026310 | Grasslands Soil | VAGFALGLATALFGWGVLDGKVLPRWLGAAAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRG |
| Ga0209239_10520263 | 3300026310 | Grasslands Soil | WLGASAVAAGGVGIVLALVFREDAKADQAAFVLPVVWQLVTAVVMLRG |
| Ga0209761_10050001 | 3300026313 | Grasslands Soil | GLSTCLFGWGVMDGKALPRWLGTSGVAAGAVAIALALVFREDTKADQAAFVLPVAWQLVTAIVMLRPVGTRG |
| Ga0209155_12041652 | 3300026316 | Soil | GLSTCLFCWGVFDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG |
| Ga0209152_100259792 | 3300026325 | Soil | RWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG |
| Ga0209152_101430692 | 3300026325 | Soil | AGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLMTGAVMLRGA |
| Ga0209802_10192841 | 3300026328 | Soil | KVLPRWLGAAAVAAGGVAIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRGA |
| Ga0209802_11603122 | 3300026328 | Soil | FGWGVSEGKVLPRWLGTTGLAAGAVAMALAVAFPETTKADQAAFVLPVLWQVVTGTVMLRVARG |
| Ga0209803_10359803 | 3300026332 | Soil | YLFGRGVLNGKVLSRWLGAAGVAAGLVGMALALAFGEDTKADQAAYVLPVLWQLVAGAVLWRARP |
| Ga0209803_12290261 | 3300026332 | Soil | DGKVLPRWLGAAAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRGA |
| Ga0209804_13074201 | 3300026335 | Soil | GFALGLSTCLFAWGVSDGKVLPRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTAAVMLRPLGLRG |
| Ga0209690_11248842 | 3300026524 | Soil | GIATYLFGRGVLNGKVLPRWLGAAGAAAGLVGMALALAFGEDTKADQAAYVLPVLWQLVAGAVLWRARP |
| Ga0209806_10144731 | 3300026529 | Soil | GLATVLFGWGMLDGKVLPRWLGAAAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLVTAVVMLRGA |
| Ga0209160_10087168 | 3300026532 | Soil | QRVAAFALGLSTGLFGWGVSQGKVLPRWLGTSGLAAGAVAMGLALVFREDTKADQAAFVLPVAWQVVTAAVMLRTTAASRKP |
| Ga0209160_12392522 | 3300026532 | Soil | PRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQLVTGSVMLRAVRG |
| Ga0209056_101801391 | 3300026538 | Soil | PRWLATGGLAAGAVAMALALVFREDTKADQAAFVLPVLWQVVTATVMLRPLGLRG |
| Ga0209805_11466552 | 3300026542 | Soil | ALGLATALFGWGVLDGKVLPRWLGAAALAAGGVGIVLAVVFREDTKADQAAFVLPVVWQLVTAVVMLRA |
| Ga0209156_102691281 | 3300026547 | Soil | CLFGWGVSDGKVLPRWLGTGGLAAGAVAMAFALVFREDTKADQAAFVLPVAWQVVTAAVMLRRPVRG |
| Ga0209577_103145041 | 3300026552 | Soil | VLPRWLGAGAVAAGGVGIVLALVFREDTKADQAAFVLPVVWQLMTGAVMLRGA |
| Ga0209178_10342602 | 3300027725 | Agricultural Soil | VGGFALGVATCLFGWGVLDGKVFPRWLGTGGVTAGAIAMGLALFFREDTKADQAAFVLPVFWQLVAAGVMLRSAFNRG |
| Ga0307501_100119691 | 3300031152 | Soil | GKVLPRWVGTGGLAAGAVGVVLAHVFREDTKADQAAFVLPVLWQLVTASVMLRPLGTRGLTRGLGTTDCGS |
| Ga0308194_100216801 | 3300031421 | Soil | VAGFALGLSTSLVGWGVVNGKVLPRWLGSSGVVAGAVAMALALLFREDTKADQAAFVLPVLWQVVTAAVLLRGRGRPAGA |
| Ga0307471_1004813962 | 3300032180 | Hardwood Forest Soil | SDGKVLPRWLGTGGLAAGVVAMGLAVAFPETTKADQAAFVLPVLWQVVTASVMLRPLGTR |
| ⦗Top⦘ |