| Basic Information | |
|---|---|
| Family ID | F103587 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 101 |
| Average Sequence Length | 63 residues |
| Representative Sequence | MATAAKFRIDVAIHDLLMNSTQRPPAPAEARGLTLPEPVFPQFPERLSDPGPETPP |
| Number of Associated Samples | 90 |
| Number of Associated Scaffolds | 101 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 95.00 % |
| % of genes near scaffold ends (potentially truncated) | 96.04 % |
| % of genes from short scaffolds (< 2000 bps) | 93.07 % |
| Associated GOLD sequencing projects | 84 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.22 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (62.376 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (21.782 % of family members) |
| Environment Ontology (ENVO) | Unclassified (30.693 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (58.416 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Fibrous | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 13.10% β-sheet: 0.00% Coil/Unstructured: 86.90% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.22 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 101 Family Scaffolds |
|---|---|---|
| PF13620 | CarboxypepD_reg | 4.95 |
| PF04055 | Radical_SAM | 3.96 |
| PF00535 | Glycos_transf_2 | 1.98 |
| PF07992 | Pyr_redox_2 | 1.98 |
| PF00581 | Rhodanese | 1.98 |
| PF04454 | Linocin_M18 | 0.99 |
| PF07730 | HisKA_3 | 0.99 |
| PF03918 | CcmH | 0.99 |
| PF03706 | LPG_synthase_TM | 0.99 |
| PF02321 | OEP | 0.99 |
| PF00691 | OmpA | 0.99 |
| PF13419 | HAD_2 | 0.99 |
| PF09411 | PagL | 0.99 |
| PF16925 | TetR_C_13 | 0.99 |
| COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds |
|---|---|---|---|
| COG1538 | Outer membrane protein TolC | Cell wall/membrane/envelope biogenesis [M] | 1.98 |
| COG0392 | Predicted membrane flippase AglD2/YbhN, UPF0104 family | Cell wall/membrane/envelope biogenesis [M] | 0.99 |
| COG3088 | Cytochrome c-type biogenesis protein CcmH/NrfF | Posttranslational modification, protein turnover, chaperones [O] | 0.99 |
| COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 0.99 |
| COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 0.99 |
| COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 0.99 |
| COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 0.99 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 62.38 % |
| All Organisms | root | All Organisms | 37.62 % |
| Visualization |
|---|
| Powered by ApexCharts |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 21.78% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.85% |
| Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 9.90% |
| Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 8.91% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 6.93% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.96% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 2.97% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.97% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.97% |
| Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 1.98% |
| Peatland | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland | 1.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.98% |
| Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.98% |
| Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 1.98% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.98% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.98% |
| Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 1.98% |
| Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.99% |
| Bog | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog | 0.99% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.99% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.99% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.99% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 0.99% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.99% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.99% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.99% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300001089 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 | Environmental | Open in IMG/M |
| 3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental | Open in IMG/M |
| 3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental | Open in IMG/M |
| 3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental | Open in IMG/M |
| 3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental | Open in IMG/M |
| 3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental | Open in IMG/M |
| 3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental | Open in IMG/M |
| 3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental | Open in IMG/M |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M |
| 3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental | Open in IMG/M |
| 3300005712 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 | Environmental | Open in IMG/M |
| 3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental | Open in IMG/M |
| 3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental | Open in IMG/M |
| 3300006028 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaG | Environmental | Open in IMG/M |
| 3300006102 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013 | Environmental | Open in IMG/M |
| 3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental | Open in IMG/M |
| 3300009623 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_19_10 | Environmental | Open in IMG/M |
| 3300009824 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_6_BS metaG | Environmental | Open in IMG/M |
| 3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental | Open in IMG/M |
| 3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental | Open in IMG/M |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
| 3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental | Open in IMG/M |
| 3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental | Open in IMG/M |
| 3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300014199 | Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin17_30_metaG | Environmental | Open in IMG/M |
| 3300015051 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
| 3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental | Open in IMG/M |
| 3300017822 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_2 | Environmental | Open in IMG/M |
| 3300017970 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG | Environmental | Open in IMG/M |
| 3300017975 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0715_SJ02_MP15_20_MG | Environmental | Open in IMG/M |
| 3300019258 | Metatranscriptome of peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin05_10_metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1c2 | Environmental | Open in IMG/M |
| 3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental | Open in IMG/M |
| 3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental | Open in IMG/M |
| 3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental | Open in IMG/M |
| 3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental | Open in IMG/M |
| 3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental | Open in IMG/M |
| 3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental | Open in IMG/M |
| 3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental | Open in IMG/M |
| 3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental | Open in IMG/M |
| 3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental | Open in IMG/M |
| 3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental | Open in IMG/M |
| 3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental | Open in IMG/M |
| 3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental | Open in IMG/M |
| 3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental | Open in IMG/M |
| 3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental | Open in IMG/M |
| 3300021477 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-O | Environmental | Open in IMG/M |
| 3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental | Open in IMG/M |
| 3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300025414 | Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_6_10 (SPAdes) | Environmental | Open in IMG/M |
| 3300026034 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_0N_302 (SPAdes) | Environmental | Open in IMG/M |
| 3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026355 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-A | Environmental | Open in IMG/M |
| 3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental | Open in IMG/M |
| 3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental | Open in IMG/M |
| 3300027047 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF042 (SPAdes) | Environmental | Open in IMG/M |
| 3300027117 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_O1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027502 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM3H0_M2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027609 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_O2 (SPAdes) | Environmental | Open in IMG/M |
| 3300027629 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M3 (SPAdes) | Environmental | Open in IMG/M |
| 3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027663 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes) | Environmental | Open in IMG/M |
| 3300027729 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027869 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027879 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4 (SPAdes) | Environmental | Open in IMG/M |
| 3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental | Open in IMG/M |
| 3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental | Open in IMG/M |
| 3300028906 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2) | Environmental | Open in IMG/M |
| 3300030007 | I_Palsa_E1 coassembly | Environmental | Open in IMG/M |
| 3300030706 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_6_BS metaG (v2) | Environmental | Open in IMG/M |
| 3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental | Open in IMG/M |
| 3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental | Open in IMG/M |
| 3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental | Open in IMG/M |
| 3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental | Open in IMG/M |
| 3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental | Open in IMG/M |
| 3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental | Open in IMG/M |
| 3300032782 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.1 | Environmental | Open in IMG/M |
| 3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental | Open in IMG/M |
| 3300032893 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1 | Environmental | Open in IMG/M |
| 3300032896 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.4 | Environmental | Open in IMG/M |
| 3300032897 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5 | Environmental | Open in IMG/M |
| 3300032955 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI12683J13190_10225462 | 3300001089 | Forest Soil | MATAAKFRIDVAIHDLLMNSNQKAAAPAEARGLTLPEPVFPLFPERLNDPGP |
| JGI12627J18819_100763191 | 3300001867 | Forest Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLTLPEPVFPQFPGRLNDPGPETPPAKRK |
| JGI25617J43924_100758671 | 3300002914 | Grasslands Soil | MATAAKFRVDVAIHDLLMNSNQRPPAPAQAQGLTLPEPVFPQFPGRLTDPGPD |
| JGIcombinedJ51221_101622003 | 3300003505 | Forest Soil | MATAAKFRIDVAIHDLLMNSTQRSPAPVEARGLTLPEPVFPQFPERINDPGPEAPPEKRKWG |
| Ga0062386_1011177252 | 3300004152 | Bog Forest Soil | MATAARFRIDVAIHELLVNSAQGSPAPAEARGLTLPEPVFPQYPKRLADPGPDSPPEKRKWGVSGPGS |
| Ga0070714_1004444562 | 3300005435 | Agricultural Soil | MATAAKFRIDVAIHDLLVNSAQRPPTPAQAKGLSLPEPVFPQFPERLNNPGPETPPERRKWGAKDVYRYSRGWLGPYVRSRV |
| Ga0070698_1016504881 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | MATAAKFRIDVAIHDLLVNSAQRPPTPAQAKGLSLPEPVFPQFPERLTNPGPETPPERRKWGAKDVYRYSRGWLGPYVR |
| Ga0070731_104684452 | 3300005538 | Surface Soil | MATAAKFRIDVAIHDLLMNSNQRRPEPAEARGLTLPEPVFPQFPDRLKDPGPEAPP* |
| Ga0066704_106875372 | 3300005557 | Soil | MATAARFRIDVAIHDLLMNSNQQHPAPAEAKGLTLPEPVFPQFPDRLKDPGPDTPPEKRKWGPRDVYRYSR |
| Ga0066703_106473622 | 3300005568 | Soil | MATAARFRIDVAIHDLLMNSNQQHPAPAEAKGLTLPEPIFPQFPERLKDPGPETPPEKRKWG |
| Ga0070764_106454752 | 3300005712 | Soil | MATAARFRIDVAIHDLLMNSTQRPPAPAEAMGLTLPEPVFPQFPERLHDPGPETPPEKRKWGAKDVY |
| Ga0070764_110027892 | 3300005712 | Soil | VAATPTFRIDVAIHDLLMNSNQRPPAPVESKGLTLPQPVFPQFPDRLKDPGPETPP |
| Ga0066903_1075740441 | 3300005764 | Tropical Forest Soil | MTAAPKFRIDVAINDLLVSGDRRRPVPAEPKGLSLPDPVFPQFPERLNDLGPEVLPEKRK |
| Ga0070766_104710921 | 3300005921 | Soil | MTAAPKFRVDVAINDLLTSSNQRPPKPADSKGLTLPTPVFPLHPERLNDPGPDVPPEKRKWGLR |
| Ga0070766_111959141 | 3300005921 | Soil | MATATKFRIDVAIHDLLMNSTQMPPSPAESMGLTLPEPLFPQFPDRLNDPGPEFPPEKRHWGAKDVYR |
| Ga0070717_107087761 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MATAAKFRIDVAIHDLLMNSSQRPPAPAQNQGLTLPEPVFPQFPERLNNPGPETPPEKRKWGAKDVYRYSR |
| Ga0070717_112499581 | 3300006028 | Corn, Switchgrass And Miscanthus Rhizosphere | MATATKFRIDVAIHDLLMNTTQRPPAPAQAKGLKLPEPVFPQFPERLIDPGPGTPPEKRKWGTKDVYRASRG |
| Ga0075015_1004902872 | 3300006102 | Watersheds | MATAAKFRIDVDIHDLLNKPGQQTPAPAEPKGLVLPHPVYPQFPERLNDPGPEVPPDKRKWGRKD |
| Ga0070765_1015351572 | 3300006176 | Soil | MATAAKFRIDVAIHDLLMNSTQPQPAPAEAMGLSLPEPVFPQFPERLNDPGPETPPQKRKWG |
| Ga0066665_112205702 | 3300006796 | Soil | MTAAPKFRIDVAINDLLTDGSHRSSGPAEAKGLTLPVPVFPQFPERLNDPGPEVTPERR |
| Ga0099793_100060981 | 3300007258 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSNQRPPAPAESMGLTLPEPVFPRFRDRLKDPGPETPPEKRKWGPKDVYRYS |
| Ga0116133_12043411 | 3300009623 | Peatland | MTAAPKFRIDVTINDLLTSAGQKPAAPAEAKGLSLPTPVFPQFPERVMESGPEVPPEKR |
| Ga0116219_100353087 | 3300009824 | Peatlands Soil | MLCEVIAMATTAEFRIDVAIHDLLMKSAERPPAPAEAKGLTLPEAVFPQHPERLRDPGPETPPEKRKWGAK |
| Ga0126370_114161891 | 3300010358 | Tropical Forest Soil | MTAAPKFRIDVAINDLLTGSGQRPPAPAAPQGLTLPNPAFPQFPERLNDSGPVVPPEKRKWGLRYIY |
| Ga0137392_107399001 | 3300011269 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSSQRPPAPAQAKGVTLPEPVFPQFPERLNDPGPGTPPEKRQ |
| Ga0137362_110865931 | 3300012205 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSNQRLPAPAEARGLTLPDPVSPQFPGRLKDPGPETPPEKRKWGPKDVYRY |
| Ga0137396_111841041 | 3300012918 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTHLPPAPAEALGLTLPEPVFPRFPDRLKDPGPETPPEKRKWGPK |
| Ga0137359_103705812 | 3300012923 | Vadose Zone Soil | MTAAPKFRIDVAINDLLTDGSHRSSGPAEAKGLTLPVPVFPQFPERLNDPGPEVAPERRKWGLRD |
| Ga0137413_112633941 | 3300012924 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSAQKPPMPAEAMGLTLPEPVFPQFPDRLNDPGPDTPPEKRKWGPKD |
| Ga0181535_106058901 | 3300014199 | Bog | MTAAPKFRIDVSINDLLTSAGQKPAAPAEAKGLSLPTPVFPQFPERVMESGPEVPPEKRKWGLRDIHR |
| Ga0137414_12230545 | 3300015051 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTKTPPMPAEAMGLTLPEPVFPRFPDRLKDPGPETPPEKRQ |
| Ga0137418_101969322 | 3300015241 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTKTPPMPAEAMGLTLPEPLFPRFPDRLKDPGPETPPEKRKWGPKDVYRYSR |
| Ga0132258_133271182 | 3300015371 | Arabidopsis Rhizosphere | MATAAKFRIDVAIHDLLTASNRRGPWPVEPKGLTLPQPVFPQYPDLLNNPGPATPPEKRRWAVRDIYRTSEGW |
| Ga0182033_111201851 | 3300016319 | Soil | MATAARFRIDVTIHNLLTNASQPPLAPAEPKGFPLPDPVFPQFPKRLNDPGLEVPPDKRKWGLRDIYRTSRGW |
| Ga0187802_101660842 | 3300017822 | Freshwater Sediment | MATAAKFRIDVAIHDLLMDSSQRPPAPAQAKGLTLPEPVFPQFPERLHDPGPDIPLEKR |
| Ga0187783_106142441 | 3300017970 | Tropical Peatland | LATAAKFRINVAIHELLMNSNPQGPAPALVKGATLPEPVFPQYPEKLSDPGPENLPLRREWKR |
| Ga0187782_108712401 | 3300017975 | Tropical Peatland | MTAAPKFRIDVTINDLLTATTQRSPVPAEARGLTLPAPVFPQHPERLNDPGPDVVPEKRKWGLNDIYRIS |
| Ga0187782_114339841 | 3300017975 | Tropical Peatland | MTAAPKFRIDVTIHDLLTTGSQRPSSPAEAKGLTLPTPVFPQYPQRLKDPGPDVPPQKRKWGINDIHRISRGW |
| Ga0181504_12546001 | 3300019258 | Peatland | MATAAKFRIDVAIHDLLMNSAQRSPAPASARGLTLPEPEFPQYPERLSDPGPETPPE |
| Ga0193729_12156781 | 3300019887 | Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLTLPEPVFPQFAERLDDPGPE |
| Ga0179594_100780891 | 3300020170 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSNQRPPAPAESMGLTLPEPVFPRFPDRLKDPGPETPPEKRKWGPKDVYRYSRGW |
| Ga0179592_101427601 | 3300020199 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTQRPPAPAEARGLTLPEPVFPQFPERLSDPGPETPP |
| Ga0210407_104174891 | 3300020579 | Soil | MATVAKFRIDVAIHDLLMNSSQRRPEPAEARGLSLPDPVFPQFPDRLKDPGPETP |
| Ga0210399_110891871 | 3300020581 | Soil | MATTAKFRIDVAIHDLLMNSKHRPPAPAEAMGLTLPEPVFPRFPDRLKDPGPEASPEKRKWGPKDV |
| Ga0210395_112823271 | 3300020582 | Soil | MATAAKFRIDVAIHDLLVNSNQRPPAPAQAMGLTLPEPVFPQFPERIDDPGPETPPEKR |
| Ga0210404_100774561 | 3300021088 | Soil | MATAAKFRIDVAIHDLLMNSTQRSPAPAEAKGLTLPEPIFPQFPERLRDPGPETPPERRKWGVKDVYLT |
| Ga0210404_101064362 | 3300021088 | Soil | VASAAKFRIDVAIHDLLMNSNPQGPAPALVKGQELPEPVFPQSPEKLRDPGP |
| Ga0210406_101738972 | 3300021168 | Soil | MATAAKFRIDVAIHDLLMNSSERLPAPAQAQGLTLPEPIFPQFPGRLTDPGPDSPPEKRKWGPKDVYRY |
| Ga0210400_105329771 | 3300021170 | Soil | MATAAKFRIDIAIHDLLTNSTQRPPAPAQAKGLTLPEPVFPQFPERLNDPGPEVP |
| Ga0210408_100544641 | 3300021178 | Soil | MATAAKFRIDVAIHDLLMNSTQRSPAPVEARGLTLPEPVFPQFPERINDSGPEAPPEK |
| Ga0210396_105983551 | 3300021180 | Soil | MATATKFRIDVAIHDLLMNSTQMPPSPAESMGLTLPEPLFPQFPDRLNDPGPQFPPEKRHWGA |
| Ga0210396_107925011 | 3300021180 | Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLTLPKPVFPQFPERLNNPGPETP |
| Ga0210397_113148861 | 3300021403 | Soil | MATAAKFRIDVAIHDLLMNSSQRSPAPAQAKGLTLPEPVFPQFPERVDDPGPETPP |
| Ga0210387_118682571 | 3300021405 | Soil | VASAAKFRIDVAIHDLLMNSNPQGPAPALVKGQELPEPVFPQHPEKLRDPGPEV |
| Ga0210386_101460661 | 3300021406 | Soil | MATAAKFRIDVAIHDLLMNSAQMPPQPAEARGLTLPEPVFPRFPERLKDPGPETPLEKRNWGPKDVYRYSRGW |
| Ga0210383_110039121 | 3300021407 | Soil | MATAAKFRIDVAIHDLLVNSNQRPPAPAQAMGLTLPEPVFPQFPERIDDPGPETPPEKRQWGAK |
| Ga0210394_113714311 | 3300021420 | Soil | MATAAKFRIDVAIHDLLMNSTQMPPLPAEPMGLTLPEPLFPQFPERLKDPGPETP |
| Ga0210384_103527912 | 3300021432 | Soil | MATAAKFRIDVAIHDLLMNSTERPPAPAEARGLTLPEPIFPQHPQRFNDPGPHT |
| Ga0210398_112335981 | 3300021477 | Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLALPEPVFPQFPERLDDPGPESP |
| Ga0210398_112591071 | 3300021477 | Soil | MATAAKFRIDVAIHDLLMNSNERPPVPAQVQGLTLPEPVFPQFPGRLTDPGPDSPPEKRKWG |
| Ga0242660_10069082 | 3300022531 | Soil | VATAAKFRIDVAIHDLLMNSTERPPTPAEAMGLTLPEPFFPQFPDRLKDPGPETPPERRKWG |
| Ga0137417_11623721 | 3300024330 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTHLPPAPAESMGLTLPEPVFPRFPDRLKDPGPETPPEKRKWGPK |
| Ga0137417_12656041 | 3300024330 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSTKTPPMPAEAMGLTLPEPVFPRFPDRLKDPGPETPPEKRQWGVKDVYRTSRGW |
| Ga0208935_10074461 | 3300025414 | Peatland | MATAAKFRIDVAIHDLLMNSAQRSPAPASARGLTLPEPEFPQYPERLSDPGPETPPEK |
| Ga0208773_10342221 | 3300026034 | Rice Paddy Soil | MASPAKFRIDVAIHDLLMNSSAQKPTPLVSKGLTLSEPVFPQYPEKLHDPGPE |
| Ga0209131_12007031 | 3300026320 | Grasslands Soil | MATAAKFRIDVAIHDLLMNSAQKPPTPAEAMGLTLPEPVFPQFPDRLNDPGPDTPPEKRKWGP |
| Ga0257149_10487471 | 3300026355 | Soil | MATAAKFRIDVAIHDLLMNSSQRPPAPAQAKGVTLPEPVFPQFPERLNDPGPGTPPEKRQWGAKD |
| Ga0257167_10523571 | 3300026376 | Soil | MATAAKFRIDVAIHDLLMNSTHLPPAPAESMGLTLPEPVFPRFPDRLKDPGPETPPEKRKWGPKD |
| Ga0179587_101083583 | 3300026557 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSAQKPPMPAEAMGLTLPEPVFPQFPDRLNDPGPDTPPEKRKWGPKDIYRYS |
| Ga0208730_10118401 | 3300027047 | Forest Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLTLPKPVFPQFPERLNNPGPETPPEKRKWGAKD |
| Ga0209732_10078401 | 3300027117 | Forest Soil | MATAAKFRIDVAIHDLLMNSTERPPAPAEARGLTLPQPIFPQHPQRFSDPGPDTPPEKRK |
| Ga0209622_10130961 | 3300027502 | Forest Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGLTLPEPVFPQFPGRLHDPGPETPPAKRKWGAKDVYRYSRGWLGPYV |
| Ga0209221_10908211 | 3300027609 | Forest Soil | MATAAKFRIDVAIHDLLTNSTQRPPAPAQDMGLTLPEPIFPQYPERIKDPGPEAPPEKRKWGVK |
| Ga0209422_11263351 | 3300027629 | Forest Soil | MATAAKFRIDVAIHDLLMNSTERPPAPAEARGLTLPQPIFPQHPQRFNDPGPDTPPEKR |
| Ga0209388_10944812 | 3300027655 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSNQRPPAPAESMGLTLPEPVFPRFPDRLKDPGPET |
| Ga0208990_11157761 | 3300027663 | Forest Soil | MATAAKFRIDVAIHDLLMNSTHLPPAPAESMGLTLPEPVFPRFPDRLKDPGPETPPEK |
| Ga0209248_101866731 | 3300027729 | Bog Forest Soil | MATAAKFRIDVAIHDLLMNSAERTPAPAEARGLTLPEPVFPQYPDRLQDPGPES |
| Ga0209579_103129941 | 3300027869 | Surface Soil | MATAAKFRIDVAIHDLLMNSNQRRPEPAEARGLTLPEPVFPQFPDRLKDPGPEAPP |
| Ga0209169_101541572 | 3300027879 | Soil | MATAAKFRIDVAIHDLLVNSTQPQPAPAEAMGLSLPEPVFPQFPERLNDPGPETPPQKRKWGVKDIYRT |
| Ga0209488_103518471 | 3300027903 | Vadose Zone Soil | MATAAKFRIDVAIHDLLMNSSQRPPAPAQAKGVTLPEPVFPQFPERLNDPGPGTPP |
| Ga0209526_109184381 | 3300028047 | Forest Soil | MATAAKFRIDVAIHDLLMNSSQKPPMPAEAMGFTLPEPVFPRFPDRLKDPGPEAPPEKRKWGAK |
| Ga0308309_103733681 | 3300028906 | Soil | MATAAKFRIDVAIHDLLVNSSQRPPAPAQAKGFTLPEPVFPQFPERLDDPGPESPPEKRKWGARDVYRYSRGWLGPYVRS |
| Ga0308309_110042831 | 3300028906 | Soil | VASAAKFRIDVAIHDLLMNSNPQGPAPALVKGQELPEPVFPQFPEKLRDPGPDVAP |
| Ga0308309_111489471 | 3300028906 | Soil | MATAAKFRIDVAIHDLLMNSSQRSPAPAQAKGLTLPQPVFPQFPERVDDPGPETQP |
| Ga0311338_109796121 | 3300030007 | Palsa | MATTAKFRIDVAIHDLLTNSSQQPPAPAANKGLTLPQPVFPQFPERLLDPGP |
| Ga0310039_101134712 | 3300030706 | Peatlands Soil | MATTAEFRIDVAIHDLLMKSAERPPAPAEAKGLTLPEAVFPQHPERLRDPGPETPPEKRKWGAK |
| Ga0302324_1025939041 | 3300031236 | Palsa | MATTAKFRIDVAIHDLLTNSSQQPPAPAANKGLTLPQPVFPQFPERLLDPGPTTPPQKRKWGAKDVYRAS |
| Ga0310915_110468811 | 3300031573 | Soil | MTAAPKFRIDVAINDLLTSTTQRPPAPAEEKGLTLPNPAFPQFPDRLNDPGPEVPPEKRKWGLHDI |
| Ga0310686_1144119361 | 3300031708 | Soil | MATAAKFRIDVAIHDLLMNSSERPAPAEAKGLTLPTPVFPQFPERLSDPGPVSPLETRKWGAKDSYRIARGWL |
| Ga0307478_111337151 | 3300031823 | Hardwood Forest Soil | MATAAKFRIDVAIHDLLMNSAQRSPAPAEAKGLTLPEPIFPQFPERLNTPGPEVPPE |
| Ga0307478_117396011 | 3300031823 | Hardwood Forest Soil | MATAARFRIDVAIHDLLMNSTQRPPAPAEAKGLTLPEPVFPQFPERLKDPGPETPPEKRQWGAKDVY |
| Ga0307479_119468201 | 3300031962 | Hardwood Forest Soil | MATAAKFRIDVAIHDLLMNSSQRPPTPAEAMGLTLPEPFFPQFPDRLKDPGPETPPEKRQWGAKDIYRYSRGW |
| Ga0307471_1004712561 | 3300032180 | Hardwood Forest Soil | MATAAKFRIDVAIHDLLVNSAQRPPTPAQAKGLSLPEPVFPQFPERLNNPGPETPPERRKWGAKDVYRYSRGWLGPYVRSRVLPGE |
| Ga0306920_1018446453 | 3300032261 | Soil | MTAAPKFRIDVAINDLLTAPTQRQPAPAPPKGLTLPKPVFPQFPERLNDPVQQFRPRSGN |
| Ga0335085_105071423 | 3300032770 | Soil | MATAAKFRIDVDIHDLLNKPGQQTPAPAERRGLVLPDAVYPQFPERLNDPGPDVPPQKRKWGRKDIYRYSRGWL |
| Ga0335082_115045632 | 3300032782 | Soil | VASPAKFRIDVAIHDLLMNSAPTGPQPASVKGEILPTPVFPQYPEKLNDPGPEVP |
| Ga0335078_100045051 | 3300032805 | Soil | MTAAPKFRIDVDINDLLTNENRRPPMPAQAKGLTFPEPVFPQFPRRLDDPGPDAAPLKRKWGLRDIHRTS |
| Ga0335069_100364987 | 3300032893 | Soil | VATAPKFRIDVAIHDLLVNTPELRPVPVEGKGLTLPEAVFPQFPDRLRDPG |
| Ga0335075_104089751 | 3300032896 | Soil | MATAAKFRIDVAIHDLLMNSKERAPLPAAPKGFTLPSPVFPQHPDRLKDSGPSAPPVKRKWGIADI |
| Ga0335071_116205382 | 3300032897 | Soil | VASPTNFRIDVAIHDLLMNSAPTRPQPASVKGEILPTPVFPQYPEKLNDPGPEVPPEKRKWGLK |
| Ga0335076_102677463 | 3300032955 | Soil | MATAAKFRIDVTIHDVLTNSSQRPPAPAAHKGLTLPQPVFPQRPDLLNDPGPATPP |
| ⦗Top⦘ |