| Basic Information | |
|---|---|
| Family ID | F091814 |
| Family Type | Metagenome / Metatranscriptome |
| Number of Sequences | 107 |
| Average Sequence Length | 54 residues |
| Representative Sequence | TAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA |
| Number of Associated Samples | 92 |
| Number of Associated Scaffolds | 107 |

| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | Yes |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 87 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.48 |

| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |

| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (25.234 % of family members) |
| Environment Ontology (ENVO) | Unclassified (49.533 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (62.617 % of family members) |

| Predicted Topology & Secondary Structure | |
|---|---|
| Classification | Transmembrane (alpha-helical) |
| Signal Peptide | No |
| Secondary Structure Distribution | α-helix: 53.66%, β-sheet: 0.00%, Coil/Unstructured: 46.34% |

| Pfam ID | Name | % Frequency in 107 Family Scaffolds |
|---|---|---|
| PF12838 | Fer4_7 | 64.49 |
| PF00499 | Oxidored_q3 | 12.15 |
| PF00420 | Oxidored_q2 | 9.35 |
| PF01059 | Oxidored_q5_N | 1.87 |
| PF00662 | Proton_antipo_N | 0.93 |
| PF00361 | Proton_antipo_M | 0.93 |
| PF13237 | Fer4_10 | 0.93 |
| PF00146 | NADHdh | 0.93 |

| COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds |
|---|---|---|---|
| COG0839 | NADH:ubiquinone oxidoreductase subunit 6 (chain J) | Energy production and conversion [C] | 12.15 |
| COG1009 | Membrane H+-translocase/NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit | Energy production and conversion [C] | 1.87 |
| COG0650 | Formate hydrogenlyase subunit HyfC | Energy production and conversion [C] | 0.93 |
| COG1005 | NADH:ubiquinone oxidoreductase subunit 1 (chain H) | Energy production and conversion [C] | 0.93 |
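
The percent frequencies in the Pfam and COG tables can be converted back to approximate scaffold counts. The sketch below assumes each percentage is a whole-number count divided by the 107 family scaffolds and rounded to two decimals. The page does not confirm this, and the helper name `scaffold_count` is hypothetical.

```python
# Assumption: each "% Frequency" value is (count / 107) * 100, rounded to two decimals.
TOTAL_SCAFFOLDS = 107  # "Number of Associated Scaffolds" from the Basic Information table


def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Return the nearest whole number of scaffolds for a percent frequency."""
    return round(percent * total / 100)


# Percent frequencies copied from the Pfam table above.
pfam_percent = {
    "PF12838": 64.49,
    "PF00499": 12.15,
    "PF00420": 9.35,
    "PF01059": 1.87,
    "PF00662": 0.93,
}

pfam_counts = {pfam_id: scaffold_count(pct) for pfam_id, pct in pfam_percent.items()}

for pfam_id, count in pfam_counts.items():
    print(f"{pfam_id}: about {count} of {TOTAL_SCAFFOLDS} scaffolds")
```

Under that assumption, the smallest reported value (0.93 %) corresponds to a single scaffold, so the low-frequency rows in both tables should be read as isolated co-occurrences rather than consistent domain associations.
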

| Name | Rank | Taxonomy | Distribution |
|---|---|---|---|
| Unclassified | root | N/A | 100.00 % |

| Habitat | Taxonomy | Distribution |
|---|---|---|
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 25.23% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 17.76% |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 14.02% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 7.48% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.74% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.74% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.74% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.80% |
| Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.80% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 2.80% |
| Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.87% |
| Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 1.87% |
| Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.87% |
| Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.87% |
| Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.93% |
| Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.93% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.93% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.93% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.93% |
| Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.93% |
| Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.93% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.93% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.93% |

| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000953 | Soil microbial communities from Great Prairies - Kansas Corn soil | Environmental | Open in IMG/M |
| 3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental | Open in IMG/M |
| 3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental | Open in IMG/M |
| 3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental | Open in IMG/M |
| 3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental | Open in IMG/M |
| 3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental | Open in IMG/M |
| 3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental | Open in IMG/M |
| 3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental | Open in IMG/M |
| 3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental | Open in IMG/M |
| 3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental | Open in IMG/M |
| 3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental | Open in IMG/M |
| 3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental | Open in IMG/M |
| 3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental | Open in IMG/M |
| 3300005890 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 | Environmental | Open in IMG/M |
| 3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental | Open in IMG/M |
| 3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental | Open in IMG/M |
| 3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental | Open in IMG/M |
| 3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental | Open in IMG/M |
| 3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental | Open in IMG/M |
| 3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated | Open in IMG/M |
| 3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated | Open in IMG/M |
| 3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300009812 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 | Environmental | Open in IMG/M |
| 3300009813 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 | Environmental | Open in IMG/M |
| 3300010095 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010126 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010132 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental | Open in IMG/M |
| 3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental | Open in IMG/M |
| 3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental | Open in IMG/M |
| 3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental | Open in IMG/M |
| 3300010364 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015 | Environmental | Open in IMG/M |
| 3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental | Open in IMG/M |
| 3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012224 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_4_2 metaT (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
| 3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental | Open in IMG/M |
| 3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental | Open in IMG/M |
| 3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental | Open in IMG/M |
| 3300018029 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MG | Environmental | Open in IMG/M |
| 3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental | Open in IMG/M |
| 3300021081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo | Environmental | Open in IMG/M |
| 3300025319 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 1 | Environmental | Open in IMG/M |
| 3300025992 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 (SPAdes) | Environmental | Open in IMG/M |
| 3300026295 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026308 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 (SPAdes) | Environmental | Open in IMG/M |
| 3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental | Open in IMG/M |
| 3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental | Open in IMG/M |
| 3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental | Open in IMG/M |
| 3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental | Open in IMG/M |
| 3300026523 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 (SPAdes) | Environmental | Open in IMG/M |
| 3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental | Open in IMG/M |
| 3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental | Open in IMG/M |
| 3300027775 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes) | Environmental | Open in IMG/M |
| 3300027787 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes) | Environmental | Open in IMG/M |
| 3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental | Open in IMG/M |
| 3300027961 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes) | Environmental | Open in IMG/M |
| 3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental | Open in IMG/M |
| 3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental | Open in IMG/M |
| 3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| 3300032421 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3 | Environmental | Open in IMG/M |
| 3300033513 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_C | Environmental | Open in IMG/M |
| 3300033814 | Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17 | Environmental | Open in IMG/M |
| 3300033815 | Sediment microbial communities from East River floodplain, Colorado, United States - 31_s17 | Environmental | Open in IMG/M |

| Protein ID | Sample Taxon ID | Habitat | Sequence |
|---|---|---|---|
| JGI11615J12901_105144063 | 3300000953 | Soil | YLTVIATAIWFFHDQLGWAYDTRFSLALFGVNLALAVPLVFVLDRGHIVAGSVQRRRA* |
| JGI25384J37096_100405603 | 3300002561 | Grasslands Soil | VIATAIWFLHDRLGWTYDSRFALALFALNLVLAVPLFFVLDRGHIIAGSVVEQGGRA* |
| JGI25382J37095_102641531 | 3300002562 | Grasslands Soil | GYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA* |
| Ga0062590_1015411452 | 3300004157 | Soil | LGWAYDARFALALFGMNLLIGIPVFFVLDRGRLVAGSVAEERA* |
| Ga0066672_101849921 | 3300005167 | Soil | WILHAVLGWTYDTRFGLVLFGLNVLLAIPLFFVLDRGHIVAGSMAGERA* |
| Ga0066677_101267561 | 3300005171 | Soil | IWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGAVAEA* |
| Ga0066677_107357882 | 3300005171 | Soil | ATAIWFLHDGLGWTYDTRFALALFALNLALAVPLFFVLDRGRIVAGSMAEGEA* |
| Ga0066673_100838661 | 3300005175 | Soil | IWFLHDRLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0066690_100069381 | 3300005177 | Soil | AMWILHAVLGWTYDTRFGLVLFGLNVLLAIPLFFVLDRGHIVAGSMAGERA* |
| Ga0066690_100761395 | 3300005177 | Soil | TAIWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA* |
| Ga0066690_105841452 | 3300005177 | Soil | HDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA* |
| Ga0066684_105481811 | 3300005179 | Soil | LGWSYDTRFGLVLFALNVLLAVPLFFGLDRGHLIAGAVAEEPA* |
| Ga0066676_101985151 | 3300005186 | Soil | LGWTYDSRFALALFALNLVLAVALFFVLDRGHLIAGSVTEQGERA* |
| Ga0066388_1078043081 | 3300005332 | Tropical Forest Soil | LTVIATAIWLFHDFLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA* |
| Ga0070714_1008732421 | 3300005435 | Agricultural Soil | VLHERLGWAYDSRFALALFGVNILLAVPLFFVLDRGHIVSGSAAEERA* |
| Ga0066686_102042011 | 3300005446 | Soil | AIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA* |
| Ga0066689_101278451 | 3300005447 | Soil | LHDRLAWTYDTRFALTLFALNVLLAIPLFFVLDRGHIVAGSVAEEGRAS* |
| Ga0070706_1000953285 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VMLPLALGYITVMATAIWLLHARLGWNYDARFALALFSLNVLLAIPLFFALDRGHLISGSEARGET* |
| Ga0070706_1002731284 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | YLTVIATAIWFFHDQLGWAYDTRFNLALFAVNVALAVPLVFVLDRGHVVAGSVERRRA* |
| Ga0070707_1006940283 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | YITVMATAIWLLHARLGWNYDARFALALFSLNVLLAIPLFFALDRGHLISGSEARGET* |
| Ga0066692_108456752 | 3300005555 | Soil | IWFLHDQLGWMYDSRFALALFGLNVLLAVPLFFGLDRGHIIAGSVVEEGGRA* |
| Ga0066699_103046831 | 3300005561 | Soil | ALWILHAVLGWAYDTRFGLALFGLNVLLAIPLFFVLDRGHIVAGSMAGERA* |
| Ga0066702_107756072 | 3300005575 | Soil | IWFLHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVVEA* |
| Ga0066706_105401533 | 3300005598 | Soil | LVYLTVIATAIWYLHERLGWVYDARFALALGAVNVALAVPLFFVLDRGRLVSGSVAREGA |
| Ga0075285_10185221 | 3300005890 | Rice Paddy Soil | ATAIWLLHEGLGWTYDTRFALTLFGLNLLLAVPLFFVLDRGHIVSGSVAEERA* |
| Ga0075023_1006399411 | 3300006041 | Watersheds | AIWFFHDQLGWAYDTRFSVAMFAVNLALAVPLFFVLDRGHLVSGSVQRRRA* |
| Ga0066652_1017053211 | 3300006046 | Soil | VIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVSLLFWLDRGHIVAGSVAEEGGRA* |
| Ga0079222_111579042 | 3300006755 | Agricultural Soil | SYVSVVATAVWFFHDRLGWAYDTRFALALFGVNVLLAVPLFFVLDRGHIVSGSVAEERV* |
| Ga0066653_100532531 | 3300006791 | Soil | IWILHALLGWTYDTRFGLVLFALNVLLAIPLLFGLDRGHLVAGSVAEERA* |
| Ga0066665_100958311 | 3300006796 | Soil | TAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGRLVAGSVAEERA* |
| Ga0066665_108899382 | 3300006796 | Soil | AIWYLHARLGWAYDARFALALAAVNLALAVPLVFVLDRGRLVNGSVARERA* |
| Ga0066659_115289421 | 3300006797 | Soil | LTYLTVIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVPLLFWLDRGHIVAGSVAEEGGRA* |
| Ga0079221_102645391 | 3300006804 | Agricultural Soil | AALLPLALTYVTVIASAIWLLHDRMGWTYDSRFALTLFGLNVLLAVPLLFWLDRGHIVAGSMAEQGGRA* |
| Ga0075424_1025870171 | 3300006904 | Populus Rhizosphere | LLYVSVIATVVWVLRARLGWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSTAEERA |
| Ga0075435_1019651121 | 3300007076 | Populus Rhizosphere | VTVIATAIWFLHDQLGWMYNTQFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0099828_115239522 | 3300009089 | Vadose Zone Soil | AAIWFLHAQLGWEYDARFAATLFGVNLILGVFVFFVLDRGRIVSGSVARERG* |
| Ga0099828_117257162 | 3300009089 | Vadose Zone Soil | LGWGYDRRFALALFGVNVLLAVPLFFVLDRGRVIAGSVAEERV* |
| Ga0066709_1008999254 | 3300009137 | Grasslands Soil | LGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGQA* |
| Ga0066709_1023897911 | 3300009137 | Grasslands Soil | TAIWFLHDQLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0105067_11011522 | 3300009812 | Groundwater Sand | LVYLMVIATAIWYLHDQLGWSYDGRFAGVLFGVNLVLAVPLVFVLDRGRLVSGSMEEEEGKA* |
| Ga0105057_10611051 | 3300009813 | Groundwater Sand | QLGWSYDGRFAGVLFGVNLVLAVPLVFVLDRGRLVSGSMEEEEGKA* |
| Ga0127475_10582172 | 3300010095 | Grasslands Soil | VIATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS* |
| Ga0127482_10361251 | 3300010126 | Grasslands Soil | ATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS* |
| Ga0127482_11178283 | 3300010126 | Grasslands Soil | ATAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA* |
| Ga0127455_11708632 | 3300010132 | Grasslands Soil | WILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVGRSGSGAELTWPSA* |
| Ga0134070_101614893 | 3300010301 | Grasslands Soil | TAIWLLHDQLGWSYGTPFALALFGLNVLLAIPLFFVLDRGHLVSGSVAEEAG* |
| Ga0134070_102682542 | 3300010301 | Grasslands Soil | AIWILHEQLGWTYGTRFALALFALNVLLAIPLFFVLDRGRIVAGSVAEERA* |
| Ga0134088_101091841 | 3300010304 | Grasslands Soil | VLHDRLGWTYDSRFALALCGLNVLLAIPLFFVLDRGHLIAGSVAEGGA* |
| Ga0134064_101092551 | 3300010325 | Grasslands Soil | IATAIWILHGLLGWNYDRQFGLALFGLNVLLALPLLFLLDRGHLVAGAVAQEPS* |
| Ga0134064_103681552 | 3300010325 | Grasslands Soil | LAYVTVIATAIWILHEQLGWTYGTRFALALFGLNVLLAIPLFFVLDRGRIVAGSVAEERA |
| Ga0134111_104732622 | 3300010329 | Grasslands Soil | AYVTIIATAIWFLHDRLDWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0134080_102585441 | 3300010333 | Grasslands Soil | PLALGYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA* |
| Ga0126376_128055561 | 3300010359 | Tropical Forest Soil | IATAIWVFHDSLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA* |
| Ga0134066_101315222 | 3300010364 | Grasslands Soil | PLALTYLTVIASAIWFLHDRLGWTYDARFALTLFGLNVLLAVSLLFWLDRGHIVAGSVAEEGGRA* |
| Ga0137364_102576111 | 3300012198 | Vadose Zone Soil | GWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGRRA* |
| Ga0137382_108906491 | 3300012200 | Vadose Zone Soil | TIIATAIWFLHDRLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0137382_109140741 | 3300012200 | Vadose Zone Soil | ATAIWFLHDQLGWLYDSRFALALFGLNVLLAVPLFFGLDRGHIIAGSVVEEGGRA* |
| Ga0137380_100725325 | 3300012206 | Vadose Zone Soil | PLALGYVSVIATAIWLLHDRLAWTYDTRFALTLFALNALLAIPLFFVLDRGHIVAGSVAPEGGRA* |
| Ga0134028_12530762 | 3300012224 | Grasslands Soil | IWFLHDQLGWTYDTRFALALFGVNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0137387_100341861 | 3300012349 | Vadose Zone Soil | VMASAIWLLHARLGWTYDTRFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA* |
| Ga0137387_101889381 | 3300012349 | Vadose Zone Soil | TAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA* |
| Ga0137387_106104273 | 3300012349 | Vadose Zone Soil | LLHDRLAWTYDTRFALTLFALNALLAIPLFFVLDRGHIVAGSVAPEGGRA* |
| Ga0137369_109904292 | 3300012355 | Vadose Zone Soil | ALGYVTVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFLVLDRGHIIAGSEAAEGT* |
| Ga0137404_117431041 | 3300012929 | Vadose Zone Soil | TVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGA* |
| Ga0134110_103223211 | 3300012975 | Grasslands Soil | IATAMWILHAVLGWTYDTRFGLVLFVLNVLLAIPLFFGLDRGHLVAGSVAEERA* |
| Ga0134076_101494191 | 3300012976 | Grasslands Soil | QLGWTYGTRFALALFGLNVLLAIPLFFVLDRGRIVAGSVAEERA* |
| Ga0134075_100495335 | 3300014154 | Grasslands Soil | LTVIATAIWFLHNRLGWSYDSRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAGGV* |
| Ga0134078_101986573 | 3300014157 | Grasslands Soil | DGLGWTYDTRFALALFALNLTLAVPLFFVLDRGRIVAGSVAEA* |
| Ga0134112_103013921 | 3300017656 | Grasslands Soil | VQLGWAYDTRFALALGAVNVVLAIPLLFVLDRGHLVSGSVARGRA |
| Ga0134083_104653571 | 3300017659 | Grasslands Soil | HDWLGWTYDSRFALALFALNLVLAVPLFFVLDRGHLIAGSVVEQGERA |
| Ga0187824_100730123 | 3300017927 | Freshwater Sediment | WFFHDFLGWAYDTRFSLALFAVNVALAIPLLFVLDRGHIVAGSVERRRA |
| Ga0187787_102663082 | 3300018029 | Tropical Peatland | LALVYVTLLAGAIWVLHERLGWQYDQRFALALLGLNVALAIPLFFVLDRGHLIAGSEARAGGEV |
| Ga0179592_103422692 | 3300020199 | Vadose Zone Soil | GWNYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGP |
| Ga0215015_109809122 | 3300021046 | Soil | VLHNRLGWSYDQRFALALFGVNLLLAVPLFFVLDRGRLVAGSVAEERV |
| Ga0210379_103730552 | 3300021081 | Groundwater Sediment | ALGYLAVLATAIWVLHDRLGWAYNTRFALALFGVNLLIAVPLVFVLDRGRLVAGSVAEEH |
| Ga0209520_107190561 | 3300025319 | Soil | LASAIWLLHDRLGWAYDTRFSLALFGLNVLLAIPLFFVLDRGRIVAGSVAEEGA |
| Ga0208775_10179372 | 3300025992 | Rice Paddy Soil | ATAIWLLHEGLGWTYDTRFALTLFGLNLLLAVPLFFVLDRGHIVSGSVAEERA |
| Ga0209234_12363932 | 3300026295 | Grasslands Soil | ERLGWVYDARFALALGAVNVALAVPLFFVLDRGRLVSGSVARERA |
| Ga0209237_12865961 | 3300026297 | Grasslands Soil | TIIATAIWFLHDQLGWMYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA |
| Ga0209265_10502591 | 3300026308 | Soil | FLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA |
| Ga0209239_11502083 | 3300026310 | Grasslands Soil | VLATAIWILHALLGWSYDTRFGLVLFALNVLLAVPLFFGLDRGHLIAGAVAEEPA |
| Ga0209239_12001612 | 3300026310 | Grasslands Soil | TALWILHAVLGWSYDTRFGLVLFVLNVLLAIPLFFGLDRGHLVAGSVAEERA |
| Ga0209470_11084881 | 3300026324 | Soil | HARLGWTYDARFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA |
| Ga0209375_12352191 | 3300026329 | Soil | DRLGWPYDTRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAGGV |
| Ga0209804_11881853 | 3300026335 | Soil | LHDGLGWTYDTRFALALFALNLVLAVPLFFVLDRGRIVAGSVAEA |
| Ga0209159_11898971 | 3300026343 | Soil | DRLGWPYDTRFALALFGLNLLLAVLLFFVLDRGHIIAGSVAAEEGAAGRGA |
| Ga0209808_13176651 | 3300026523 | Soil | TANWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVVEEGGRA |
| Ga0209690_11525143 | 3300026524 | Soil | YITVMASAIWLLHARLGWTYDTRFALALFGLNVLLAIPLFFALDRGHLISGSEARGEA |
| Ga0209058_13266252 | 3300026536 | Soil | MLPLALGYVTVIATAMWILHGVLGWTYDTRFGLVLFALNVLLAIPLFFGLDRGHLVAGSVAEERA |
| Ga0179587_106445071 | 3300026557 | Vadose Zone Soil | VIATAIWLLHDQLGWNYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGP |
| Ga0209689_10070971 | 3300027748 | Soil | AVLGWAYDTRFGLALFGLNVLLAIPLFFVLDRGHIVAGSMAGERA |
| Ga0209177_100832121 | 3300027775 | Agricultural Soil | WFLHARLGWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSAAEERA |
| Ga0209074_102886301 | 3300027787 | Agricultural Soil | IATAIWFLHDQLGWTYDTRFALALFALNLALAVPLFFVLDRGRLVAGSMAEGEA |
| Ga0209180_103911853 | 3300027846 | Vadose Zone Soil | LHDRLGWGYDRRFALALFGVNVLLAVPLFFVLDRGRVIAGSVAEERV |
| Ga0209590_102142811 | 3300027882 | Vadose Zone Soil | LTVIATAIWFLHDQLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAAEGAGGGGA |
| Ga0209069_106787221 | 3300027915 | Watersheds | VYLTVIATAIWFFHDQLGWAYDTRFSVAMFAVNLALAVPLFFVLDRGHLVSGSVQRRRA |
| Ga0209853_10911661 | 3300027961 | Groundwater Sand | TVVASAIWLLHDRLGWVYDTRFSLALFGLNVLLAIPLFFVLDRGHIIAGSVAEERA |
| Ga0307282_104249921 | 3300028784 | Soil | DRLGWTYDTRFALVLFAVNLLLAVPLFFVLDRGHIIAGSVAEERA |
| Ga0308194_100823603 | 3300031421 | Soil | IWLLHDRLGWTYDTRFALVLFAVNLLLAVPLFFVLDRGHIIAGSVAEERA |
| Ga0307469_117070222 | 3300031720 | Hardwood Forest Soil | LHDRLKWAYDTRFALALFGMNLLIGIPLFFLLDRGRLVAGSVVEEQV |
| Ga0307471_1000101041 | 3300032180 | Hardwood Forest Soil | LSWAYDSRFALALFGVNLLLAVPLFFVLDRGHIVSGSAAEERA |
| Ga0307471_1014808121 | 3300032180 | Hardwood Forest Soil | VTVIATAIWFLHDRLGWMYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVLEEGGSA |
| Ga0307472_1023567502 | 3300032205 | Hardwood Forest Soil | LALGYVTVIATAIWFLHDQLGWIYNTQFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA |
| Ga0310812_104881872 | 3300032421 | Soil | ALIYLTVIATAIWFFHDQLGWAYDTRFSLALFGVNLALAVPLVFVLDRGHIVAGSVQRRR |
| Ga0316628_1011581343 | 3300033513 | Soil | YLTVVATAIWYLHAVLGWAYDMRFSLVMFALNLVLAVPVFLVLDRGHLISGSVQRRSA |
| Ga0364930_0138561_633_818 | 3300033814 | Sediment | VYLMVIATAIWYLHDQLGWGYDGRFAGVLFAINLALAVPLFFVLDRGRLVSGSMEGEGGK |
| Ga0364946_014478_3_170 | 3300033815 | Sediment | ATAIWYLHDQLGWGYDRRFAGVLFAVNLALAVPLVFVLDRGRLVSGSMEEEGGKA |
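
Many sequences in the table above end in `*`. The sketch below assumes this is the conventional stop-codon marker used in translated protein sequences, which the page itself does not state, and strips it before measuring length. The helper name `clean_protein` is hypothetical.

```python
def clean_protein(seq: str) -> str:
    """Strip surrounding whitespace and one trailing '*' (assumed stop-codon marker)."""
    seq = seq.strip()
    return seq[:-1] if seq.endswith("*") else seq


# Representative sequence from the Basic Information table.
representative = "TAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA"

# Sequence of family member Ga0137387_101889381, copied from the table above.
member = "TAIWFLHDRLGWTYDTRFALALFGLNLLLAVPLFFVLDRGHIIAGSVAEEGGRA*"

cleaned = clean_protein(member)
print(len(cleaned))               # residue count without the trailing marker
print(cleaned == representative)  # does this member match the representative?
```

Members without a trailing `*` pass through unchanged, so the same helper can be applied to every row before comparing lengths against the reported average of 54 residues.
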