| Basic Information | |
|---|---|
| Family ID | F093610 |
| Family Type | Metagenome |
| Number of Sequences | 106 |
| Average Sequence Length | 63 residues |
| Representative Sequence | GKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Number of Associated Samples | 81 |
| Number of Associated Scaffolds | 106 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 28.30 % |
| % of genes from short scaffolds (< 2000 bps) | 26.42 % |
| Associated GOLD sequencing projects | 72 |
| AlphaFold2 3D model prediction | No |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (72.642 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (33.962 % of family members) |
| Environment Ontology (ENVO) | Unclassified (53.774 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) (60.377 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 74.60% β-sheet: 0.00% Coil/Unstructured: 25.40% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 106 Family Scaffolds |
|---|---|---|
| PF02830 | V4R | 58.49 |
| PF06505 | XylR_N | 1.89 |
| PF01882 | DUF58 | 0.94 |
| PF07726 | AAA_3 | 0.94 |
| PF03952 | Enolase_N | 0.94 |
| COG ID | Name | Functional Category | % Frequency in 106 Family Scaffolds |
|---|---|---|---|
| COG1719 | Predicted hydrocarbon binding protein, contains 4VR domain | General function prediction only [R] | 58.49 |
| COG0148 | Enolase | Carbohydrate transport and metabolism [G] | 0.94 |
| COG1721 | Uncharacterized conserved protein, DUF58 family, contains vWF domain | Function unknown [S] | 0.94 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 72.64 % |
| All Organisms | root | All Organisms | 27.36 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| 3300005167|Ga0066672_10748840 | All Organisms → cellular organisms → Archaea | 620 | Open in IMG/M |
| 3300005167|Ga0066672_10748841 | All Organisms → cellular organisms → Archaea | 620 | Open in IMG/M |
| 3300005174|Ga0066680_10220516 | All Organisms → cellular organisms → Archaea | 1201 | Open in IMG/M |
| 3300005174|Ga0066680_10930483 | All Organisms → cellular organisms → Archaea | 513 | Open in IMG/M |
| 3300005176|Ga0066679_10849841 | All Organisms → cellular organisms → Archaea | 579 | Open in IMG/M |
| 3300005178|Ga0066688_10813200 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 583 | Open in IMG/M |
| 3300005554|Ga0066661_10700876 | All Organisms → cellular organisms → Archaea | 594 | Open in IMG/M |
| 3300005586|Ga0066691_10485195 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 738 | Open in IMG/M |
| 3300006794|Ga0066658_10562098 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 620 | Open in IMG/M |
| 3300006806|Ga0079220_11801910 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 539 | Open in IMG/M |
| 3300009012|Ga0066710_104739691 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 508 | Open in IMG/M |
| 3300009088|Ga0099830_11425027 | All Organisms → cellular organisms → Archaea | 576 | Open in IMG/M |
| 3300009089|Ga0099828_11144704 | All Organisms → cellular organisms → Archaea | 691 | Open in IMG/M |
| 3300010321|Ga0134067_10061554 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1228 | Open in IMG/M |
| 3300012201|Ga0137365_10889259 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 650 | Open in IMG/M |
| 3300012350|Ga0137372_10136038 | All Organisms → cellular organisms → Archaea | 2022 | Open in IMG/M |
| 3300012362|Ga0137361_11285479 | All Organisms → cellular organisms → Archaea | 656 | Open in IMG/M |
| 3300012972|Ga0134077_10389142 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 600 | Open in IMG/M |
| 3300017659|Ga0134083_10379972 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 613 | Open in IMG/M |
| 3300018468|Ga0066662_10338105 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1288 | Open in IMG/M |
| 3300018468|Ga0066662_12977587 | All Organisms → cellular organisms → Archaea | 503 | Open in IMG/M |
| 3300026306|Ga0209468_1032555 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1816 | Open in IMG/M |
| 3300026313|Ga0209761_1304099 | All Organisms → cellular organisms → Archaea | 553 | Open in IMG/M |
| 3300026325|Ga0209152_10039817 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1627 | Open in IMG/M |
| 3300026342|Ga0209057_1221776 | All Organisms → cellular organisms → Archaea | 537 | Open in IMG/M |
| 3300026361|Ga0257176_1062963 | All Organisms → cellular organisms → Archaea | 594 | Open in IMG/M |
| 3300026536|Ga0209058_1256650 | All Organisms → cellular organisms → Archaea | 611 | Open in IMG/M |
| 3300031962|Ga0307479_10054000 | Not Available | 3868 | Open in IMG/M |
| 3300032180|Ga0307471_103715555 | All Organisms → cellular organisms → Archaea | 540 | Open in IMG/M |
| 3300032180|Ga0307471_104039661 | All Organisms → cellular organisms → Archaea | 518 | Open in IMG/M |
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 33.96% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 33.02% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 12.26% |
| Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 8.49% |
| Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.72% |
| Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 2.83% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.89% |
| Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.94% |
| Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.94% |
| Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.94% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental | Open in IMG/M |
| 3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental | Open in IMG/M |
| 3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental | Open in IMG/M |
| 3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental | Open in IMG/M |
| 3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental | Open in IMG/M |
| 3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental | Open in IMG/M |
| 3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental | Open in IMG/M |
| 3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental | Open in IMG/M |
| 3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental | Open in IMG/M |
| 3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental | Open in IMG/M |
| 3300005540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 | Environmental | Open in IMG/M |
| 3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental | Open in IMG/M |
| 3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental | Open in IMG/M |
| 3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental | Open in IMG/M |
| 3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental | Open in IMG/M |
| 3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental | Open in IMG/M |
| 3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental | Open in IMG/M |
| 3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental | Open in IMG/M |
| 3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental | Open in IMG/M |
| 3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental | Open in IMG/M |
| 3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental | Open in IMG/M |
| 3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental | Open in IMG/M |
| 3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental | Open in IMG/M |
| 3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental | Open in IMG/M |
| 3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental | Open in IMG/M |
| 3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental | Open in IMG/M |
| 3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental | Open in IMG/M |
| 3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental | Open in IMG/M |
| 3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental | Open in IMG/M |
| 3300010321 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015 | Environmental | Open in IMG/M |
| 3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental | Open in IMG/M |
| 3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental | Open in IMG/M |
| 3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental | Open in IMG/M |
| 3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental | Open in IMG/M |
| 3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental | Open in IMG/M |
| 3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental | Open in IMG/M |
| 3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental | Open in IMG/M |
| 3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental | Open in IMG/M |
| 3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental | Open in IMG/M |
| 3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental | Open in IMG/M |
| 3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental | Open in IMG/M |
| 3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental | Open in IMG/M |
| 3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental | Open in IMG/M |
| 3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental | Open in IMG/M |
| 3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental | Open in IMG/M |
| 3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental | Open in IMG/M |
| 3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental | Open in IMG/M |
| 3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental | Open in IMG/M |
| 3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental | Open in IMG/M |
| 3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental | Open in IMG/M |
| 3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental | Open in IMG/M |
| 3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental | Open in IMG/M |
| 3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental | Open in IMG/M |
| 3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental | Open in IMG/M |
| 3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026298 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026306 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes) | Environmental | Open in IMG/M |
| 3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental | Open in IMG/M |
| 3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental | Open in IMG/M |
| 3300026325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes) | Environmental | Open in IMG/M |
| 3300026331 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes) | Environmental | Open in IMG/M |
| 3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental | Open in IMG/M |
| 3300026342 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes) | Environmental | Open in IMG/M |
| 3300026361 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B | Environmental | Open in IMG/M |
| 3300026499 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-B | Environmental | Open in IMG/M |
| 3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental | Open in IMG/M |
| 3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental | Open in IMG/M |
| 3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental | Open in IMG/M |
| 3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental | Open in IMG/M |
| 3300027947 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes) | Environmental | Open in IMG/M |
| 3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental | Open in IMG/M |
| 3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental | Open in IMG/M |
| 3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| JGI25385J37094_101179281 | 3300002558 | Grasslands Soil | VKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN* |
| JGI25383J37093_100654601 | 3300002560 | Grasslands Soil | KVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHTAN* |
| JGI25384J37096_100422953 | 3300002561 | Grasslands Soil | LFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL* |
| JGI25382J37095_100889852 | 3300002562 | Grasslands Soil | LFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN* |
| JGI25382J43887_100871502 | 3300002908 | Grasslands Soil | LATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLTRELAEATQPAN |
| Ga0066672_107488402 | 3300005167 | Soil | PGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVGEAVETLERELSGATKIAN* |
| Ga0066672_107488412 | 3300005167 | Soil | PGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVGEAVETLERELSGATKIAN* |
| Ga0066680_102205163 | 3300005174 | Soil | PGVRIKDAVKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0066680_109304831 | 3300005174 | Soil | AVKVGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSVQQSSL* |
| Ga0066679_101835741 | 3300005176 | Soil | GKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066679_108498411 | 3300005176 | Soil | RFLQAFEFQLDVINNLSRSVFVIDNPNSAFAKKIVEAVETLERELAGATKIAN* |
| Ga0066679_109495082 | 3300005176 | Soil | ELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066690_102748141 | 3300005177 | Soil | VKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVEILERELSGATKIAN* |
| Ga0066688_108132001 | 3300005178 | Soil | GKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066685_102698151 | 3300005180 | Soil | KVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVETLERELSGATKIAN* |
| Ga0066685_105316211 | 3300005180 | Soil | KVGKELADLFKSKFLEAFEFQIDVINNLGRSVFILDNPNSTFARKVGDAADYLVRELGEAPKIAN* |
| Ga0066686_110495062 | 3300005446 | Soil | KSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL* |
| Ga0066689_100131655 | 3300005447 | Soil | IKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEATQPAN* |
| Ga0066697_100638211 | 3300005540 | Soil | SQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0066701_102421851 | 3300005552 | Soil | SQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0066661_103936451 | 3300005554 | Soil | LQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0066661_107008761 | 3300005554 | Soil | LQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0066704_105443932 | 3300005557 | Soil | LFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066698_102259923 | 3300005558 | Soil | KELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066698_102994631 | 3300005558 | Soil | RFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAADMLTKELEEIPKAAN* |
| Ga0066698_105848821 | 3300005558 | Soil | KVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0066700_103142301 | 3300005559 | Soil | FEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0066708_103242661 | 3300005576 | Soil | KVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0066691_104851951 | 3300005586 | Soil | GVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDYPDSGFAHKINEAAEMLINELAEATQPAN* |
| Ga0066656_108688921 | 3300006034 | Soil | GKELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN* |
| Ga0066658_105620982 | 3300006794 | Soil | LFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0079220_118019101 | 3300006806 | Agricultural Soil | KELATLFRSHFLQAFEFQIDVINNLSRSVFVVDNPDSPFAHKVSEAVELLTQELAEVPKSAN* |
| Ga0099791_103887081 | 3300007255 | Vadose Zone Soil | LQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN* |
| Ga0099794_103383541 | 3300007265 | Vadose Zone Soil | PGVRIKDAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKLAN* |
| Ga0066710_1047396911 | 3300009012 | Grasslands Soil | VKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Ga0099830_113371961 | 3300009088 | Vadose Zone Soil | DAMKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN* |
| Ga0099830_114250272 | 3300009088 | Vadose Zone Soil | ELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN* |
| Ga0099828_104803521 | 3300009089 | Vadose Zone Soil | QELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPYSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0099828_104980971 | 3300009089 | Vadose Zone Soil | QAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN* |
| Ga0099828_109797262 | 3300009089 | Vadose Zone Soil | FLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0099828_111447041 | 3300009089 | Vadose Zone Soil | PPGVRIKDAVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADTLERELSGATKIAN* |
| Ga0099827_105708132 | 3300009090 | Vadose Zone Soil | FRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKIGEAVETLERELSGATKIAN* |
| Ga0066709_1045196961 | 3300009137 | Grasslands Soil | VKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0126373_108096701 | 3300010048 | Tropical Forest Soil | ATLFKSHFLQAFEFQIDVINNLSRSVFVIDNPDSPFAHKVSEAVELLTKELEEAPKTAN* |
| Ga0134067_100615542 | 3300010321 | Grasslands Soil | PPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0134067_101498091 | 3300010321 | Grasslands Soil | LFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0134080_100054501 | 3300010333 | Grasslands Soil | KELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN* |
| Ga0126379_103275091 | 3300010366 | Tropical Forest Soil | GKELATLFKSHFLQAFEFQIDVINNLSRSVFVIDNPDSPFAHKVSEAVELLTKELEEAPKTAN* |
| Ga0126383_115881561 | 3300010398 | Tropical Forest Soil | KFLAAFEFQIDVINNLSRSVFVMDNPNSSFAKKVDEAVGNLEKELNGSS* |
| Ga0137391_103350191 | 3300011270 | Vadose Zone Soil | LFRSKFLEAFEFQIDVINNLSRSVFVLDHPTSTFARKVGEAADFLVKELGEAPKIAN* |
| Ga0137391_112935411 | 3300011270 | Vadose Zone Soil | TLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN* |
| Ga0137393_108836512 | 3300011271 | Vadose Zone Soil | HSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLTRELAEEPQPAN* |
| Ga0137388_102587074 | 3300012189 | Vadose Zone Soil | LQAFEFQLDVINNLSRSVFVVDNPDSAFAKKIGEAVETLERELSGATKIAN* |
| Ga0137388_107903842 | 3300012189 | Vadose Zone Soil | VRIKDAIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN* |
| Ga0137365_107473531 | 3300012201 | Vadose Zone Soil | FRSRFLQAFEFQLDVINNLSRSVFVIDNPDSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0137365_108892591 | 3300012201 | Vadose Zone Soil | TLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSGFAHKINEAAEMLINELAEVTQPAN* |
| Ga0137362_107538731 | 3300012205 | Vadose Zone Soil | DAVKVGRELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLVNELAEVTQPAN* |
| Ga0137380_109043261 | 3300012206 | Vadose Zone Soil | VKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAADILERELSGATKIAN* |
| Ga0137381_109183432 | 3300012207 | Vadose Zone Soil | ELADLFKSKFLEAFEFQIDVINNLSRSVFVLDHPNSTFARKVGDAADFLVKELGEAPRIAN* |
| Ga0137379_104685833 | 3300012209 | Vadose Zone Soil | VKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAADILERELSGATKIAN* |
| Ga0137378_105059621 | 3300012210 | Vadose Zone Soil | GKELATLFRSRFLQAFDFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAELLTKELAEVPATAN* |
| Ga0137387_100911321 | 3300012349 | Vadose Zone Soil | GQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0137387_101511191 | 3300012349 | Vadose Zone Soil | VKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0137387_113266511 | 3300012349 | Vadose Zone Soil | KELAILFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN* |
| Ga0137372_101360381 | 3300012350 | Vadose Zone Soil | DAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN* |
| Ga0137386_110640991 | 3300012351 | Vadose Zone Soil | KVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAADILERELSGATKIAN* |
| Ga0137369_108925282 | 3300012355 | Vadose Zone Soil | AVKVGKELAMLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAVELLMQELKEVPDSAN* |
| Ga0137384_113821031 | 3300012357 | Vadose Zone Soil | ADLFRSKFLEAFEFQIDVINNLSRSVFVLEHPNSTFARKVGEAADFLVKELGEAPKIAN* |
| Ga0137361_112854792 | 3300012362 | Vadose Zone Soil | GVRIKDAVKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0137390_101980284 | 3300012363 | Vadose Zone Soil | LFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0137390_104402162 | 3300012363 | Vadose Zone Soil | LQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN* |
| Ga0137395_108273201 | 3300012917 | Vadose Zone Soil | DAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAAEMLTKELEEIPKAAN* |
| Ga0137419_119451531 | 3300012925 | Vadose Zone Soil | RSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN* |
| Ga0134077_1000012615 | 3300012972 | Grasslands Soil | GKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSVQQSSL* |
| Ga0134077_103891422 | 3300012972 | Grasslands Soil | VPPGVRIKDAMKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN* |
| Ga0134087_104536751 | 3300012977 | Grasslands Soil | RLKDAVKVGKELAVLFRSRFLQAFDFQIDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN* |
| Ga0134078_101046051 | 3300014157 | Grasslands Soil | VRLKDAVKVGKELAVLFRSRFLQAFDFQIDVINNLSRSVFVIDNPDSAFAHKVNEAVELLTKELAEVPATAN* |
| Ga0134085_106049571 | 3300015359 | Grasslands Soil | AIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVIDNPDSSFAKKIGEAVETLERELSGATKIAN* |
| Ga0134083_103799722 | 3300017659 | Grasslands Soil | VPPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHTAN |
| Ga0066662_103381052 | 3300018468 | Grasslands Soil | PGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Ga0066662_129775871 | 3300018468 | Grasslands Soil | DAVKVGKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLEKELAGMSAQQSSL |
| Ga0137417_11672491 | 3300024330 | Vadose Zone Soil | RIKDAMKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN |
| Ga0209350_10344371 | 3300026277 | Grasslands Soil | FLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN |
| Ga0209237_11624652 | 3300026297 | Grasslands Soil | GVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATHPAN |
| Ga0209236_11942941 | 3300026298 | Grasslands Soil | AIKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN |
| Ga0209468_10325553 | 3300026306 | Soil | VPPGVRLKDAVKVGKELAVLFRSRFLQAFDFQMDVINNLSRSVFVMDNPDSAFAHKVNEAAELLTKELAEAPATAN |
| Ga0209761_13040991 | 3300026313 | Grasslands Soil | GRELAQLFRSKFLAAFEFQMDVINNLSRSVFVVDNPNSTFAHKVSGAAENLIRELGEAAKVAN |
| Ga0209686_11738491 | 3300026315 | Soil | GVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Ga0209152_100398172 | 3300026325 | Soil | VPPGVRIKDAVKVGKELATLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Ga0209267_11984962 | 3300026331 | Soil | GQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKIGEAVETLERELSGATKIAN |
| Ga0209804_12171872 | 3300026335 | Soil | FLQAFEFQMDVINNLSRSVFVVDNPDSAFAHKINEAAEMLTNELAEATQPAN |
| Ga0209057_11846052 | 3300026342 | Soil | LFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSGFAHKVTEAADMLTKELEEIPKAAN |
| Ga0209057_12217762 | 3300026342 | Soil | SQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN |
| Ga0257176_10629632 | 3300026361 | Soil | KVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN |
| Ga0257181_10148742 | 3300026499 | Soil | MKVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTRELEEIPKAAN |
| Ga0209806_10058419 | 3300026529 | Soil | GKDLAVLFKSHFLAAFEFQIDVINNLSRSVFVLDNPNSNFAKKVSEAVENLERELAGMSAQQSSL |
| Ga0209806_10866931 | 3300026529 | Soil | TLFKSRFLQAFEFQMDVINNLSRSVFVVDNPDSSFAHKINEAAEMLTNELAEATQPAN |
| Ga0209058_12566502 | 3300026536 | Soil | LFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKIGEAVETLERELSGATKIAN |
| Ga0209588_11570882 | 3300027671 | Vadose Zone Soil | IKVGQELSQLFRSRFLQAFEFQLDVINNLSRSVFVVDNPDSSFAKKVTEAVDTLERELSGATKIAN |
| Ga0209180_106175731 | 3300027846 | Vadose Zone Soil | KVGKELATLFHSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVNEAAEMLTKELEEIPKAAN |
| Ga0209868_10123141 | 3300027947 | Groundwater Sand | AVKVGRELAQLFRSRFLEAFEFQLDVINNLSRSVFVLDQPNSSFAKKISEAAENLTRQIGEIGETPKVAN |
| Ga0307479_100540006 | 3300031962 | Hardwood Forest Soil | ATLFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVSEAAEMLTNELREAPRAAN |
| Ga0307471_1006922341 | 3300032180 | Hardwood Forest Soil | LFKSRFLQAFEFQMDVINNLSRSVFVIDNPDSAFAHKVREAAEMLTNELTEAPKPAN |
| Ga0307471_1037155552 | 3300032180 | Hardwood Forest Soil | VKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSAFAKKVTEAVDTLERELSGATKIAN |
| Ga0307471_1040396611 | 3300032180 | Hardwood Forest Soil | PGVRIKDAMKVGQELSQLFRSHFLQAFEFQLDVINNLSRSVFVVDNPDSPFAKKVTEAVDTLERELSGATKIAS |
| Ga0307472_1010754382 | 3300032205 | Hardwood Forest Soil | VGKELATLFKSHFLQAFEFQMDVINNLSRSVFVIDNPDSSFAHKVTEAAEMLTNELREAPASAN |
| ⦗Top⦘ |