| Basic Information | |
|---|---|
| Family ID | F105523 |
| Family Type | Metagenome |
| Number of Sequences | 100 |
| Average Sequence Length | 78 residues |
| Representative Sequence | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNELSKALSGQADDAKGALGVVGTLVGERLKVVEERVGAALERR |
| Number of Associated Samples | 63 |
| Number of Associated Scaffolds | 100 |
| Quality Assessment | |
|---|---|
| Transcriptomic Evidence | No |
| Most common taxonomic group | Unclassified |
| % of genes with valid RBS motifs | 0.00 % |
| % of genes near scaffold ends (potentially truncated) | 0.00 % |
| % of genes from short scaffolds (< 2000 bps) | 0.00 % |
| Associated GOLD sequencing projects | 59 |
| AlphaFold2 3D model prediction | Yes |
| 3D model pTM-score | 0.43 |
| Hidden Markov Model |
|---|
| Powered by Skylign |
| Most Common Taxonomy | |
|---|---|
| Group | Unclassified (100.000 % of family members) |
| NCBI Taxonomy ID | N/A |
| Taxonomy | N/A |
| Most Common Ecosystem | |
|---|---|
| GOLD Ecosystem | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (39.000 % of family members) |
| Environment Ontology (ENVO) | Unclassified (34.000 % of family members) |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Plant → Plant rhizosphere (38.000 % of family members) |
| ⦗Top⦘ |
| ⦗Top⦘ |
| Predicted Topology & Secondary Structure | |||||
|---|---|---|---|---|---|
| Classification: | Globular | Signal Peptide: | No | Secondary Structure distribution: | α-helix: 66.98% β-sheet: 0.00% Coil/Unstructured: 33.02% | Feature Viewer |
|
|
|||||
| Powered by Feature Viewer | |||||
| Structure Viewer | |
|---|---|
|
| |
| Per-residue confidence (pLDDT): 0-50 51-70 71-90 91-100 | pTM-score: 0.43 |
| Powered by PDBe Molstar | |
| ⦗Top⦘ |
| Pfam ID | Name | % Frequency in 100 Family Scaffolds |
|---|---|---|
| PF00535 | Glycos_transf_2 | 40.00 |
| PF13641 | Glyco_tranf_2_3 | 18.00 |
| PF01370 | Epimerase | 5.00 |
| PF16363 | GDP_Man_Dehyd | 4.00 |
| PF00534 | Glycos_transf_1 | 3.00 |
| PF13439 | Glyco_transf_4 | 2.00 |
| PF06897 | DUF1269 | 1.00 |
| PF01063 | Aminotran_4 | 1.00 |
| PF14013 | MT0933_antitox | 1.00 |
| PF06721 | DUF1204 | 1.00 |
| PF13632 | Glyco_trans_2_3 | 1.00 |
| PF01933 | CofD | 1.00 |
| COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds |
|---|---|---|---|
| COG0115 | Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase | Amino acid transport and metabolism [E] | 2.00 |
| COG0391 | Archaeal 2-phospho-L-lactate transferase/Bacterial gluconeogenesis factor, CofD/UPF0052 family | Carbohydrate transport and metabolism [G] | 1.00 |
| COG4803 | Uncharacterized membrane protein | Function unknown [S] | 1.00 |
| ⦗Top⦘ |
| Name | Rank | Taxonomy | Distribution |
| Unclassified | root | N/A | 100.00 % |
| Visualization |
|---|
| Powered by ApexCharts |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|
| ⦗Top⦘ |
| Habitat | Taxonomy | Distribution |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 39.00% |
| Polar Desert Sand | Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand | 10.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 7.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.00% |
| Wetland Sediment | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment | 6.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 6.00% |
| Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 4.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 4.00% |
| Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 3.00% |
| Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 2.00% |
| Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 2.00% |
| Freshwater | Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater | 1.00% |
| Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 1.00% |
| Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 1.00% |
| Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.00% |
| Serpentine Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil | 1.00% |
| Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 1.00% |
| Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.00% |
| Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.00% |
| Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 1.00% |
| Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.00% |
| Visualization |
|---|
| Powered by ApexCharts |
| Taxon OID | Sample Name | Habitat Type | IMG/M Link |
|---|---|---|---|
| 3300000033 | Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil | Environmental | Open in IMG/M |
| 3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental | Open in IMG/M |
| 3300003970 | Enrichment cultures from Lake Fryxell 39872 | Environmental | Open in IMG/M |
| 3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental | Open in IMG/M |
| 3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental | Open in IMG/M |
| 3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental | Open in IMG/M |
| 3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated | Open in IMG/M |
| 3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental | Open in IMG/M |
| 3300004778 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare3Fresh | Environmental | Open in IMG/M |
| 3300004779 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare3Fresh | Environmental | Open in IMG/M |
| 3300004782 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare2Fresh | Environmental | Open in IMG/M |
| 3300005093 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All Blocks | Environmental | Open in IMG/M |
| 3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental | Open in IMG/M |
| 3300006038 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5 | Host-Associated | Open in IMG/M |
| 3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental | Open in IMG/M |
| 3300006178 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-2 | Host-Associated | Open in IMG/M |
| 3300007521 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-01 | Environmental | Open in IMG/M |
| 3300007799 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-06 | Environmental | Open in IMG/M |
| 3300010036 | Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot26 | Environmental | Open in IMG/M |
| 3300012043 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ601 (22.06) | Environmental | Open in IMG/M |
| 3300012188 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ330 (21.06) | Environmental | Open in IMG/M |
| 3300012527 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ83 (22.06) | Environmental | Open in IMG/M |
| 3300012680 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ224A (23.06) | Environmental | Open in IMG/M |
| 3300012682 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ223 (23.06) | Environmental | Open in IMG/M |
| 3300015163 | Arctic soil microbial communities from a glacier forefield, Rabots glacier, Tarfala, Sweden (Sample Rb1b, glacier snout) | Environmental | Open in IMG/M |
| 3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated | Open in IMG/M |
| 3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated | Open in IMG/M |
| 3300017787 | Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ497 (22.06) (version 2) | Environmental | Open in IMG/M |
| 3300018073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1 | Environmental | Open in IMG/M |
| 3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental | Open in IMG/M |
| 3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental | Open in IMG/M |
| 3300018465 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 IS | Environmental | Open in IMG/M |
| 3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental | Open in IMG/M |
| 3300018469 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 T | Environmental | Open in IMG/M |
| 3300018476 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 T | Environmental | Open in IMG/M |
| 3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental | Open in IMG/M |
| 3300019361 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S133-311R-2 (version 2) | Environmental | Open in IMG/M |
| 3300019377 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 T | Environmental | Open in IMG/M |
| 3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental | Open in IMG/M |
| 3300022213 | Sediment microbial communities from San Francisco Bay, California, United States - SF_Oct11_sed_USGS_4_1 | Environmental | Open in IMG/M |
| 3300027831 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare3Fresh (SPAdes) | Environmental | Open in IMG/M |
| 3300027840 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare2Fresh (SPAdes) | Environmental | Open in IMG/M |
| 3300027843 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare3Fresh (SPAdes) | Environmental | Open in IMG/M |
| 3300027850 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-01 (SPAdes) | Environmental | Open in IMG/M |
| 3300028578 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT160D0 | Environmental | Open in IMG/M |
| 3300028590 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day30 | Environmental | Open in IMG/M |
| 3300028710 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_380 | Environmental | Open in IMG/M |
| 3300028802 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_S | Environmental | Open in IMG/M |
| 3300028810 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_151 | Environmental | Open in IMG/M |
| 3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental | Open in IMG/M |
| 3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental | Open in IMG/M |
| 3300030336 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1 | Environmental | Open in IMG/M |
| 3300030619 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq) | Environmental | Open in IMG/M |
| 3300031170 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_S | Environmental | Open in IMG/M |
| 3300031228 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT153D57 | Environmental | Open in IMG/M |
| 3300031455 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_S | Environmental | Open in IMG/M |
| 3300031576 | Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25 | Environmental | Open in IMG/M |
| 3300032013 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D3 | Environmental | Open in IMG/M |
| 3300032256 | Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_top | Environmental | Open in IMG/M |
| 3300032456 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-03 (spades assembly) | Environmental | Open in IMG/M |
| 3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental | Open in IMG/M |
| 3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental | Open in IMG/M |
| 3300034151 | Sediment microbial communities from East River floodplain, Colorado, United States - 2_s17 | Environmental | Open in IMG/M |
| Geographical Distribution | |
|---|---|
| Zoom: | Powered by OpenStreetMap |
| ⦗Top⦘ |
| Protein ID | Sample Taxon ID | Habitat | Sequence |
| ICChiseqgaiiDRAFT_23050531 | 3300000033 | Soil | MEPISVDRLTSAARDAAYVSVGLGVIAFQRLQVRRNELSKSLGGQAGQARGVVDGIGAVLAERVKLVEERLGSALDR* |
| ICChiseqgaiiDRAFT_23544511 | 3300000033 | Soil | MAPNPVDDXPEAVKDLAYVSVGLGVLAFQRLQVRRQELRKAMSGPADEARGTIEVLGALLGERVKMVEERIGSALKH* |
| JGI10216J12902_1010044521 | 3300000956 | Soil | VAQNPIDDLTSAVKDAVFVSVGLGVIAFQRLAVRRNELSKAISAQAEEARGALDLVGTAVGERVKAVEERVGATFDRSR* |
| JGI10216J12902_1037004491 | 3300000956 | Soil | MAQNPIDDLSSAVKDAVFVTVGLGVIAFQRIQVRRNELTKAISTQAEEARGALDVVGTLVGERLKAVEERVGATLDRSR* |
| JGI10216J12902_1135935382 | 3300000956 | Soil | MARTSIDDVTNAVKDAAYVSVGLSVIAFQRLQVRRNELEKALSGQAEEAKGALGVVGALVGERLKLVEERIDAAREQLSGYIPGATDR* |
| JGI10216J12902_1230873092 | 3300000956 | Soil | PPAEEHLMARSPIDDVTSAAKDAAYVSVGLGVIAFQRLQVRRHELHKALTDQTEEAKGAVDLVGALVGERVKMLEERLGAALAHR* |
| Ga0063602_1121021 | 3300003970 | Freshwater | MARPQIDDVTAVVKDAAYVSVGLGVIAFQRLQVRRNELSKALAGQTDDAKGTLDTVAAMVGDQLKLVEERVSAALERTPLR* |
| Ga0062593_1027107311 | 3300004114 | Soil | VARLQLDDLTNAAKDAAYVSVGLSVIAFQRLQVRRHELTKALESRSDEARGVLEVATSVVGDRVKMVEERLGA |
| Ga0062589_1020143321 | 3300004156 | Soil | MAQNPLDDVTAAVKDAALVTVGLGVIAFQRAQVRRNELEKAISAQAEEARGALGLVSELLGERLKAVEERVGATLDRGR* |
| Ga0062590_1011455282 | 3300004157 | Soil | MAQNPIDEVTAALRDAMFVSVGLGVIAFQRIQVRRNELSKAISAQAEEARGAIDVVGGLVGERLKAVEERVGATFDRSR* |
| Ga0062590_1016876431 | 3300004157 | Soil | MAQNPLDDLTSAAKDAVFVTVGLGVIAFQRIQVRRNELTKAITTQAEEARGALDVVSSLVGERLKSVEERVGATFDRSR* |
| Ga0063356_1014250011 | 3300004463 | Arabidopsis Thaliana Rhizosphere | LATPLYVAAMAQNPIEDVTAAVKDAMFVTVGLGVIAFQRIQVRRNELSRAIASQAEEARGALDVVGELVGDRLKAVEERVGATFDRSR* |
| Ga0063356_1054018601 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VARTPVEDLVNAVKDATYVSVGLGVIAFQRLQVRRNELAKAINGPVEEARGTLEVVGALVGERIKLVEERVSDAIKR* |
| Ga0062592_1005870382 | 3300004480 | Soil | HKCLPELATPLYVAAMAQNPIEDVTAAVKDAMFVTVGLGVIAFQRIQVRRNELSRAIASQAEEARGALDVVGELVGDRLKAVEERVGATFDRSR* |
| Ga0062592_1019293841 | 3300004480 | Soil | MAQNPLDDLTSAAKDAVFVTVGLSVIAFQRIQVRRNELTKAITTQAEEARGALDVVSSLVGERLKSVEERVGATFDRSR* |
| Ga0062383_100137794 | 3300004778 | Wetland Sediment | DITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR* |
| Ga0062380_100374512 | 3300004779 | Wetland Sediment | MTHSPLDDITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR* |
| Ga0062382_100217771 | 3300004782 | Wetland Sediment | PLDDITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR* |
| Ga0062594_1026361041 | 3300005093 | Soil | DLSSAVKDAVFVTVGLGVIAFQRIQVRRNELTKAISTQAEEARGALDVVGTLVGERLKAVEERVGATLDRSR* |
| Ga0070687_1007916482 | 3300005343 | Switchgrass Rhizosphere | MAQNPLDDVTAAVKDAALVTVGLGVIAFQRAQVRRNELEKAISAQAEEARGALGLVSELLGERLKAVEERVGATLDRGR |
| Ga0075365_109818022 | 3300006038 | Populus Endosphere | VARLQLDDLTNAAKDAAYVSVGLSVIAFQRMQVRRNELNKALEARSGEAREALDVVNALVSDRVKMVEERLGSVLDLTRR* |
| Ga0066652_1005828051 | 3300006046 | Soil | PMAQNPLDDVTAAVKDAALVTVGLGVIAFQRVQVRRNELEKAITAQAEEARGALDVVSALVGERLKAVEERVGATFDRGR* |
| Ga0075367_105952362 | 3300006178 | Populus Endosphere | VARLQLDDLTNAAKDAAYVSVGLSVIAFQRMQVRRNELNKALEARSGEARDALDVVNALVSDRVKMVEERLGSVLDITRR* |
| Ga0105044_100164672 | 3300007521 | Freshwater | MARLQIDEVADVVKDAAYVSVGLGVIAFQRMQVRRNELAKALKGRTDDAKGTLDVVGSLVNERLKLVEERVSSAVDITRR* |
| Ga0105049_1000468211 | 3300007799 | Freshwater | MARLQIDEVADVVKDAAYVSVGLGVIAFQRMQVRRNELAKALKGRTDDAKGTLDVVGSLVNERLKLVEERVSATLDITRR* |
| Ga0126305_103814572 | 3300010036 | Serpentine Soil | LARSPIDDVTSTAKDAAYVSVGLGLIAFQRLQVRRNELHKAISGQADEAKGALDLVSALVGDRVKVLEERLGAALEHR* |
| Ga0136631_102251582 | 3300012043 | Polar Desert Sand | DDVANAVKDATYVSVGLGVIAFQRLQVRRNELSKAFSGQADDPKGALGAVSTLVGERLKLVEERLGAARERR* |
| Ga0136618_100028431 | 3300012188 | Polar Desert Sand | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNELSKALSGQADDAKGALGVVGTLVGERLKVVEERVGAALERR* |
| Ga0136618_100258721 | 3300012188 | Polar Desert Sand | MARSPIDGVADAVKDAAFVSVGLGVIAFQRLQVRRNELNKALSGQADDARGALEVVGSLVGERLKVVEERLGAALERR* |
| Ga0136633_10044803 | 3300012527 | Polar Desert Sand | MARSPIDDVANAVMDATYVSVGLGVIAFQRMQVRRNELSKTLSGQAGQAKGTLDIVGAIVGERLKVVEERVEAAREQLTSRVTGSNDR* |
| Ga0136633_11207492 | 3300012527 | Polar Desert Sand | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNELSKALSGQADDAKGALGVVGTLVDERLKLVEERLGAAREGR* |
| Ga0136612_101224052 | 3300012680 | Polar Desert Sand | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNELGKALSGQADDARGALGVVGTLVGERLKLVEERLGAARERR* |
| Ga0136611_101237432 | 3300012682 | Polar Desert Sand | MARSPIDDVANVVKDATYVSVGLGVIAFQRLQVRRNELTKALSGQADDAKGALGILSAVVGERLKLVEERLGDAFERR* |
| Ga0136611_102588132 | 3300012682 | Polar Desert Sand | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNELSKALSGQADDAKGALGVVSTLVGERLKLVEERLGAARERR* |
| Ga0136611_105108201 | 3300012682 | Polar Desert Sand | MARSPIDDVANAVKDATYVSVGLGVIAFQRLQVRRNDLSKALSGQADDARGALGVVSTLVGERLKLVEERLGAARERR* |
| Ga0167665_10067272 | 3300015163 | Glacier Forefield Soil | MARTPVEDLANAVKDAAYVSVGLGVIAFQRMQVRRNELAKAMSGPVEETKGTLEVLGALVGERVKLVEERVTAALGR* |
| Ga0132258_100907263 | 3300015371 | Arabidopsis Rhizosphere | MAQNPIDEVTAALRDAMFVSVGLGVIAFQRIQVRRNELSKAISAQAEEARGAIDVVGGLVGGLVGERLKAVEERVGATFDRSR* |
| Ga0132258_126599811 | 3300015371 | Arabidopsis Rhizosphere | NPLDDLTSAAKDVVFVSVGLGVIAFQRIQVRRNELTKAITTQAEEARGALDVVGTLVGERLKSVEERVGATFDRTR* |
| Ga0132257_1016869132 | 3300015373 | Arabidopsis Rhizosphere | MAQNPIDEVTAALRDAMFVSVGLGVIAFQRIQVRRNELSKAISAQAEEARGALDVVGGLVGERLKAVEERVGATFDRSR* |
| Ga0183260_100593833 | 3300017787 | Polar Desert Sand | MARSPIDGVADVVKDAAFVSVGLGVIAFQWLQVRRNELNKALSGQADDARGALEVVGSLVGERLKVVEERLGAALERR |
| Ga0184624_100818962 | 3300018073 | Groundwater Sediment | MAQNPLDDVTAAIKDAAIVSVGLGVIAFQRAQVRRNELEKAITAQAEGARGAIGLVSALVEERLKAVEERVGATFDRGR |
| Ga0190265_100125233 | 3300018422 | Soil | MARLQVDDVASVVKDAAYVSVGLGVIAFQRLQVRRNELSKLLSSQAEDARGALDVVSALVGDRVKVVEERLTAVLDRSSH |
| Ga0190265_105641922 | 3300018422 | Soil | MARSPIDDVADVVKDAAYVSVGLGVIALQRLQVRRDELGKALSGQSDQVGGALGLVGSLVGERLKLVEERVSAARDRT |
| Ga0190265_110013872 | 3300018422 | Soil | MARSPIDDVAHVVKDAAYVSVGLGVIAFQRLQVRRNELSQALSGQAGEAKGALEVVGTLVGERLKLVEERLGAVLERK |
| Ga0190265_110640772 | 3300018422 | Soil | MAQKTLPSSLEDVTAAVKDAAFVTVGLGVIAFQRFQVRRTELAKAITTQAEEARGALDLVGSLFGERLKAVEERVGATFERER |
| Ga0190265_111744372 | 3300018422 | Soil | MARSPIDDVAHAVKDAAYVSVGLGVIAFQRLQVRRNELNRVLSGQADEAKGALEVVGILVGERLKVVEERLGAVLERK |
| Ga0190265_118255731 | 3300018422 | Soil | MAASPIDDVTNAVKDAAYVSVGLGVIAFQRVQVRRNELAKTLGERTGEAKGALEIVGNLVGDRLKVVEERVGAAIGRPAR |
| Ga0190272_101198272 | 3300018429 | Soil | MARMQVEDVASALKDAAYVSVGLGVIAFQRLQVRRNELTKALNEQAGLAQGALDTVGALVGDRVKLVEERLGAVLDRSSR |
| Ga0190272_114622051 | 3300018429 | Soil | VLRMARSPIDDVANAVKDAAYVSVGLGVIAFQRLQVRRNELSKALSGQTDDAKGALGVVGALVGDRLKLVEERIEAAR |
| Ga0190272_116694022 | 3300018429 | Soil | MARTPVDDLANAIKEATYVSVGLGVIAFQRLQVRRNELAKAISGPAEEARGTLEVLGAVVGERVKLVEERITAVLNR |
| Ga0190272_120072632 | 3300018429 | Soil | MARLQIDDVTNVVKDAAYVSVGLGVIAFQRLQVRRNELTKLLDQRTGEAKGALEVVGSLVGDRVKVVEERLGAAFDRGR |
| Ga0190269_104078492 | 3300018465 | Soil | MARLQLDDVTNVVKDAAFVSVGLGVIAFQRLQVRRNELEKTLTARAEEARGTLEVVGGLVGDRLKVVEERLRG |
| Ga0190269_109200252 | 3300018465 | Soil | VKDAAYVSVGLGVIAFQRLQVRRNELTKALSGPTEEARGTLEVLGTLVGERLKLVEERVSATLNR |
| Ga0190269_109621142 | 3300018465 | Soil | VPRSPIDDVAHAVKDAAYVSVGLGVIAFQRLQVRRNELSKALSGQADASGPLGLVTTLVGERLKMVEERVGAALDRK |
| Ga0190269_111392501 | 3300018465 | Soil | VARTPVEDLVNAVKDATYVSVGLGVIAFQRLQVRRNELAKAMNGPVEEARGTLEVVGALVGERIKLVEERVSDAIKR |
| Ga0190269_113115391 | 3300018465 | Soil | PGGTSTMARLQLDEITNVAKEAAYVSVGLGVIAFQRLQVRRNELEKAVGSLVGDRVKLVEERLGAVLPRPGR |
| Ga0190268_123161991 | 3300018466 | Soil | MARTPVDDLANAIKDATYVSVGLGVIAFQRLQVRRNELAKAISGPAEEARGTLEVLGAVVGERVKLLEERITATLKR |
| Ga0190270_101511643 | 3300018469 | Soil | MARSPIDDVADAAKDAAYVSLGLGVLAFQRLQVRRNELHKALSGQADEAKGALDLVSALVGDRVKVLEERLGAALEHR |
| Ga0190270_107252591 | 3300018469 | Soil | MARSPIDDVATVAQDAAYVSLGLGVLAFQRLQVRRHELHKALAAQGGEARGALELVETLVAERLKMLEERVGAALGSR |
| Ga0190270_107695801 | 3300018469 | Soil | MAQNPIDDLSAVVKDAAFVTVGLGVIAFQRLQVRRNELGKALSEQADGAKGALEVVGALVGERLKVVE |
| Ga0190270_110733141 | 3300018469 | Soil | PSGRRSTHEGVSRTMAQNPLDDVTAAMKDAVYVSVGLSVIAFQRLQVRRNELNKAISAQAEEARGALGVVSALVGERLKAVEERVGATFERQR |
| Ga0190270_128936531 | 3300018469 | Soil | MAQNPIDDLTAAAKDAVFVTVGLSVIAFQRIQVRRNELSKAITAQAEEARGALDVVGALVGERLKAVEERVGATFDRGR |
| Ga0190270_133858891 | 3300018469 | Soil | ALYGADMAQNPIDDLTAVVKEATFVTVGLGVIAFQRLQVRRNELSKALSEQAEGAKGALEVVGALVGERLKVVEERVGAVLDRG |
| Ga0190274_111340332 | 3300018476 | Soil | MAQHPLDDVTAAIKDAAIVSVGLGVIAFQRAQVRRNELEKAITAQAEEARGAIGVVSALVEERLKAVEERVGATFDRGR |
| Ga0190274_116502772 | 3300018476 | Soil | MAGSPIDDVASAAKDAAYVSVGLGVIAFQRLQVRRNELHKALSGQAGDAKGALGDRLKMVEERLGAALEHR |
| Ga0190274_124599782 | 3300018476 | Soil | MARLQLDEVTDAVKDAAYVSVGLGVIVFQRLQVRRNELAKALEGHAGEAMEAFEVVGALVNERLKLVEERVSATIDITRR |
| Ga0190274_132703462 | 3300018476 | Soil | MAQNPLDDVASAIKDAAIVSVGLGVIAFQRVQVRRVELEKAITAQAEGARGAIGVVSALVEERLKAVEERVGATFDRGR |
| Ga0190271_111282351 | 3300018481 | Soil | VARLQLDDLTNAAKDAAYVSVGLSLLAFQRVQVRRNELNKALQERSGEAREALDVVNALVTDRVKMVEERLGAVLDITRR |
| Ga0190271_114267761 | 3300018481 | Soil | MARTSLDDVVKAAQDAAFVSVGLSVIAFQRLQVRRHELNKALNEQAEGARGALEVVGSLVGERLKVVEERVGAVLERR |
| Ga0190271_116360542 | 3300018481 | Soil | LPSSLEDVTAVVKDAALVTVGLGVIAYQRFQVRRVELTKAMSTQGEEARGALDAVGALLGERLKAVEERVGATFERER |
| Ga0173482_100621582 | 3300019361 | Soil | MAQNPLDDVTAAVKDAALVTVGLGVIAFQRAQVRRNELEKAISAQAEEARGALGLVSELLGERLKAVEERVGATFDRGR |
| Ga0190264_103881122 | 3300019377 | Soil | MARTPVDDLANAIKDATYVSVGLGVIAFQRLQVRRNELAKAISGPAEEARGTLEVLGAVVGERVKLVEERIT |
| Ga0210384_115820161 | 3300021432 | Soil | MARLQLDDVANLAKDAAYVSVGLGVIAFQRLQVRRNELTKAFNDQSGQALGALEAVGTLVGDRVKVVEERLGAVLDRGGR |
| Ga0224500_102360311 | 3300022213 | Sediment | MARRQIEEVADAVKDAAYVSVGLGVLALQRVQVRRSELTKALGGHSEDAKGAVEAISSVVSERVKLVEERLSAALDITRR |
| Ga0209797_100274602 | 3300027831 | Wetland Sediment | MTHSPLDDITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR |
| Ga0209683_102172291 | 3300027840 | Wetland Sediment | DDITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR |
| Ga0209798_100522143 | 3300027843 | Wetland Sediment | DITNVVKDAAYVTVGLGVIAFQRLQVRRHELSQALAGGGDAATGALDLVNTVVGERVKAVEERFIAVLGR |
| Ga0209591_100072749 | 3300027850 | Freshwater | MARLQIDEVADVVKDAAYVSVGLGVIAFQRMQVRRNELAKALKGRTDDAKGTLDVVGSLVNERLKLVEERVSATLDITRR |
| Ga0272482_101274031 | 3300028578 | Soil | MARLQIDDVTNAVKDAAYVSVGLGVIAFQRLQVRRNELTKLVGGVGELVGDRVKVVEERLGATFDRGR |
| Ga0247823_104768892 | 3300028590 | Soil | AYVSVGLGLIAFQRLQVRRQELHKALNGQADDAKGALDAVGTLVGDRLKMLEERLGAVLEHR |
| Ga0307322_100831392 | 3300028710 | Soil | MAQHPLDDVTAAIKDAAIVSVGLGVIAFQRAQVRRNELEKAITAQAEGARGAIGLVSALVEERLKAVEERVGATFDRGR |
| Ga0307503_101616782 | 3300028802 | Soil | MAQNPLDDVTAAVKDAALVTVGLSVIAFQRVQVRRNELEKAITAQAEEARGALGVVSALVGERLKAVEERVGATFDRGR |
| Ga0307503_102114731 | 3300028802 | Soil | MAQNPLDDVAAAIKDAAIVSVGLGVIAFQRAQVRRVELEKAITAQAEGARGAIGVVSALVEERLKAVEERVGATFDRGH |
| Ga0307294_102166161 | 3300028810 | Soil | MAQNPLDDLTSAAKDAVFVTVGLSVIAFQRIQVRRNELTKAITTQAEEARGALDVVSSLVGERLKSVEERVGATFDRSR |
| Ga0247825_113611181 | 3300028812 | Soil | DVASAAQDAAYVSLGLGVIAFQRLQVRRHELHKALAAQSGEARGALDLVETLVGERLKMLEERLGAALGSR |
| Ga0247825_113875851 | 3300028812 | Soil | MAQNPIEDVTAAVKDAMFVTVGLGVIAFQRIQVRRNELSRAIASQAEEARGALDVVGELVGDRLKAVEERVGATLDRSR |
| Ga0299907_105071172 | 3300030006 | Soil | MARTPVDDLANTIKDATYVSVGLGVIAFQRLQVRRNELAKAISGPAEEARGTLEVLGAVVSERLKLVEERIGAVLNR |
| Ga0247826_111848612 | 3300030336 | Soil | MAQNPIDDLTAAAKDAVFVTVGLSVIAFQRIQVRRNELSKAITAQAEEARGALDVVGALVGERLKAVEERVGATFDRDR |
| Ga0268386_104445322 | 3300030619 | Soil | MARSPIDDVAGVVKDAAYVSVGLGVIAFQRLQVRRNELSKVLSGQGGDVSGPLGLVGSLVGERLKLVEERLGAVLERR |
| Ga0307498_104869922 | 3300031170 | Soil | MAQNPLDDVTAAVKDAALVTVGLGVIAFQRVQVRRNELEKAITAQAEEARGALGVVSALVGERLKAVEERVGATFDRGQ |
| Ga0299914_101386233 | 3300031228 | Soil | MAGSPLEDVGNAVKDAAYVSIGLGVIAFQRLQVRRNELTKSLGSSADEARGTLDVVGNLVGERVKLVEERMAAAREQLTGRIVGGR |
| Ga0307505_102340801 | 3300031455 | Soil | MARLQVEDVASVVKDAAYVSVGLGVIAFQRLQVRRNELSKTLTDQASLAQGALETVGALVGDRVKLVEERLGAVLDRSGR |
| Ga0247727_1000015651 | 3300031576 | Biofilm | MARSPIDDVADAVKDAAYVSVGLGVIAFQRLQVRRHELHRSLSSQAGAATGPLEVIGTLIADRVKVVEDRLGAALEHR |
| Ga0310906_112952212 | 3300032013 | Soil | MAQNPIDDVTAAVKDAVFVTVGLSVIALQRLAVRRNELSKAIGAQAQEARGALDLVGSAVGERLKAVEERVGATFDRSR |
| Ga0310906_113331931 | 3300032013 | Soil | LYGARMAQNPIDEVTAAVKDAVFVSVGLSVIAFQRIQVRRHELTQAISAQAEEARGALDVVGALVGERLKAVEERVGATLDRSR |
| Ga0315271_101701242 | 3300032256 | Sediment | MSRPQIEEVATIVQDVAYVSVGLGVLAFQRLQVRRHEITKNLEGHSQEARGTLDLVGSLVSERLKLVEERLSAALDITRR |
| Ga0335394_105221331 | 3300032456 | Freshwater | MARLQIDEVADVVKDAAYVSVGLGVIAFQRMQVRRNELAKALKGRTDDAKGTLDVVGSLVNERLKLVEERVSSAVDITRR |
| Ga0214472_103458322 | 3300033407 | Soil | MARTPVEDLAEVVKDAAYVSVGLGLIAFQRLQVRRNELTKALSTPVEETKGTLELIGALVGERVKLVEERVNAAVRR |
| Ga0214471_100737601 | 3300033417 | Soil | LMARSSIDEVADVVKDAAYVSVGLGVIAFQRLQVRRNELSKALRGQGGDPWGPLGAVTSLVGERLKVVEERLGAALERK |
| Ga0364935_0085971_2_196 | 3300034151 | Sediment | DAAYVSIGLGVIAFQRLQVRRQELHKALSSQTGDAMGAVDLVGTLVGDRLKMLEERLGAALEHR |
| ⦗Top⦘ |