| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300004196 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0110937 | Ga0066641 |
| Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG12_big_fil_rev_8/21/14_0.65 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 1082144179 |
| Sequencing Scaffolds | 29 |
| Novel Protein Genes | 33 |
| Associated Families | 32 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
| Not Available | 11 |
| All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → Candidatus Velamenicoccus → Candidatus Velamenicoccus archaeovorus | 1 |
| All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 4 |
| All Organisms → cellular organisms → Bacteria | 3 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → unclassified Comamonadaceae → Comamonadaceae bacterium CG_4_9_14_0_8_um_filter_57_21 | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 3 |
| All Organisms → cellular organisms → Bacteria → Spirochaetes → unclassified Spirochaetota → Spirochaetota bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Syntrophobacterales → Syntrophaceae → Desulfobacca → Desulfobacca acetoxidans | 1 |
| All Organisms → Viruses → environmental samples → uncultured archaeal virus | 1 |
| All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Lokiarchaeota → unclassified Lokiarchaeota → Candidatus Lokiarchaeota archaeon | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Utah: Grand County | |||||||
| Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001003 | Metagenome / Metatranscriptome | 808 | Y |
| F002857 | Metagenome / Metatranscriptome | 525 | Y |
| F003235 | Metagenome / Metatranscriptome | 498 | Y |
| F008219 | Metagenome | 337 | Y |
| F014357 | Metagenome / Metatranscriptome | 263 | Y |
| F014684 | Metagenome / Metatranscriptome | 261 | Y |
| F015419 | Metagenome / Metatranscriptome | 255 | Y |
| F016112 | Metagenome / Metatranscriptome | 249 | N |
| F016358 | Metagenome | 247 | Y |
| F016496 | Metagenome / Metatranscriptome | 246 | N |
| F021710 | Metagenome | 217 | Y |
| F029929 | Metagenome | 187 | Y |
| F034062 | Metagenome | 175 | Y |
| F039194 | Metagenome | 164 | Y |
| F042607 | Metagenome | 158 | Y |
| F043249 | Metagenome / Metatranscriptome | 156 | Y |
| F048100 | Metagenome | 148 | Y |
| F049070 | Metagenome / Metatranscriptome | 147 | Y |
| F053063 | Metagenome | 141 | Y |
| F059646 | Metagenome | 133 | Y |
| F068439 | Metagenome / Metatranscriptome | 124 | N |
| F076867 | Metagenome | 117 | N |
| F076874 | Metagenome | 117 | N |
| F079376 | Metagenome / Metatranscriptome | 116 | Y |
| F086591 | Metagenome | 110 | Y |
| F088269 | Metagenome / Metatranscriptome | 109 | N |
| F088316 | Metagenome / Metatranscriptome | 109 | N |
| F091390 | Metagenome | 107 | Y |
| F096722 | Metagenome | 104 | Y |
| F100367 | Metagenome | 102 | N |
| F100376 | Metagenome / Metatranscriptome | 102 | Y |
| F102531 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0066641_10019496 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 3037 | Open in IMG/M |
| Ga0066641_10080325 | Not Available | 1463 | Open in IMG/M |
| Ga0066641_10096805 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → Candidatus Velamenicoccus → Candidatus Velamenicoccus archaeovorus | 1322 | Open in IMG/M |
| Ga0066641_10110775 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 1227 | Open in IMG/M |
| Ga0066641_10139064 | Not Available | 1081 | Open in IMG/M |
| Ga0066641_10214021 | Not Available | 842 | Open in IMG/M |
| Ga0066641_10216414 | All Organisms → cellular organisms → Bacteria | 836 | Open in IMG/M |
| Ga0066641_10216970 | Not Available | 835 | Open in IMG/M |
| Ga0066641_10221886 | All Organisms → cellular organisms → Bacteria | 824 | Open in IMG/M |
| Ga0066641_10235676 | All Organisms → cellular organisms → Bacteria | 795 | Open in IMG/M |
| Ga0066641_10248157 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 771 | Open in IMG/M |
| Ga0066641_10262564 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → unclassified Comamonadaceae → Comamonadaceae bacterium CG_4_9_14_0_8_um_filter_57_21 | 745 | Open in IMG/M |
| Ga0066641_10280512 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 716 | Open in IMG/M |
| Ga0066641_10288627 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 703 | Open in IMG/M |
| Ga0066641_10309972 | Not Available | 674 | Open in IMG/M |
| Ga0066641_10344590 | All Organisms → cellular organisms → Bacteria → Spirochaetes → unclassified Spirochaetota → Spirochaetota bacterium | 631 | Open in IMG/M |
| Ga0066641_10386351 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 588 | Open in IMG/M |
| Ga0066641_10411944 | Not Available | 565 | Open in IMG/M |
| Ga0066641_10419862 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 558 | Open in IMG/M |
| Ga0066641_10419927 | Not Available | 558 | Open in IMG/M |
| Ga0066641_10427345 | Not Available | 552 | Open in IMG/M |
| Ga0066641_10427764 | Not Available | 551 | Open in IMG/M |
| Ga0066641_10429454 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Syntrophobacterales → Syntrophaceae → Desulfobacca → Desulfobacca acetoxidans | 550 | Open in IMG/M |
| Ga0066641_10455155 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 530 | Open in IMG/M |
| Ga0066641_10462894 | Not Available | 525 | Open in IMG/M |
| Ga0066641_10483533 | Not Available | 510 | Open in IMG/M |
| Ga0066641_10483900 | All Organisms → Viruses → environmental samples → uncultured archaeal virus | 510 | Open in IMG/M |
| Ga0066641_10485262 | All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Lokiarchaeota → unclassified Lokiarchaeota → Candidatus Lokiarchaeota archaeon | 509 | Open in IMG/M |
| Ga0066641_10490820 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 505 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0066641_10019496 | Ga0066641_100194963 | F043249 | MPKIVGTRERVHQPFYDSLIRVDGSGDLRQGNVGVFGAVQSRSQLFVRQGADVAVSNLTTGGFFPSDQTFVTLAVRVWTYFRFNVESQRTDAQNTTGPVASLAGVTADRIQRVHKLYHQAENQLFWQFIAGDKPQLTTFTAYTPAAGGLDGFFSDTRLPRANNGVPTSAALMRLARPILVPPRQGFQVVAIASPIGQAQGASIIEQLNGAVPNNDPWGASSTGGTVTGTVGTNGLTTTGRDDIEKDIKYLIDGIHSRDVL* |
| Ga0066641_10080325 | Ga0066641_100803252 | F008219 | MPKYLVEIILVVNAKDSDEARKIADYVIDIPISDSKVENTIETLRCEEIVEITNPKLR* |
| Ga0066641_10096805 | Ga0066641_100968051 | F076867 | MDEKNQNLVELAKKKRYIALVEKLGRGSLGPKELKELEEFEKVEQAKETGVIDGAVDLGTISIYLEKSSRMVRRYVSQGMPVIRDSSGELSRFKVNDVFKWVYGNKGKDTEDKDYWENEYRKNRAKLSELELKQKEGELIPFADHVSVVKNQIRGIRAGLLRLPKHVAPKLYQQDPKLICEMLDQEIRYIINQFAGVKSNDKANKRGA* |
| Ga0066641_10110775 | Ga0066641_101107752 | F016358 | MTEIKNIAQANKSWAKQSPAIKYVILKRSNSKSDIFVLEE* |
| Ga0066641_10139064 | Ga0066641_101390641 | F039194 | IFKTQVGMFDPVALQNHNVCIHYVQKSYYRKVDFLEGIPAFQCIDITAGAGLAALMVTGRVNVTNLEMADNEFGLWRWYPIDDAQVRLYHPAGIAKYQLRNLQVPVDMNIVLRDPNLVSTEIAVWQNNRPGVEAINGHAFALGAVRLIAIGYRFHSVDLESGKDADPMLVKAIKEGKAPCTDIWCSGRGTGD* |
| Ga0066641_10214021 | Ga0066641_102140211 | F096722 | VNNKEKAYIGPVEQEINSLMETAGNEIMDLTQEGPDRNITEKIQNIYITNLSKCIIYLAKQFDKLAGSIK* |
| Ga0066641_10216414 | Ga0066641_102164143 | F021710 | FFVRAIRRGKSNPSLTEFRLCKEFGWTPQQLRKTPAKTIQEFIVILNEVDRQTQEEMDKAKREAKLR* |
| Ga0066641_10216970 | Ga0066641_102169701 | F016496 | MEDKKKEDKYFFKKGNKYQQKGSAGLRYGGTTPVAKDKEKNLERLRCGWKPKTLICKDITISDDEEAFAVEFSKKVSEENDTYVDTFLIELAIAQILQVHRVYVYAKEKKISRDASRMIGTVLSTLREMNATKNARKEDNIKVTVNSDIMTLIQQNLNLISNDESKKGNNIDKK* |
| Ga0066641_10216970 | Ga0066641_102169702 | F088269 | IYRKNNGVYELFPIQKLDILSNEITFQTDIGENYDMIVEYFEDNFHFSIDTLSDASSLFAAAMCMRSLPLTSQTNNDAKEYERRAKEIITRSSASFV* |
| Ga0066641_10221886 | Ga0066641_102218862 | F079376 | MFDIVITRQPGRPCVSLEVTRVCRTTEEGERQMRHRESDSFVVPEKAGNAAGGKEATHVSAL* |
| Ga0066641_10235676 | Ga0066641_102356761 | F059646 | VFHLKLESITVVILVICIFLSFYFTFLSFQTIDDTLKKQLVTLAASSLITGVIMLACITISLGIKKAFPRVDSKLRSRKEPSEHEG* |
| Ga0066641_10248157 | Ga0066641_102481571 | F049070 | MSTATLSEPMKERWRTAGRFADVLEAEVKSRQHPFVARVVSWFNLCRVCQDLEEQLLLAQKPSAEDKQLHAALLSTAIASGEGLLLEGADAETLRPLKLTPQALQAKLVSLRITYEQWHTELNPQ |
| Ga0066641_10262564 | Ga0066641_102625642 | F100376 | MKLAPELRFLCNKIATERTRPALWKTAAELRKDCTRVLASGQTLADAQTWLESLLADALAAQSKS* |
| Ga0066641_10278574 | Ga0066641_102785742 | F100367 | MSSCIFSSFYGSDLNRDVMLSLVWSVFLTEVDEDVRFERDSEWLSRHYRDLQVDYPNKYVAVLDERVVGVGDDGEKLYVEVVGKHGRDPVIDFIKDTKTIQVGRLRARVLV* |
| Ga0066641_10280512 | Ga0066641_102805122 | F029929 | MVLLGLPIFQQSVDAALYEWLMRELKRRGARTVQDVIRQILREAKAEAEKHG* |
| Ga0066641_10288627 | Ga0066641_102886272 | F086591 | MSLKKHLAKRYIPSSTKWKLVLKGVGKLYRAVYDELEKDEKGSSAKALADASYRVGLDFGETLKKDFGLGNSIEDIAFAMEIDHKIFGVKAVVAEKDERKIVYHCYQCAWRKYFTPKLCIAIGQAEKGIAHALNSEAKYN |
| Ga0066641_10309972 | Ga0066641_103099721 | F048100 | DNMKEKSDDIDRMNLEFYGMVRKTYGTQTIFEQHRTTIKIDTFRRDIGKRWAENMRKREEIYEIIDMLYELAKIETENPDAERSKQMSWAEVRKKIQEKMVKTLSR* |
| Ga0066641_10344590 | Ga0066641_103445902 | F015419 | MKYEFKITDEAGNVFLYDVSRSSSDEEKSLNDFILEALHISEDKRKLPFKIQCPNNLEVYPLLKMKFENYGSPLLGDKLEAMHVTWRD* |
| Ga0066641_10386351 | Ga0066641_103863511 | F001003 | MILLIQMKKITMEKFKKFKIENKDFVLPQNLSAKEKDIFMYIFRLLRQKVNKGIYPELFNNEIIYSKSDMKNLISKGIIIFMQHKKGWVITINPTFSTKN |
| Ga0066641_10411944 | Ga0066641_104119441 | F002857 | MENKKRIWGTRETDEAYDEMIRTEKEKQNGNVFKDDDKKEMKASMKFLSNKNFIERISLRDGMKRKFKLVKDEATEIIGFDGNAKQGITYTVMENGNIRSFFTTSLKCISMLSRLQKDDVFEIELRTKKVGDTLISYHVVKKL* |
| Ga0066641_10419862 | Ga0066641_104198621 | F003235 | IIYSKSDVKNLISKGIIIFMSYKKGWVLTINPTYVTKNVECSWCGAKFNEKIYFRMKRMRCPSCMSGMNGSTIAEKFETDDIIFPTKVENKTQIKKITNDIEIVRTGIQKIIQQSKEKIIPFRSSKADRLNAISELNRLAEKKKEERILTIQKKILANFSDTCVNFVKEYFCIYDNKIFSAMKYM |
| Ga0066641_10419927 | Ga0066641_104199272 | F091390 | MKSTESTTIRINKSTKGLLDNLDFVRKDTYDDIIIKLMDYYKRKKG* |
| Ga0066641_10427345 | Ga0066641_104273451 | F068439 | RVFFYTIENECFFEYYSYDNIKRYNKERTDFLSLLTDENKKKVYDFERKMHEKYPEYLYKMKNKKVTYPEYKNIKEYRISENYTVMFFTIGQHAIFRYFEDDGLIVYNEVDRSCVLHWLSGEETERLFNFEDEIKEEYSKYIF* |
| Ga0066641_10427764 | Ga0066641_104277642 | F042607 | IQLYNKIVASYTNLTNKGEDLFIDLFETDDNGTIIFLKPPSKRQTSFEVFLFLMAVMQQQHIRLMYKQVEDLCAQVNEKLKDK* |
| Ga0066641_10429454 | Ga0066641_104294542 | F088316 | MTLYRVREGYVVHLGNRQTLSPGTIFEPDQKVLESQGWKVEPVKEVEPVQEEPKPVTRDVEAPPQDRMIKKPPVKK* |
| Ga0066641_10452151 | Ga0066641_104521512 | F076874 | AGNETYIGKSISGDDGRLEISKTALGGKGAKIVVRNAKRLSGEDIVQTSDGIYSTFIVSSDETVQTVEIVFQIKKKHNIRINVTWK* |
| Ga0066641_10455155 | Ga0066641_104551552 | F001003 | MEKYKKFQTEYKKDFLLPCNLSQKEKDIFIYIFRLLRKKINTGIYPEVRNDEIIYSKSDMKSLISKGIILFQPYKRGWIITIHPTHITKNVECSWCGTK |
| Ga0066641_10462894 | Ga0066641_104628941 | F016112 | KMWGSIEKVRNICGITKEQINDVVIHDILEQTDRKVRDRVYIFRRLTMQMNKIGTNKIRFDINKIGDGNRDTSIDYTDISIYRKNNGVYELFPIQKLDVLSNEITFQTDIGENYEMIVEYFEDNFHFSIDTLSDASSLFAAAICMRSLPLTETENNAKEYERRAKEIITRSSAS |
| Ga0066641_10473620 | Ga0066641_104736201 | F102531 | KLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK* |
| Ga0066641_10483533 | Ga0066641_104835331 | F053063 | MSKDKILARIELLKKQNDEDNTKLISLNQVRETVIQGMLVRNGRI |
| Ga0066641_10483900 | Ga0066641_104839001 | F014357 | ADVNTATDTQITKKNNPSEFCDFPFRGGVATKRKIDLYGIVFSSRGADDGITAANYIRTNYIKLMKGRDVLFDSMRMGLFAAGRTVQTNNTFEAENGFDVGGEITDLYQKDPLIFDQPIRFANGEELDTYWTTEVATPPGTFLASELEIAYILKETQE* |
| Ga0066641_10485262 | Ga0066641_104852621 | F014684 | MKITWHRYYYRIIGFALAGAGGGLLLDELIHGPFTLTPANHEFWGLAALVVGSILISKKPHGKD* |
| Ga0066641_10490820 | Ga0066641_104908201 | F034062 | FKKFSAPKVKIELKLDEVAYDYKDKLTGRIVLDPQEEISVNEFRLEFGGNKKIKWKKGFSSLSSSSSLDTRKIPIGGPVKLQKGQHYEQPFQIDIPLYSKPDPFTENEVKVKGVAAVQGRPDLTHEVKPVINFPFVIECLRQYGGCGFITQPLSEPVKACLKCGNNLE |
| ⦗Top⦘ |