Basic Information | |
---|---|
IMG/M Taxon OID | 3300004185 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0110938 | Ga0066642 |
Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG13_big_fil_rev_8/21/14_2.50 |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 601162568 |
Sequencing Scaffolds | 18 |
Novel Protein Genes | 22 |
Associated Families | 21 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea | 1 |
Not Available | 12 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 2 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes communis | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Utah: Grand County | |||||||
Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000212 | Metagenome / Metatranscriptome | 1580 | Y |
F000320 | Metagenome / Metatranscriptome | 1306 | Y |
F000449 | Metagenome / Metatranscriptome | 1126 | Y |
F001003 | Metagenome / Metatranscriptome | 808 | Y |
F002777 | Metagenome | 530 | Y |
F003622 | Metagenome / Metatranscriptome | 476 | Y |
F016112 | Metagenome / Metatranscriptome | 249 | N |
F016496 | Metagenome / Metatranscriptome | 246 | N |
F020302 | Metagenome | 224 | Y |
F026801 | Metagenome | 196 | Y |
F028419 | Metagenome / Metatranscriptome | 191 | Y |
F039475 | Metagenome / Metatranscriptome | 163 | N |
F048100 | Metagenome | 148 | Y |
F050470 | Metagenome / Metatranscriptome | 145 | Y |
F055778 | Metagenome / Metatranscriptome | 138 | Y |
F057106 | Metagenome / Metatranscriptome | 136 | Y |
F057887 | Metagenome / Metatranscriptome | 135 | N |
F065251 | Metagenome / Metatranscriptome | 128 | Y |
F078218 | Metagenome | 116 | N |
F085132 | Metagenome | 111 | Y |
F085143 | Metagenome | 111 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0066642_10068339 | All Organisms → cellular organisms → Archaea | 1166 | Open in IMG/M |
Ga0066642_10090271 | Not Available | 1004 | Open in IMG/M |
Ga0066642_10101719 | Not Available | 939 | Open in IMG/M |
Ga0066642_10132215 | Not Available | 811 | Open in IMG/M |
Ga0066642_10134734 | Not Available | 803 | Open in IMG/M |
Ga0066642_10143640 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 774 | Open in IMG/M |
Ga0066642_10161719 | All Organisms → cellular organisms → Bacteria | 723 | Open in IMG/M |
Ga0066642_10175907 | Not Available | 689 | Open in IMG/M |
Ga0066642_10201521 | Not Available | 636 | Open in IMG/M |
Ga0066642_10237976 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 576 | Open in IMG/M |
Ga0066642_10242715 | Not Available | 569 | Open in IMG/M |
Ga0066642_10257537 | Not Available | 549 | Open in IMG/M |
Ga0066642_10263640 | Not Available | 541 | Open in IMG/M |
Ga0066642_10271285 | Not Available | 531 | Open in IMG/M |
Ga0066642_10274939 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes communis | 527 | Open in IMG/M |
Ga0066642_10277352 | Not Available | 524 | Open in IMG/M |
Ga0066642_10281650 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 519 | Open in IMG/M |
Ga0066642_10293731 | Not Available | 505 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0066642_10068339 | Ga0066642_100683391 | F065251 | MYLRAFKKGGKKYYYIAKAVRKGLRVIQKSILYIGTADTLYEK |
Ga0066642_10090271 | Ga0066642_100902712 | F020302 | MTNVRSFNKKPRGEPLLTHQVHCNVTPKDYTKAKTLKLSWNQLLNIGINSTDVLRNLRTKDIEIYNLKQRIEKLTARLQFYNNKKNALLQENCKLRCVILDKDVYYAKEEEEFSTFLFDYKFNHHKIYCEPEASGRDEESQPDK* |
Ga0066642_10098029 | Ga0066642_100980291 | F000212 | LRTETKCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPQCEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPHCPKPKCELVCENPNCAPKVECCACAAGGARVSSPMPFFKEAEKDPNCCGCHKPLVI* |
Ga0066642_10101719 | Ga0066642_101017191 | F016496 | MEDKIKGRRHLFEKGNKYQLMGSAGLRYGGTTSIAKNKEKNLERLRCGWKPKTLICKDITISDDEEAFAVEFSKKVSEENDTYVDTVLVELAIAQILQVHRVYVYAKDKKISRDASRMIGTVLSTLREMNATKSARKVDNIKVTVNSDIMTLI |
Ga0066642_10101719 | Ga0066642_101017192 | F016112 | INDIVIHDILEQTDRKVRDMVYIYRRLTMQMNKIGTNKIRFDCNKIGDGNMDTSIDSTDISIYHKNEGVYELFAIEKLDILSNEITFQTDIGENYEMLVEYFEDNFHFLIDSLSDASSLLATAMCVRSLPLTPQKDINAKEYERRAKEIIARSSTSFI* |
Ga0066642_10128075 | Ga0066642_101280751 | F048100 | PQTIFEQHRTTIKIDTFRRDIGKRWAENMRKREEIYEIIDMLYELVKIETENSDAERSKQMSWAEIRQKIQERMKKTLFR* |
Ga0066642_10132215 | Ga0066642_101322151 | F055778 | MLSSVDLPEPELPTIKTNSPCLIENETLSRALTLLSPCP* |
Ga0066642_10134734 | Ga0066642_101347342 | F078218 | MKNTEAVVGVLLTQSDKIKKNILDRALKKIEDMQEIANEIDGMNKELYIFIRKTRCPETIAKQERNTKKINVFLTSIRKKLDKISEGRGEIDEIIDTLYELSKIEAENSDVKEQGQIFWADIRREIQKKIKKPKIK* |
Ga0066642_10143640 | Ga0066642_101436401 | F001003 | MEKYKKFMQEYKKDFLLPHNLSPKEKDIFIYIFRLLRKKINIGIYPEVRNDEIIYSKSDMKSLISKGIILFQPYKRGWIITIHPTHITKNVECSWCGTKFNEKIYFRLKRMR |
Ga0066642_10161719 | Ga0066642_101617192 | F050470 | ARGARNLTGKIEKIEDRDALAQVRRQARRIVFKAFLAAIPLTLIVFVLP* |
Ga0066642_10175907 | Ga0066642_101759071 | F085132 | VKNYKSILIVGAIAVGLYFFLHSKKILPMTLAQKAAWHEKVSRRMAQMSTAEKIRALKAARKEKAAAEGT |
Ga0066642_10201521 | Ga0066642_102015212 | F085143 | MARIDDGFATLIEFAEDSDVQMWEKEVTPPGVSGGGENDTSTMRNTTWRTKAPKGLMSLSEASLVVAYDPAVYNEIIIMLNVNQQITITFADGSTLVFWGWIDEFTPGAAAEGSQPTATVKIIPSNQNGSGVETAPQYSVAP* |
Ga0066642_10237976 | Ga0066642_102379761 | F002777 | MENIKKNTGTNPNISMHTEIISGRIIRKINEIGHKTHALKKKQNAFYAEALEKSRRSTEFPGKDKLREMTDDKFEHEWKEIEEGKDEVDWLIHRLYLSIEGEEEKNK* |
Ga0066642_10242715 | Ga0066642_102427152 | F057106 | MVQTIKTIYLTPPQKTKVAEMPHTLHRIFFSIRVVVGVNSWYPSKISFDDPSFASFYLLAGTERYFEMWGEDIFQGDIWVKNEAT |
Ga0066642_10257537 | Ga0066642_102575371 | F039475 | IYTKDEDYNPLGSIQLEFFQVDNDGNETYIGKSISGDDGRIEVSKTTLGGEGARIVVRNAKRLSGEDIVQTSDGKYSTFIVSNDETIQKVEIVFQIKKKHNISINVTWK* |
Ga0066642_10263640 | Ga0066642_102636401 | F057887 | ISELEVTKDEYEGEEFWKNLLAATIAPNLSFSDIIYDDLKVYRDEEGNEYVFDDLGNLYLVSRPWNPTEPIIFLSIDTEKEYEYNTNTHSFVDKPIVYEAGLGVFKLFSKILKKLKITGSTARVASVLARIKNIRTGKKILYFNRKRGTWTAARDVSLMSLVYFGLSQGGKVATQTGSIA |
Ga0066642_10271285 | Ga0066642_102712851 | F028419 | PIITKLSYLYQNNYMSDKFSLASDPTGKIIITGGYNNMFHIIDCDQKLNTQIVIDENNEKIMNTNVIRKINSKGSCFYKKDDPSLTNINFDKRILHQAYSPVENFANLILLNCIYSYTGALAKKSK* |
Ga0066642_10274939 | Ga0066642_102749391 | F003622 | MYDIMKNYQDELEIIANDLLEQAVGDGFKPNYTNRDFMNAVIIFQTALMDKMYDNQDYDKMDFDDRLNMATSCGQELRKLIHTYTGLDTHKVEEFL* |
Ga0066642_10277352 | Ga0066642_102773521 | F026801 | KTYKLKKDVNDICGLILSKPGATMFVEGHKAELWKEASDDFSHRWKEIAAEKYEVECMVHLMYDLVLREEEKLKGKIVFNAKRRAMSMVK* |
Ga0066642_10281650 | Ga0066642_102816502 | F001003 | MEKFKKFKIENKDFVLPQNLSPKEKDIFIYIFRLLRQKINKGIYPELYNNEIIYFKSDMKNLISKGIILFVRYKKGWVITLNPIHITKNVECPWCGAKFNEKIYFR |
Ga0066642_10293731 | Ga0066642_102937311 | F000320 | MSRSKGGKVEVLCGEILRKIKDIGQKTYKLKKDVNDVCGIILRKPGGTFFFEGRKAELWEEASDDFFKQWKEIGAEKYEVECLVNLMYDLVLQEEVKR |
Ga0066642_10293731 | Ga0066642_102937312 | F000449 | MENKKAGEEDTSSPDIDQEFELHTSAPQEEMERYYARKKASLERKIRKINKKIKKNIYHPYISDDEKYKNVPQ* |
⦗Top⦘ |