| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300004105 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0097057 | Ga0065182 |
| Sample Name | Groundwater microbial communities from aquifer in Utah, USA - Crystal Geyser 4/10/14 0.2 um filter (version 2) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 445606306 |
| Sequencing Scaffolds | 13 |
| Novel Protein Genes | 15 |
| Associated Families | 14 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria | 3 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → unclassified Bacteroidales → Bacteroidales bacterium | 1 |
| All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 1 |
| Not Available | 6 |
| All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Utah | |||||||
| Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F022890 | Metagenome / Metatranscriptome | 212 | Y |
| F029019 | Metagenome | 189 | Y |
| F040819 | Metagenome | 161 | Y |
| F043249 | Metagenome / Metatranscriptome | 156 | Y |
| F069687 | Metagenome | 123 | Y |
| F071959 | Metagenome / Metatranscriptome | 121 | Y |
| F075685 | Metagenome | 118 | Y |
| F076868 | Metagenome | 117 | Y |
| F094905 | Metagenome | 105 | N |
| F094906 | Metagenome | 105 | N |
| F095532 | Metagenome / Metatranscriptome | 105 | N |
| F096626 | Metagenome / Metatranscriptome | 104 | N |
| F102531 | Metagenome | 101 | N |
| F104462 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0065182_1004734 | All Organisms → cellular organisms → Bacteria | 5418 | Open in IMG/M |
| Ga0065182_1025375 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → unclassified Bacteroidales → Bacteroidales bacterium | 2114 | Open in IMG/M |
| Ga0065182_1032357 | All Organisms → cellular organisms → Bacteria | 1827 | Open in IMG/M |
| Ga0065182_1042191 | All Organisms → cellular organisms → Bacteria | 1553 | Open in IMG/M |
| Ga0065182_1086339 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 979 | Open in IMG/M |
| Ga0065182_1107348 | Not Available | 844 | Open in IMG/M |
| Ga0065182_1112611 | Not Available | 816 | Open in IMG/M |
| Ga0065182_1144421 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 684 | Open in IMG/M |
| Ga0065182_1162616 | Not Available | 627 | Open in IMG/M |
| Ga0065182_1165001 | Not Available | 621 | Open in IMG/M |
| Ga0065182_1188029 | Not Available | 565 | Open in IMG/M |
| Ga0065182_1199763 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 540 | Open in IMG/M |
| Ga0065182_1207147 | Not Available | 526 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0065182_1004734 | Ga0065182_10047346 | F076868 | LGILLAACIVGATFFLNGQRPSGEIKKDVSDNESALTANEPEVEQKSSATATTMSAGKVIIKAYLNLGGCGEEVINLLDGLVKEYQGRVSLGYIDFGTREGYNRMIEDGLNCQGLVINGKQTYAIIDKNGAQKEVTFSHPLNSQYTADDLKTVVKMLLGE* |
| Ga0065182_1025375 | Ga0065182_10253754 | F022890 | LELRPNLLYTLLGAVIFKIMTTKKLVNLMMKSSDHNPYTGKLSESANVLAAIELAKKFKEFADEVYTDEAMGIESSQWVEVINKLEEKRLNCA* |
| Ga0065182_1032357 | Ga0065182_10323573 | F071959 | MASRNYLVMSNDMTLSDKKEYRLNALSAGLERCGLRGIGDIKADIPGLAGIPDANKVARVKLIHNYLITGQWPRSIDQRELTTGTDLVVAPAVDSWLTAPMAAVGNIVSCFQGVAAPQLVQGKLMVCYAVSVESSAVPMPVSRLIFRRGAAGNVQAQFDMEPMGIRWEVDAFFSEVVVIDPQDVFAIQVRCRNATAV |
| Ga0065182_1042191 | Ga0065182_10421911 | F040819 | MTKAQEIYEKVEALVATGVPKADAFRQVAEEFGQPFNSMRGAYYAHSRTITGGSSRPRRRQTTTADAVESAAQLLRRALESIDDEVLAAKARAEEAKAEYEALRDSVKE |
| Ga0065182_1066065 | Ga0065182_10660654 | F094905 | MNALYGYIAVFVVIVFLGAGWAVEHDKRITYQAKVEQAGADALAQTEKINAKHREEMQNAEHNTIIATNSIADWYRAHPVVRVQNNGCSTMPGT |
| Ga0065182_1086339 | Ga0065182_10863391 | F029019 | MKNTEICLSDCFKSEVSYRKILRQIDLLMYQIFEVKKTTNIFWDIRLKKSELARWISKGDSMWWKTAADDISERWKEIDKIKDEIDWTIGNMFASVEREEKQRMKKTEKDKH* |
| Ga0065182_1107348 | Ga0065182_11073481 | F069687 | MEELAKKIESILSQFITEELGNRLSQFALISLKEMILNEIKSYEPKIKEK* |
| Ga0065182_1112611 | Ga0065182_11126111 | F102531 | SILNLPNADNSAGWTLGTYGGQTIPIQPQKTIAQINQEIDNLVARGVDELEAIRQVGSISIPNYATTPEQIAALRLADQARTADEILQCPYTWCRHNSAISDAILESRDYQPYRAAMTMQTTEGLSAGGHQTSAIIINGEPVFIDLTNNLIITGQQALEQVLINSEKQLTALEMIRLTTNNVWDVINLIPK* |
| Ga0065182_1144421 | Ga0065182_11444211 | F104462 | MAQNFKTLYIGNNTVHTLFEAPTTLTRVFFGIYIVAKGDPSFQVNVSFDNSQFLNPICLTYCRGYYEFTGPDISQGSIFVKTVNFAQGDICATEILK* |
| Ga0065182_1162616 | Ga0065182_11626161 | F095532 | MNRQNVIVVIAPTLEVWGNFKNLCEAKGFDILPYHSLKSKPFPIIHKDWIIHKVPFL* |
| Ga0065182_1165001 | Ga0065182_11650011 | F102531 | QTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK* |
| Ga0065182_1188029 | Ga0065182_11880292 | F075685 | MSNTTSTLYDPVTHPLAAAYAAASTAECRAAECRAARRQWVASDEERGYRLTVYRAVGLQHCDPVPRRRESPLRCRPSELWHRVNLWARAFAGEQEPQTEPYWLVCHAYPVHPRTGRSDH |
| Ga0065182_1199763 | Ga0065182_11997631 | F043249 | MQSRSQLFVRQGADVAISNLTTGGFFPSDQTFVTLAVRVWTYFRFNTESPRDNGVAPPGNATVPIGSIPGLAADRTMRVHKLYHQAENQLFWQFIAGDKPQLTTFTAYTPAAGGLDGFFADPLLPRANNGTPTSAALMRLARPILVPPRQGFQVVAIASPIGQQQGASIVEQMNGMVRNI |
| Ga0065182_1207147 | Ga0065182_12071472 | F096626 | ALLASSATKRMTSAMLLGMWRNSSKYVLDVAPKLAAMTGSQLLRYQANIERHEKRLAALCSRNPELSLSSADLDKIMVNLADEDADTAFGAYLADRTEEVRGKLTEDSEAL* |
| Ga0065182_1221248 | Ga0065182_12212481 | F094906 | AVSNGDSKVYDVYMNVSSGQYRLVWKYFVANLPPGSQWTALYQKNGGGWYSWGSPTISYGYYYTAYTGSGWQTGDQLDVDFYNPSDRSETNIQDSILYSNQCF* |
| ⦗Top⦘ |