Basic Information | |
---|---|
IMG/M Taxon OID | 3300004209 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0110945 | Ga0066649 |
Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG22_combo_CG10-13_8/21/14_all |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 1311645270 |
Sequencing Scaffolds | 30 |
Novel Protein Genes | 39 |
Associated Families | 34 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 1 |
All Organisms → cellular organisms → Archaea | 2 |
Not Available | 16 |
All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Riflebacteria → unclassified Candidatus Riflebacteria → Candidatus Riflebacteria bacterium | 1 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 2 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Desantisbacteria → Candidatus Desantisbacteria bacterium CG23_combo_of_CG06-09_8_20_14_all_40_23 | 1 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 1 |
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Utah: Grand County | |||||||
Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000212 | Metagenome / Metatranscriptome | 1580 | Y |
F000449 | Metagenome / Metatranscriptome | 1126 | Y |
F002857 | Metagenome / Metatranscriptome | 525 | Y |
F003081 | Metagenome / Metatranscriptome | 508 | Y |
F006305 | Metagenome / Metatranscriptome | 376 | Y |
F006747 | Metagenome / Metatranscriptome | 365 | Y |
F008119 | Metagenome / Metatranscriptome | 338 | Y |
F008120 | Metagenome / Metatranscriptome | 338 | Y |
F009876 | Metagenome | 311 | Y |
F016244 | Metagenome | 248 | Y |
F016358 | Metagenome | 247 | Y |
F017598 | Metagenome | 239 | N |
F017599 | Metagenome / Metatranscriptome | 239 | Y |
F019952 | Metagenome / Metatranscriptome | 226 | Y |
F028419 | Metagenome / Metatranscriptome | 191 | Y |
F029019 | Metagenome | 189 | Y |
F030345 | Metagenome | 185 | Y |
F042051 | Metagenome / Metatranscriptome | 159 | Y |
F043249 | Metagenome / Metatranscriptome | 156 | Y |
F063344 | Metagenome | 129 | Y |
F065251 | Metagenome / Metatranscriptome | 128 | Y |
F067432 | Metagenome / Metatranscriptome | 125 | Y |
F071960 | Metagenome | 121 | N |
F074391 | Metagenome | 119 | Y |
F080881 | Metagenome | 114 | N |
F084884 | Metagenome / Metatranscriptome | 112 | Y |
F085143 | Metagenome | 111 | Y |
F091390 | Metagenome | 107 | Y |
F094905 | Metagenome | 105 | N |
F096625 | Metagenome | 104 | N |
F096627 | Metagenome | 104 | N |
F099485 | Metagenome / Metatranscriptome | 103 | Y |
F102531 | Metagenome | 101 | N |
F104465 | Metagenome | 100 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0066649_10034703 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2756 | Open in IMG/M |
Ga0066649_10110403 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 1498 | Open in IMG/M |
Ga0066649_10141044 | All Organisms → cellular organisms → Archaea | 1309 | Open in IMG/M |
Ga0066649_10208870 | Not Available | 1049 | Open in IMG/M |
Ga0066649_10232837 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 985 | Open in IMG/M |
Ga0066649_10316424 | Not Available | 821 | Open in IMG/M |
Ga0066649_10353744 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 766 | Open in IMG/M |
Ga0066649_10362516 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 754 | Open in IMG/M |
Ga0066649_10384781 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Riflebacteria → unclassified Candidatus Riflebacteria → Candidatus Riflebacteria bacterium | 727 | Open in IMG/M |
Ga0066649_10401086 | Not Available | 708 | Open in IMG/M |
Ga0066649_10405289 | All Organisms → cellular organisms → Archaea | 703 | Open in IMG/M |
Ga0066649_10409079 | Not Available | 699 | Open in IMG/M |
Ga0066649_10413702 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 694 | Open in IMG/M |
Ga0066649_10436388 | Not Available | 671 | Open in IMG/M |
Ga0066649_10438541 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 669 | Open in IMG/M |
Ga0066649_10465951 | Not Available | 643 | Open in IMG/M |
Ga0066649_10479656 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Desantisbacteria → Candidatus Desantisbacteria bacterium CG23_combo_of_CG06-09_8_20_14_all_40_23 | 631 | Open in IMG/M |
Ga0066649_10498701 | Not Available | 615 | Open in IMG/M |
Ga0066649_10505192 | Not Available | 610 | Open in IMG/M |
Ga0066649_10509258 | Not Available | 607 | Open in IMG/M |
Ga0066649_10522340 | Not Available | 597 | Open in IMG/M |
Ga0066649_10523310 | Not Available | 596 | Open in IMG/M |
Ga0066649_10523557 | Not Available | 596 | Open in IMG/M |
Ga0066649_10541169 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Altiarchaeota → Candidatus Altiarchaeales → Candidatus Altiarchaeum → unclassified Candidatus Altiarchaeum → Candidatus Altiarchaeum sp. CG2_30_32_3053 | 583 | Open in IMG/M |
Ga0066649_10564711 | Not Available | 567 | Open in IMG/M |
Ga0066649_10596279 | Not Available | 546 | Open in IMG/M |
Ga0066649_10601455 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 543 | Open in IMG/M |
Ga0066649_10605675 | Not Available | 541 | Open in IMG/M |
Ga0066649_10645986 | Not Available | 518 | Open in IMG/M |
Ga0066649_10679958 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 500 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0066649_10034703 | Ga0066649_100347032 | F043249 | MPKIVGTRERVHQPFYDSLIRVDGSGDLRQGNVGVFGAVQSRSQLFVRQGADVAVSNLTTGGFFPSDQTFVTLAVRVWTYFRFNVESQRTDAQNTTGPVASLAGVTADRIQRVHKLYHQAENQLFWQFIAGDKPQLTTFTAYTPAAGGLDGFFSDTRLPRANNGVPTSAALMRLARPILVPPRQGFQVVAIASPIGQAQGASIIEQLNGAVPNNDPWGASSTGGTVTGTGGTNGLTTTGRDDIEKDIKYLIDGIHSRDVL* |
Ga0066649_10110403 | Ga0066649_101104032 | F099485 | MNQKIIHNWQHSPSETQVASNPVKAVSKSLQQTKSLVFTNEDLMGLSWILFEERH* |
Ga0066649_10141044 | Ga0066649_101410443 | F065251 | MYLRHFTKGKKKYYYIAKAVRKGTSVIQKSILYIGTADTLYEKLISLKKK* |
Ga0066649_10208870 | Ga0066649_102088702 | F063344 | MKEPTYSEYQNIKKHKISIEYSVMFFTIGRCAFFRYFEDDGIEVSPIDRSCVLPMLSDEEIEKLSKFLDETKKEYNKYIF* |
Ga0066649_10232837 | Ga0066649_102328371 | F084884 | PTLGILARFQAFFYASAFSQSDGDPPPAPARVTQTVIGA* |
Ga0066649_10311488 | Ga0066649_103114881 | F096627 | MKQDAGVFQFDLSLAELYWLAGAFGIASLPLPSDAPSGLSPSQMEARQKNGHASLLTRGLIRPSPGFGWQVERLPAALLQWISSAPSLLRLERIPKDGAAQRIHLFTAGDQGLSLEMDDDTARFVIFQTRRLLQESAVRWLALPAGAKKSTTAHDLPQPITFLPTTWKDSQLASRILKEQGMNTKTAKSTLAWAVSLEWVTALSKVKLEGQRNVLANQFVLCGNAKSIWG |
Ga0066649_10316424 | Ga0066649_103164241 | F006305 | LLIFLLISSIKSDSVCTESTLVTELIQDIKDNGKLDCLRRPLNPPSDKIESEDDKKLRLEGAWDTDCAFEADYDWLKSLKETYGLKTGLVDVNGKPVENDFDDQADMCEIVRAVIAGGLLEGAKLDELSPSVVDGIDCPGEGELGQSEICAVSGGAASQRYAWYIFLNGISITATNKPKWELSPDSQKLLDARKDTSS* |
Ga0066649_10336744 | Ga0066649_103367443 | F016358 | MIIMQMTVIVNIAQASKSFAKQSPAIKYATLKMTTSKSDIFVLPEWFG* |
Ga0066649_10337409 | Ga0066649_103374091 | F096625 | FQYFGFPVHKTGFVTKTFQQSLRFLPLLPAQLSQVIPPLDFPPIGDSYGTNRSLDLCQQTESQLASALSERLSDIQFTIETVHSEQFFADLEKLSHSLHSDHCPEDLGEAALAHHRNLSLWHALIKLEIHTTPDEIKVQFTEWYQRCAAHSTAALASVFATTYLKNNQDKEPIFEEDGAYTFKLILPCTCLDEHLLDKMADHLKDSWIYNPSSETDEVSNPELIPLYLLWFYQIVHPGPHLEFSEYLHALSSPAFTPNHLRLS |
Ga0066649_10353744 | Ga0066649_103537441 | F042051 | FGEVSWNDDVFSGSEKKNSKDLFLRLDEGSNEMRLITQPFQYLVHKYKKEGDPGFGQKVNCSAVHGSCPLCAAGDKAKPRWLLGVISRKTGTYKILDVSFAVFSQVRKYARNTARWGDPTKYDIDIVVDKNGGATGYYAVQPIPKEPLSAADQQIKDSVDFDDLKRRVTPPTPDMVQKRIDKINGVTGEAAEAAPTPSGKAAKAATKAAPAPVNMSEEEDESFPAYDGDQAK* |
Ga0066649_10362516 | Ga0066649_103625163 | F006747 | TEKTFMSNATLTSAKPPGNRTQLNKQLAQAKTLLRQMRETVEDIEDARTIERAKRANGNKPRIPWAQVKKELQLN* |
Ga0066649_10384781 | Ga0066649_103847812 | F009876 | MSAKFLGSASNSGRGILPRRSGWKPLPLFRWIPTEHFKAGIET* |
Ga0066649_10401086 | Ga0066649_104010861 | F102531 | IPIQPQKTIAQINQEIDNLVARGVDELEAIRQVGSISIPNYATTPEQIAALRLADQARTADEILQCPYTWCRHNSAISDAILESRDYQPYRAAMTMQTTEGLSAGGHQTSAIIINGEPVFIDLTNNLIITGQQALEQVLINSEKQLTALEMIRLTTNNVWDVINLIPK* |
Ga0066649_10405289 | Ga0066649_104052891 | F091390 | MNTKSTKSTTIRINEPTKEKLETLDFVRKHTFDDILTELIDFYEKNKGKKTK* |
Ga0066649_10407105 | Ga0066649_104071051 | F096625 | RSLDLCQQTESQLASDLSELTSDLQFTIETVHSEQFFADLKRLSHSLHSDHCPKDLGESALAHHRNLSLWHALNKLEIHTTSDEIQAKFTEWYRSYSAHSTAKLAALFATTYLRTHQDKEPIFEEDGAYTFKLILPCTFLDEHLLEEMADHLKDSWIYNPSSETDEVSNPELLPLYLLWYYQTVHPGPHLEFSEYLHALSSPSFTPNHLRQSQHWFLHALPYPVLPCFESGEG |
Ga0066649_10409079 | Ga0066649_104090791 | F071960 | QNQGEAEFPEGFQERTGKTVLELLAMPLGEAIPLLKQGDGSPLSRLLPHRAFEGMLLMPRGADTICATCHEHGACRDWVIGAAEDVCLELDLVEPGEILSYIAAKVIEGDYDHNTSITQIAQEFRADVLITTEVCK* |
Ga0066649_10413702 | Ga0066649_104137021 | F019952 | MVQTVQSTRVYPGQSVILLEKPRVLRRILFSIRALTDLTMGHKSHISFDDPSFLTYYILDGPVQQLEAKGEGICQGNIWAHNASSVELLFVMTEILV* |
Ga0066649_10432250 | Ga0066649_104322501 | F000212 | DPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKAPHCVTHCQAPKPECEAVCEEPKCDWKCHKPQCPKPKCELVCENPNCAPKVECCPCAAGGARVAQPFPFFKEAEKNKDCCSCK* |
Ga0066649_10436388 | Ga0066649_104363881 | F017598 | VTKLLEPHSKIVGYFLGKDIIHTNRDDITNRLAAHIIKHSQTHWTPALDVLNTTVLGKGHNTRMVTLVVGNTDYQGVLDILTQQPMETLSFLDHRTKRQNINQFDKMLKYHDYIVSHSTAVRLENVHYLDTQALWAHLQPITNTSFCDIFAGRTQGTTYIQCFKEKEPEVATAIQSYLLANFSDSADRPIIGERGGGTVNSSSTGTRTYRGNHSGKSTGKETT |
Ga0066649_10438541 | Ga0066649_104385412 | F074391 | MVQTVRSVLIQPGHKLSILEKPLVLSRIFFGICAFAPQTSWYESQISLGGPIFSSYYVMGGPVNYFGARGEGIFQGDIWVRNVSAIDLLYSVTEILH* |
Ga0066649_10465951 | Ga0066649_104659511 | F002857 | MKNTKENTGVFKDDETTEMKASMKFLTDRNFVERISLRDGMKRKFTMVKDERTEIIGFDGNTKQGITYTVTENGNLRNFFTASMKCISMLSKFSKDDMFEIELRTKKVGNNIVSFYVVKKL* |
Ga0066649_10479656 | Ga0066649_104796562 | F029019 | MKNTEICSSEYIKSEVLYRKMLHKINLLEHQIFEVKKMTNIFWDIRLKKPELACWILEGDSVWWKTAADDISERWKEIDETKDEIDWAIRNMFVSVEREEEYKKKIKNNKKRTQ* |
Ga0066649_10498701 | Ga0066649_104987011 | F080881 | MSKNSKSAVALSELAVMPQFAAPVVEITPEVLSMIVMDIDPALIEAAELKEARLDEIRAKKDEAKRLIALYHEIAKTLNPLCDEYDSVEEERKLLKGVLEDALLASRITLANHPKVLENKAQIVKAAPNLMAEFNEKFNAAVIKQTQIAQADNIARMKALDASFAELDKKTAQLS |
Ga0066649_10505192 | Ga0066649_105051921 | F008120 | MKNTETSTSECIKAEALHQKILWQINLLGCQISECKKRFTFFYTQVQKKSDVSAGEKNENDISWVEAFNDFVENWEEMENVKDEVDWMIRDIFFAIEREEEEYNKKIKNNKKRTQ* |
Ga0066649_10505192 | Ga0066649_105051922 | F104465 | MVREKPDENEKTIKEYNARGERMYKNAINRENAKKTRKIK* |
Ga0066649_10509258 | Ga0066649_105092581 | F102531 | YGRQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK* |
Ga0066649_10522340 | Ga0066649_105223401 | F030345 | MELKNEKICIQTIEKRERENDSCWYAITDNQGKRFSCFEDEVAKKLQTNIVNLCKVKYSGKYANVMSVEGYEDNPKVADVNIERKREKNLESLRILKCVALKSAALCSGGQSVNSAEVLTKA |
Ga0066649_10523310 | Ga0066649_105233101 | F085143 | MARIDDGFATLIEFAEDSDVQMWEKEVTPPGVSGGGENDTSTMRNTTWRTKSPKGLMSLSEASLVVAYDPAVYNEIITMLNVNQQITITFADGSTLVFWGWIDEFTPGAAAEGSQPTATVKIIPSNQNGSGVETAPQYSAAP* |
Ga0066649_10523557 | Ga0066649_105235571 | F008119 | ENVYNMIFVCTEKTNKSITHLVRSVITIGVKRSSTDGTVYFDKINMNIGYINNAGNFTSVSDLDALHAFSTASETYVDLCLQEFVRIMSDYPLVNKRFAVKVNIFAHVDATTTSGQLGMYHTRGSADSYVEVELK* |
Ga0066649_10541169 | Ga0066649_105411692 | F000449 | MENKKAGEEDIPSPDFDQEFELRTSAPQEEMERYYARKKASLQRKIRQINKKIKKNIYNPYAGDDEKHKNVLE* |
Ga0066649_10564711 | Ga0066649_105647111 | F000449 | MKNKKAGEGISPSPDLDQEFELRTSAPQEEMERYYTRKKASLLRKIRKINKKIKKNIYNPYAGDDEKHKNRK* |
Ga0066649_10564711 | Ga0066649_105647112 | F017599 | MQATMKNTKIENEIKKRTPPGSKNIEKYRISTEYTSISYTTPHFLVFRYYCNSETYLCETRKSIVLWWISQAEKEKLLAFEKQKKEKHKGLKVFDGTTMEGL* |
Ga0066649_10586203 | Ga0066649_105862032 | F094905 | MNSLYGYIAVLIVVVFLGAGWAVEHDKRITYQAKVEQAGADALAQTEKINAKHREEMQNAEQNTIIATNSIAD |
Ga0066649_10596279 | Ga0066649_105962791 | F028419 | FASSIMSVHDINFNIMNDNMVTTRHYLSLNYWDLRNLGSPSNKFLLYEPIITKLSYLYQNNYMSDKFSLSSDPTGKVIITGGYNNMFHIIDADQKLNTQIIIDENNEKIMNTNVIRKINSKGSCFYKKDDPSLTNINFDKRILHQTYSPVENFCHLILLNCIYSYTGALAKKSK* |
Ga0066649_10601455 | Ga0066649_106014551 | F003081 | MREEMFEETRFGTEVFAMHVRGVDVIFVFSYLHILKKIHLKNYITS |
Ga0066649_10605675 | Ga0066649_106056752 | F016244 | MCIFPYLCFCNYNYKIFKNIIKINKMVLPILPGIIVLAGARILVSYGTHLLRFIVANPKILLGTATVATVADALKEHEKNEQIRNSILQDIYTQNPELAQKIVSAGGFSFHPVENIFQMAVSSAITGLIIYAIIQKI* |
Ga0066649_10621776 | Ga0066649_106217761 | F067432 | PHCVTHCQAPKPECEAVCEEPKCDWKCHKPACPKPKCELVCENPNCAPKVECCPCAAGAARVAQPFPMFKESATNKDCCSCK* |
Ga0066649_10645986 | Ga0066649_106459861 | F002857 | MENKKRIWGTTQMDEAYDEMIRMEKEKQNGSVFKNDEKNDKKEMEASMKFLSDKNFIERISLRDGMKRKFKLVKDEATEIIGFDGNKKQGITYTVTENGNLRSFFTTSLKCISMLSKLQKDDVFEIELRTKKVGDTLISYHVVKKL* |
Ga0066649_10679958 | Ga0066649_106799581 | F008119 | KEVVRKIYVFQQVTNPYVFEGNNTHLLMENPSPSNLINGSDFSTLNSFSNTDSLIVYDMVFVCTEKTDKSITHSVRGVITIGVKRNSGNGNVNFDKINMNIGYIESTGAFTPISNADATHAYSTTSQTYVGLCLQDFVGIVNDYSLAGKRFAVKVNIFAHVDSADT |
⦗Top⦘ |