| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006007 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0114663 / Gp0115666 / Ga0073917 |
| Sample Name | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T3_23-Sept-14 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 130273757 |
| Sequencing Scaffolds | 35 |
| Novel Protein Genes | 40 |
| Associated Families | 39 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 2 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4 |
| All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 2 |
| Not Available | 18 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 2 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae | 2 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → Prevotella disiens | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Groundwater Microbial Communities From The Columbia River, Washington, USA |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, USA |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater river biome → microcosm → sand |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | |
|---|---|
| Location | USA: Columbia River, Washington |
| Coordinates | Lat. (°) 46.372, Long. (°) -119.272, Alt. (m) N/A, Depth (m) N/A |
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000166 | Metagenome / Metatranscriptome | 1810 | Y |
| F000331 | Metagenome / Metatranscriptome | 1285 | Y |
| F001097 | Metagenome / Metatranscriptome | 780 | Y |
| F001808 | Metagenome / Metatranscriptome | 631 | Y |
| F002487 | Metagenome / Metatranscriptome | 554 | Y |
| F003299 | Metagenome / Metatranscriptome | 495 | N |
| F003806 | Metagenome / Metatranscriptome | 467 | Y |
| F007169 | Metagenome / Metatranscriptome | 356 | N |
| F008361 | Metagenome / Metatranscriptome | 334 | Y |
| F008688 | Metagenome / Metatranscriptome | 329 | N |
| F009134 | Metagenome | 322 | Y |
| F009204 | Metagenome | 321 | Y |
| F009682 | Metagenome / Metatranscriptome | 314 | Y |
| F010688 | Metagenome / Metatranscriptome | 300 | Y |
| F010915 | Metagenome / Metatranscriptome | 297 | Y |
| F011136 | Metagenome / Metatranscriptome | 294 | Y |
| F016803 | Metagenome / Metatranscriptome | 244 | Y |
| F020140 | Metagenome / Metatranscriptome | 225 | Y |
| F021301 | Metagenome / Metatranscriptome | 219 | N |
| F021761 | Metagenome / Metatranscriptome | 217 | Y |
| F023108 | Metagenome | 211 | Y |
| F026423 | Metagenome / Metatranscriptome | 198 | Y |
| F030691 | Metagenome / Metatranscriptome | 184 | Y |
| F031025 | Metagenome / Metatranscriptome | 183 | N |
| F032259 | Metagenome / Metatranscriptome | 180 | Y |
| F033034 | Metagenome / Metatranscriptome | 178 | Y |
| F033776 | Metagenome | 176 | Y |
| F041765 | Metagenome / Metatranscriptome | 159 | Y |
| F043233 | Metagenome / Metatranscriptome | 156 | N |
| F044447 | Metagenome / Metatranscriptome | 154 | N |
| F050158 | Metagenome / Metatranscriptome | 145 | Y |
| F054024 | Metagenome | 140 | N |
| F055558 | Metagenome | 138 | N |
| F055721 | Metagenome / Metatranscriptome | 138 | Y |
| F056186 | Metagenome | 138 | N |
| F057371 | Metagenome / Metatranscriptome | 136 | Y |
| F072094 | Metagenome / Metatranscriptome | 121 | N |
| F080037 | Metagenome / Metatranscriptome | 115 | Y |
| F097186 | Metagenome / Metatranscriptome | 104 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0073917_1001287 | All Organisms → Viruses → Predicted Viral | 2901 | Open in IMG/M |
| Ga0073917_1003659 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1605 | Open in IMG/M |
| Ga0073917_1007549 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1100 | Open in IMG/M |
| Ga0073917_1007753 | Not Available | 1084 | Open in IMG/M |
| Ga0073917_1008344 | All Organisms → Viruses → Predicted Viral | 1042 | Open in IMG/M |
| Ga0073917_1008914 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1006 | Open in IMG/M |
| Ga0073917_1009577 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 968 | Open in IMG/M |
| Ga0073917_1010616 | Not Available | 916 | Open in IMG/M |
| Ga0073917_1011431 | Not Available | 883 | Open in IMG/M |
| Ga0073917_1012601 | Not Available | 839 | Open in IMG/M |
| Ga0073917_1012904 | All Organisms → cellular organisms → Bacteria | 829 | Open in IMG/M |
| Ga0073917_1013425 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 812 | Open in IMG/M |
| Ga0073917_1013591 | Not Available | 807 | Open in IMG/M |
| Ga0073917_1015133 | Not Available | 762 | Open in IMG/M |
| Ga0073917_1016059 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 738 | Open in IMG/M |
| Ga0073917_1016638 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 725 | Open in IMG/M |
| Ga0073917_1017387 | Not Available | 708 | Open in IMG/M |
| Ga0073917_1017604 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 704 | Open in IMG/M |
| Ga0073917_1019299 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 670 | Open in IMG/M |
| Ga0073917_1020109 | Not Available | 657 | Open in IMG/M |
| Ga0073917_1021286 | Not Available | 638 | Open in IMG/M |
| Ga0073917_1021705 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 632 | Open in IMG/M |
| Ga0073917_1022321 | Not Available | 623 | Open in IMG/M |
| Ga0073917_1022549 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 620 | Open in IMG/M |
| Ga0073917_1025072 | Not Available | 586 | Open in IMG/M |
| Ga0073917_1025368 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 583 | Open in IMG/M |
| Ga0073917_1025998 | Not Available | 576 | Open in IMG/M |
| Ga0073917_1026987 | Not Available | 565 | Open in IMG/M |
| Ga0073917_1027229 | Not Available | 563 | Open in IMG/M |
| Ga0073917_1028413 | Not Available | 551 | Open in IMG/M |
| Ga0073917_1029961 | Not Available | 536 | Open in IMG/M |
| Ga0073917_1030960 | Not Available | 528 | Open in IMG/M |
| Ga0073917_1031661 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → Prevotella disiens | 522 | Open in IMG/M |
| Ga0073917_1031729 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 521 | Open in IMG/M |
| Ga0073917_1033666 | Not Available | 506 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0073917_1001287 | Ga0073917_10012872 | F021301 | MNWNNITIHQLQEIHSCRDMSDLERQMNILAIALNLSMDEVESMTLDKLTSEFEKLSFLNDLPKAPIQFMFKLRGRYFKLAKTPNEMCGHHFIELQQVFNGDVIESLNKIVALLSVEVDFFGRNKKVVDAQAHYEDKCGLMMGLPVPLPYTYALFFLEVYPELLKNILCSLKEEMKDMTEQLTNPQ* |
| Ga0073917_1001287 | Ga0073917_10012874 | F031025 | VALSITQQPDSYHPAFNDTNFVITESSGGIYTSSNFKFIANVKVAATSVAKLKAPIYFGSVNKGVFNIGRIMESYVSNNWSFTDTSPSGCVDSFSDYEVEFGYEYSPSATGTITEYLDLTSATGTVWNAALNPFDLVTYAQAQYLATSSSAKFLTNVRTRYIHRTQKDWLYALKGDATSVVITYSDASTQTFTLPSSKVVRIPVGSQLTIPGAATYFDVVLKLGGTAKSETYRINIKDECSKYETTDIFFMNRLGGFDSFRFNMVRRDTFEVARKQFQSNPYTLGATYGYATSVRTRSNYHTTASQKVKLTSNWIDDTESVWLKDLIESPVVYMYDGTLYAVNIDNANYEQKKGVQDKLFNLELDITLSFADKSQRL* |
| Ga0073917_1003659 | Ga0073917_10036591 | F072094 | MKLLILTDGINGVVYHRLFTPHLRMQIDGQADVSVCQSIEEWLTLDYTQFDVIIFSRWLGAKHYDVLKKIADSGTPYVVDIDDYWILPKYNPAYW |
| Ga0073917_1007549 | Ga0073917_10075492 | F009682 | LTKIERDNNIQTMIKDLYMKNSNEKILVQSDDLKYDGKNIIIPSYYASIVYDYLDNVNIEDMNLNDADMHDYLAFCSFFEDIIDHKADKGGN* |
| Ga0073917_1007753 | Ga0073917_10077532 | F097186 | MSDLIVGEQDPRGVDWVSITLGACLGATITFIVGGQNGEAVGKSIERATVQCIDELGVRNSDSTSACWDMQAEALNRMELQRQAIDRLAERCVLSVPEADRLRVTRVKDVKPFKVVIPAPSSSAGSADWDAEDFGPPGDPDGKP* |
| Ga0073917_1008344 | Ga0073917_10083442 | F007169 | MSYQLQFDFETLEQKEKRLKDWHDQQVKLNKMFEGKANDYYIYNKHVDQFIDFLPYRLGWGLKGNYNELRWWIKCQYQKFRYGVSDDEVYSLETNIAKYMVPRLQYFKKKGKMGIPMKFLPSNYDNLQDEDREKAEKIGEKEINRILDEMIFAFDYIIDPDKYVTFPKSCSWDIKDKNYFNREKNLEAKQAWDEYTKTCEQLDARKKQGLQFFVDHMDMLWI* |
| Ga0073917_1008914 | Ga0073917_10089141 | F020140 | FDSPYSLHFMTHKFAVIVDKKFYNQIQNNSDFYLRVLVLDVIRALVPNEESADKYSSDIFYKCAKSISKEVKMLEDEKTVIFYLELSSGYLDDFFEKPLDDFE* |
| Ga0073917_1009577 | Ga0073917_10095772 | F001808 | SCSAQYHLNKAIKKGYTCEETGDTIRITTLDSIPVIIHDSIVWEKFITTKDTIIKYNTVYVPKTRLEKRIEYKLKVKTIYKDRIVQKAQAKATRPKTRGNLNLLFVGVGIGLLLSYLFKFARDKYLF* |
| Ga0073917_1010616 | Ga0073917_10106162 | F021761 | MISNLWVNRITALVVLAAIYAAGYAGGRDATVQAHHNHPACHTNLKP* |
| Ga0073917_1011431 | Ga0073917_10114312 | F008361 | LVAVELSEATKELEKATSAIENAETSKARLDASVELKKATARLEGINYLS* |
| Ga0073917_1012601 | Ga0073917_10126013 | F010915 | MIVIVQVGLDLYTLSYVKYEECDIHQCIKYDLSSEDVSQFLGNN* |
| Ga0073917_1012601 | Ga0073917_10126014 | F033034 | MKVAKLTKKAQDIVNQIMSADAVDIDHGCIGERIAYGNMHLSGEGLECNVNVDGDIGEMIISNESLNAAVISEGTIEIDSEHIKDNPYGDMTITLYKLSKINAL* |
| Ga0073917_1012904 | Ga0073917_10129043 | F001097 | MNNFEWPTNDSSRIKPLQGLRSERVDTQVQPKEIDDWLKQSVALVAGSVRGLGTNQRKVRYFAFEGQNDEATK* |
| Ga0073917_1013425 | Ga0073917_10134252 | F026423 | LVTVGCTAPMKQPTTVGPYCNISWDKTNNSKVAWYQLTVIDQSKQAKIVRFIPADTTTVSCRDVGANHDGIWEVTVQSCYDKSTCGLPTEAARMQITTK* |
| Ga0073917_1013591 | Ga0073917_10135911 | F041765 | AEFFNKEVDKIIYKVTTTKSTKQRIKYIKQMIALKNRLSLEVKMLEDLDNF* |
| Ga0073917_1015133 | Ga0073917_10151331 | F030691 | TQTSGSATSGEIAFDVFVKNISSTSDAARLSLAQQLKDAGLWTGKISSKFNIKYYTALAKLEEKYQGQITVDQIVGATVSAKRFDVLADLVEGGDGEDGPKTTKQTYVTSASQTAKLLNAVAVDLLERDLTKAEQAKYLKMINAEQRKQPSVQTSGKGFTTTLGGVDEEQFIKEKLQSTSEAKNVRATDAYTVLMKEFGGLR* |
| Ga0073917_1016059 | Ga0073917_10160592 | F002487 | MITRQDAIKDLSHGDYCCYCTEPKTSGSCCGENHFVPFEDLYEEDKEAMIEEYLSEGNSNGT* |
| Ga0073917_1016638 | Ga0073917_10166382 | F032259 | MLAEPVRARVYAFDKDKKLVGPSKVVLPAGWYVLPKN* |
| Ga0073917_1017387 | Ga0073917_10173871 | F044447 | MKLQDLTIDQFQRIGAIEFSSVLGDYDKRAGVVAIVEGVDISIVREMPAKSVLKRYKVIISEWNALP |
| Ga0073917_1017604 | Ga0073917_10176042 | F055721 | MNREEYYKYIEENDTYPEHSHTWIVRTYTGDKLFYRNFGTFETKEEAKEFIENYKVKYTTKGFITRYSIQGLCEVL* |
| Ga0073917_1019299 | Ga0073917_10192992 | F000331 | ERVHQVIATMLRTAEIDMANSVAPSDIDTFLTNASWAIRSTYHTVLKASPGAAIFGRDMLFDIPYIADWSKIGDYRQRQTDLNTARENKSRADYDYKVGDKVLIRKDGILRKSESRYDSEPWTITSVHTNGTIRVERGTKSERINIRRVTPYFEN* |
| Ga0073917_1020109 | Ga0073917_10201091 | F009134 | VCIEKKTESLESLSILVRNRLAADAASTLERIDSYSLDGIKDESVRETILGSVAKRSALVFGWSEQGEQASVSINLLGSMPDRSIEVSVTNEAETK* |
| Ga0073917_1021286 | Ga0073917_10212862 | F080037 | HDCMTQAMAYAWAIRDNAQDDGVPIPVELVASFQDDYNNIIAALNEAHNLAS* |
| Ga0073917_1021705 | Ga0073917_10217051 | F023108 | MMRYLAIILLLSSCSAQYHLNKAIKKGYKCEQTGDTIRITTLDSIPVIINDTIVWEKIINTKDTIIKYNTVYVPKTRLDKRIEY |
| Ga0073917_1022321 | Ga0073917_10223211 | F008688 | FITVTVEPPKTKVYLIMSYRGNDLSIEKVYLKKENAQKYCDMYKDSHNYSVEERELTE* |
| Ga0073917_1022549 | Ga0073917_10225491 | F056186 | MTKEQAAALREKWEEGENPPCRHLHLELEHNNDDYLTDNYHCTACGELVAANTRDPFQVI |
| Ga0073917_1025072 | Ga0073917_10250722 | F043233 | AGIKDAVGIAKIRLKAESASDLEKKVAKYESELAQLRKATTPASGQPSAPARQKQFHELSSNEQEKELLRMAAEADRMGV* |
| Ga0073917_1025368 | Ga0073917_10253681 | F003299 | PVKLVTRIIEAYDADHVKQLIQKNDDLILLIEEV* |
| Ga0073917_1025998 | Ga0073917_10259981 | F009204 | NNYPRRPYNTKTRKSIQERIKMKTTIKYYTQNIYGVRREKFIDKKQESVFFQLTGRRTLDSVSRELIRDLSGSSIEFEQSLPPE* |
| Ga0073917_1026987 | Ga0073917_10269872 | F054024 | MSDTFSNHFYIEGPYEDLLNVTKDLDFTDGSIDYDGWEIEGGSAVLHFDGYYCPLDELEKASAKYPSLKIIFRFTQELLIAGLLIYEDGKIKLQSYYNWDTGTSSVTTAQE* |
| Ga0073917_1027229 | Ga0073917_10272291 | F080037 | MAYAHAIRDNAQDDGVPIPMELVVSFQDDYNNILTALNEAHNLAS* |
| Ga0073917_1027229 | Ga0073917_10272292 | F057371 | MQEIKVRFEPADLTDLDHQAAAAGTSRSAFIRNKALSLPVARLNTVEYHALVADAVSAMRGDLPRLQVEYLVAYVITRLDQHSRQAVAGHQPAT* |
| Ga0073917_1028413 | Ga0073917_10284132 | F016803 | MISLISRVRAAWAFGRHQCWVDALPWNRDDATTLNNFFKSETGKKFKDALLNTVLMQNASAITDKNHLQYSSGFAMGQASLVKVIEMMADRESITGQEDDPDSVTNT* |
| Ga0073917_1029721 | Ga0073917_10297211 | F050158 | MAQETVSIAWCDNGMVDGKFMQGVTDVMLKSGINFTTTLRSQ |
| Ga0073917_1029961 | Ga0073917_10299611 | F033776 | ARSKGLNFTVNIRLSREEIEAARRLGDGNISMGVRWCIRYANGREMKPIKLSTMLRSAAVLAAQLEAA* |
| Ga0073917_1030960 | Ga0073917_10309601 | F055558 | DGFNLQSAGKLMTYNFALIVMDRVFESESNTIEVLSDTAQIMSDIFALVETNTESDGDFELSINGNASPFYDSKTDILAGYAINFQVLTPYLSNSCVVPI* |
| Ga0073917_1031661 | Ga0073917_10316611 | F011136 | SRLAHDKQKGSDLLHEVLARLMDRPQQDIEDIVCRGKVEAYVNRALWLSWHSNRSDYAIKYRKYYELHVEKQVDDSKQDETWIGAFIDGEYLYNAIGRLNEFDAILLRLYSKPDFDYKELSAETGIPYSYLRTSIHRALKRIREYVKLQRSLSHTARETEYLQKM* |
| Ga0073917_1031729 | Ga0073917_10317291 | F003806 | MSNEIEIPLKLSGVQSLKAELRSLKAAIAEASDPEQMAALAAQAGKVADRIKDANDAVNVFASGSKFEQIKNSFGGIQDSLMSLDFEEASDKAKVFAKNL |
| Ga0073917_1033666 | Ga0073917_10336661 | F010688 | TLAPLPILLGTWCIVRRPLPRCNMRPKTATIMVIAVGPKGHRREIGGAPSHSACGCDEADNNAPMIAIPVEALSTDTEDGQQASPEVGDEVVLQEVRGVLKKLENGEAYVEIKSVNGMPAEYEKAGKESMEPMDEEGMRNMVSEYDSEMES* |
| Ga0073917_1033968 | Ga0073917_10339682 | F000166 | MPNIPTPEQSQLFAQSVRKWQQVLSLGDWRIEKGSKAAKAAMASVEFNASARLATYRLGDFGAERITPESLDQTALHELLHVFLHDLMTVAQDPKSSQDEIEMQEHRVINLLEKLLSKDSNGRT* |