| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026893 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0114663 / Gp0115674 / Ga0209883 |
| Sample Name | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_21-May-14 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size (bp) | 58403968 |
| Sequencing Scaffolds | 8 |
| Novel Protein Genes | 10 |
| Associated Families | 9 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 1 |
| Not Available | 2 |
| All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Thorarchaeota → Candidatus Thorarchaeota archaeon | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Skeletonemataceae → Skeletonema → Skeletonema marinoi-dohrnii complex → Skeletonema marinoi | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Groundwater Microbial Communities From The Columbia River, Washington, USA |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, USA |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater river biome → microcosm → sand |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | |
|---|---|
| Location | USA: Columbia River, Washington |
| Latitude (°) | 46.372 |
| Longitude (°) | -119.272 |
| Altitude (m) | N/A |
| Depth (m) | N/A |
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000212 | Metagenome / Metatranscriptome | 1580 | Y |
| F015593 | Metagenome / Metatranscriptome | 253 | Y |
| F027760 | Metagenome / Metatranscriptome | 193 | Y |
| F044595 | Metagenome | 154 | Y |
| F054132 | Metagenome | 140 | Y |
| F065441 | Metagenome / Metatranscriptome | 127 | Y |
| F082695 | Metagenome / Metatranscriptome | 113 | N |
| F088774 | Metagenome / Metatranscriptome | 109 | Y |
| F104469 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length (bp) |
|---|---|---|
| Ga0209883_1000847 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 4136 |
| Ga0209883_1001218 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 3100 |
| Ga0209883_1001836 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 2195 |
| Ga0209883_1013865 | Not Available | 562 |
| Ga0209883_1013926 | All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Thorarchaeota → Candidatus Thorarchaeota archaeon | 560 |
| Ga0209883_1014164 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Skeletonemataceae → Skeletonema → Skeletonema marinoi-dohrnii complex → Skeletonema marinoi | 555 |
| Ga0209883_1014956 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 539 |
| Ga0209883_1017303 | Not Available | 500 |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0209883_1000847 | Ga0209883_10008472 | F082695 | LFVLLSYESQTGDESDVARWAHKLMNRSVANRTITKPEAMCELGQLPMVICSESIETVSITGQTRCSIDTTTSTILSQYKNRPNTQERLSLHEFYHAKRNNHSMTTAANHREFVPHYVGGRGQPVYPVTDTKQSISYARSEILKHMPWSQKNPMPNECDWVAIFKEFLQDPSCPAGVKLGFERAKLRYELRRKGIQEVFQPDTEHSNATDDLDDDEIGDVIALTESLGYTEDELDKMEENGFCIGRDYDWGRRVYTVSNICII |
| Ga0209883_1001218 | Ga0209883_10012181 | F082695 | KLMNRSVANRTITKQEAMCELGGLPMVICSESIETISITGSTKCATDTNTSTILSQYRNRPDAQQHLSLHQFYHIKKNKKLASTPSYREFIPHYVGGKGQPVYPITRSYARSELLKHLPWGRKNPMPNDCDLITMFKQFLENPKCPVGVRLGFERAKLRKELKEKGIQEAFQPDIEHSSNTDDVDDDEVGEVIALTESLGYTEDELEKLENNGFFLGKDYDWGKRIYTVSNPTIHHTRFGIILNY |
| Ga0209883_1001836 | Ga0209883_10018363 | F044595 | MNGLVAFVRAAAAMESFRGCASALHLTLPASFQLDRNEDQLHPGEGSA |
| Ga0209883_1004738 | Ga0209883_10047381 | F104469 | QRLRLNPTFLAPEVSNIAYLHDIEDKTITVDNDDASDNNLHQMNVEADVNSVASSGSVAAVDSFSENKFVDTTKAPAGDSDYACFTTSQKCITSLMYLLDDMECPDYAFQCIMDWARNCFETGFDFNPKSKTCLGNLKWMYDSLHNAKQMLPNVMLIQLPDPLPDTKSMDVICYDFVPQLLSILQNKEMMSANNLVLDPNNPLAMYKPQNCRLGKALSGSVYQDMYQRLVSNPTKQLLCPLICYTDGTQIDALSRFSVESFLFMPAVLSHVTRCKAEAWRPFGYVQHVRSTQTKLNGAAKARNYHAQLQAMLQGLQRVQTGVDSRLQNVEIYCFGKCLRVDVLCPILFIAADTPAADKLCGHF |
| Ga0209883_1010573 | Ga0209883_10105731 | F000212 | MKLITKIIVMTFFLMNSTMSLGLRTETQCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKQPHCVTHCQAPKPECEAVCEEPRCDWKCHKPNCPKPKCELVCENPNCVPKVECCPCAMGAPR |
| Ga0209883_1013865 | Ga0209883_10138651 | F015593 | AKGEIELADEVEMKLTEDEKISHSNAWRSHRETTESLKKSRGKVYSLLLGQCTQVLIDKMKQDTDWVTISESFDPTLLFKLIEKFVLKQSDNQYATAVLISEQLSILSFRQDDHLGNAAYYDRFTTRVEVARQAGVCYYSPALLEEKATQLKLGAYDDLASDAKKKVVDQVEQEYLAYLFLNNSNA |
| Ga0209883_1013926 | Ga0209883_10139261 | F065441 | KREMYAKIALLMFYPFRQLNDLTYNGSYWRLFHNELKKHINKENTVFWKKGFEILQNIQDRSTLEKHVKRARDPISITTKNEKPNDANGIQAKSMAGNSAMGDILDMNKQLK |
| Ga0209883_1014164 | Ga0209883_10141641 | F027760 | CQLDLTPNDHKIERKKIKSKLDDLNNKLAQEMTGRKFKLEPEQFSPEVTVKHYQTSSKKVAFRCKMKQIPANSNDATTGHKLQGMSKDAIIVSSWPTGGLAAMFKNWEYVVLSRVRTLSGLYLVKPIDMDKSFQPSPQLASYMDKIRKFEKDMLEKRKQAISKTFL |
| Ga0209883_1014956 | Ga0209883_10149561 | F054132 | QSRKPVDLARVQAIEFLDAALGADRRQLIKQYVENHDSAPKLAERIWQAIYDLSQGFIYAYQTALEEAMRQNGNARWKPLTPLLFARLVHYYGTDAKLRVFRYERWIPGKWMELHRVYMRASELGFDRVPVVMPSAGPNATPWTIEQEYLYVLLVHQLNTGNMSPPQLDWAMSQLRAWS |
| Ga0209883_1017303 | Ga0209883_10173032 | F088774 | IDVGSTVQADAKFLRRVFLGLVLLTLATGFLAGLTTVQAMTDKTGTCVLLCQ |