| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026919 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0114663 | Gp0115666 | Ga0209892 |
| Sample Name | Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T3_23-Sept-14 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 57622659 |
| Sequencing Scaffolds | 10 |
| Novel Protein Genes | 14 |
| Associated Families | 14 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 5 |
| All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Groundwater Microbial Communities From The Columbia River, Washington, Usa |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater river biome → microcosm → sand |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Columbia River, Washington | |||||||
| Coordinates | Lat. (o) | 46.372 | Long. (o) | -119.272 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F010768 | Metagenome / Metatranscriptome | 299 | Y |
| F023108 | Metagenome | 211 | Y |
| F025224 | Metagenome / Metatranscriptome | 202 | Y |
| F026270 | Metagenome | 198 | N |
| F026867 | Metagenome | 196 | Y |
| F027438 | Metagenome / Metatranscriptome | 194 | Y |
| F039104 | Metagenome / Metatranscriptome | 164 | Y |
| F042872 | Metagenome | 157 | Y |
| F043284 | Metagenome | 156 | N |
| F048834 | Metagenome | 147 | N |
| F050335 | Metagenome / Metatranscriptome | 145 | N |
| F058798 | Metagenome / Metatranscriptome | 134 | Y |
| F072095 | Metagenome | 121 | N |
| F096710 | Metagenome | 104 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0209892_1003475 | Not Available | 1255 | Open in IMG/M |
| Ga0209892_1004537 | Not Available | 1050 | Open in IMG/M |
| Ga0209892_1006149 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 878 | Open in IMG/M |
| Ga0209892_1006746 | Not Available | 834 | Open in IMG/M |
| Ga0209892_1007831 | All Organisms → cellular organisms → Bacteria | 768 | Open in IMG/M |
| Ga0209892_1009140 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 702 | Open in IMG/M |
| Ga0209892_1012728 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 585 | Open in IMG/M |
| Ga0209892_1015844 | Not Available | 519 | Open in IMG/M |
| Ga0209892_1015965 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 517 | Open in IMG/M |
| Ga0209892_1016857 | Not Available | 502 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0209892_1003475 | Ga0209892_10034751 | F072095 | VAVIRDLEKEGVLPATASQSYGSAQLSQTTALIDLQVKHRDLRNRLDRIEEILESLLKKSGVQS |
| Ga0209892_1003475 | Ga0209892_10034753 | F042872 | MTPQSIIDQIESGNCTVGSNPTPSESYHYNDLRRNGNTAVITEPKKKHYQQHSLLIRGGYPAEPAYHPNPSVARMWSAQR |
| Ga0209892_1004537 | Ga0209892_10045371 | F050335 | VALSITQQPDSYHPAFNDTNFVITESSGGIYTKDNFKFIANVKVAATSVAKLKAPIYFGSVNKGVFNIGRIMESYVSNNWSFTDTSPSGCVDSFSDYEVEFGYEYSPSATGTITEYLDLTSATGTVWNAALNPFDLVTYAQAQYLATSASAKFLTNVRTRYIHR |
| Ga0209892_1004537 | Ga0209892_10045372 | F026270 | RVIEKYPLLFEGGDEGYDGAVDQSPIVWLEIVDKIVKGDRTKWDFILDMPLIEFLNAMAFYKAKTKERQKRLEDAAGKGFNPYIVACLNEML |
| Ga0209892_1006149 | Ga0209892_10061493 | F025224 | MADAMWMPGSFFDNVDKFANSLGYQNAGVAMELAMMSWNSPEERDAFITSITNAEVQGGTEKNYIKRK |
| Ga0209892_1006746 | Ga0209892_10067464 | F039104 | MTNLKALVVKASINNIIKWRVYFAGELLATFENET |
| Ga0209892_1007831 | Ga0209892_10078312 | F026867 | MKLRKQDQALSLRQLGAAYGLGYVRIKQMRELSGFPLIAGKVIPSDFDQWRLMQTGLNLPRPADRLRSAAGKAHALASRSDSRVSWRQIANTLKDEAWLHELREGNGSSDF |
| Ga0209892_1009140 | Ga0209892_10091402 | F048834 | MAIPGVDWAAALDAYVAGDREEVVPPGWFTKIDIAKLWGKTPVYTNKVLTRMMKSGGAERKVFSIRIKICDRGSKVGHCR |
| Ga0209892_1010969 | Ga0209892_10109692 | F023108 | MGKLFIVIISILLYSCSAQYHLNKAIKKGYVCEDTLQMDTIRIATIDSVPVIVNNEIIYEKFITQKDTIVKWKTKNVYVPKTRIELKREYKIKIKTIYKDKVVEKAQARAEGKK |
| Ga0209892_1012728 | Ga0209892_10127281 | F058798 | MAEHKVEGGSMLLFIDPDGGTAYDTVVCLTSVSKSDSVTVVDASSACGPDKSPGTLELSYAFEGQHLQDPENGKISGTDLRILLRGEQTIGWKIAPEVPVEGDEIEEGTGYLSELSSTYSFDSVGTFTGTIQPYGTPTITISGGGGG |
| Ga0209892_1015844 | Ga0209892_10158441 | F043284 | KLNNTRDKALELIGKTIGLASDAGAIIQQARADGQDIVAICEEAGITEEVGKRYEKVAAAQHKLANGDADPGLMRQTYLRIGFLPDPITMSEPSEPKHFLFPIMQARQWLASRGVKFISQDKGLRAQFLAEAEPIVKTYNELKDSA |
| Ga0209892_1015965 | Ga0209892_10159652 | F010768 | VDNGGYNVHPVPPPNSEIKERIRKRYESKRIKIEKLLTLGYTTSGDP |
| Ga0209892_1016403 | Ga0209892_10164032 | F027438 | MTTRKSHKARGATYETDIRDWFRANGYDSERLARTGAKDEGDVVVRSGF |
| Ga0209892_1016857 | Ga0209892_10168571 | F096710 | MKDLVTKSALVSIFNSKQHDFLYIIKQVNTELFLPRVPKAVVKDMIKSEGLYNISYRIYTNEKGYESIYISKASKIVTK |
| ⦗Top⦘ |