| Basic Information | |
|---|---|
| Taxon OID | 3300027365 Open in IMG/M |
| Scaffold ID | Ga0209300_1000214 Open in IMG/M |
| Source Dataset Name | Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S input2 RT (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 35171 |
| Total Scaffold Genes | 45 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 37 (82.22%) |
| Novel Protein Genes | 10 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 9 (90.00%) |
| Associated Families | 10 |
| Taxonomy | |
|---|---|
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | (Source: UniRef50) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface → Subsurface Microbial Communities From Deep Shales In Ohio And West Virginia, Usa |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | Ohio, USA | |||||||
| Coordinates | Lat. (o) | 39.849 | Long. (o) | -81.036 | Alt. (m) | Depth (m) | 2500 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001733 | Metagenome / Metatranscriptome | 644 | Y |
| F031872 | Metagenome / Metatranscriptome | 181 | N |
| F033435 | Metagenome / Metatranscriptome | 177 | N |
| F039641 | Metagenome / Metatranscriptome | 163 | N |
| F042901 | Metagenome / Metatranscriptome | 157 | Y |
| F049628 | Metagenome / Metatranscriptome | 146 | N |
| F050378 | Metagenome / Metatranscriptome | 145 | N |
| F059968 | Metagenome / Metatranscriptome | 133 | N |
| F065785 | Metagenome / Metatranscriptome | 127 | Y |
| F066786 | Metagenome / Metatranscriptome | 126 | N |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209300_100021410 | F066786 | AGGGGG | MTQERVDLTWRCGHTAHIMVGYSQSDLKYKMAMMTSTLQICAACENKLAIERAWSLTQRLLEPNPIVMTGSEKQIEWARSIRTTKYEVLAHVLDCLHEAYKTRQDEWPAIARAISPVVNDVSIWRSYTQSGAIIDRRNINWTGAFTNALSRAGLYIGGLK |
| Ga0209300_100021413 | F065785 | AGGA | MDIKLSCIECNRPNVVPYGRGHRICGICSQRQLKRERRKQTQRRIQTLGGFVLVVMCVWTACAMASDWNTPNSPDHRAHQAMQARD |
| Ga0209300_100021420 | F059968 | GGA | MSKATATEEKAELLVRIKELRAAGNSISRTAQIMCMTRGTVQRWIMEERPEREVKKMDPYVSIDEKTAIVVKWAELIASGESRSSAAASVGFPTMMLNRWLMSEPSLRIEFQESIGKKQNNFGGRKSFEQILADVRAGRPVWRDGGRFKIQLVEAALMRYELDGANVWRCKGFATLSGNDVLARDWTVVS |
| Ga0209300_100021421 | F049628 | GGA | MKFSEVIEPLMHGKPVTRASWEHDVYVRYSDLYEAFVMHAIPEPKIIQGITIYPEWMLADDWMWGEFHPVKDEIKWTQTTS |
| Ga0209300_100021431 | F039641 | N/A | MIPAAYSNALKNAIQAYSYADRVAIWRTVNQSDGIGGISQHWVQVAEIRATISNTGDTEGIVGGMIEQSGTWTLTCSPDIEVRADDRIYVSGNPQALSPYYECIGSDYGHTNAVSQTIALRARTNG |
| Ga0209300_100021432 | F050378 | GAGG | MSPEMWVQIGIQAFITIMSIGAAWVALQVRLTRLETQVAHIISTLDGQQQEVRRIEQRLGKLENKVSALEAIIQR |
| Ga0209300_100021433 | F001733 | GGAGG | MNSISIKRLVVVVIVAFVAAFTSVFGDGIRTSEAHDISELGAVLALYGSKAVAAGVTAAMSSVLAFLTMPFKGVEANSLKVGK |
| Ga0209300_100021439 | F031872 | GGA | MVESLVVDEWIYDTLTADATLQGLLAVDNRAPSYQQGIYLYYAPEKDPISLRQPQVPYIVVRHLDAGQTDTTSVCGGRIVTTSSHQVWCWDTQSGAVSMARIKGIVDRIDTLLNKQSVDTTTPVFFLNRASVSSSIDVSQDGRVDNGISQVYVATITP |
| Ga0209300_100021441 | F033435 | AGGA | LSSIFDNIPKLEGRPNHTVDIERFIGAPGSFTFREPKASDLFPRPEVEKMLKIAFPEFPAQMLQILMIMARCYVTQPGDGEINPARRFAQLARDRSDIYLFVVGQFAAAFPINIEEAVDEVPND |
| Ga0209300_10002145 | F042901 | GGA | MARKQSDNKDITAGPVRVAEKPEGLLWLLKASEHEILERLHQDGALIYIHPALDGVVSYRIEENPAHDQKVVHVWR |
| ⦗Top⦘ |