| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300034683 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0118431 | Gp0324337 | Ga0310147 |
| Sample Name | Fracking water microbial communities from deep shales in Oklahoma, United States - K-7-3 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 389110958 |
| Sequencing Scaffolds | 9 |
| Novel Protein Genes | 15 |
| Associated Families | 8 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria | 2 |
| Not Available | 3 |
| All Organisms → cellular organisms → Bacteria → Synergistetes → Synergistia → Synergistales → Synergistaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 3 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Subsurface Microbial Communities From Deep Shales In Ohio And West Virginia, Usa |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Fracking Water → Subsurface Microbial Communities From Deep Shales In Ohio And West Virginia, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → planetary subsurface zone → fracking liquid |
| Earth Microbiome Project Ontology (EMPO) | Unclassified |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Oklahoma | |||||||
| Coordinates | Lat. (o) | 35.812 | Long. (o) | -98.262 | Alt. (m) | N/A | Depth (m) | 2943 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F022008 | Metagenome / Metatranscriptome | 216 | Y |
| F028664 | Metagenome | 191 | Y |
| F033769 | Metagenome / Metatranscriptome | 176 | N |
| F075996 | Metagenome / Metatranscriptome | 118 | N |
| F077200 | Metagenome / Metatranscriptome | 117 | N |
| F082560 | Metagenome | 113 | N |
| F088079 | Metagenome / Metatranscriptome | 109 | N |
| F103496 | Metagenome / Metatranscriptome | 101 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0310147_000039 | All Organisms → cellular organisms → Bacteria | 148369 | Open in IMG/M |
| Ga0310147_001095 | Not Available | 25127 | Open in IMG/M |
| Ga0310147_004430 | All Organisms → cellular organisms → Bacteria → Synergistetes → Synergistia → Synergistales → Synergistaceae | 9830 | Open in IMG/M |
| Ga0310147_029113 | All Organisms → cellular organisms → Bacteria | 2218 | Open in IMG/M |
| Ga0310147_037228 | Not Available | 1800 | Open in IMG/M |
| Ga0310147_067039 | Not Available | 1070 | Open in IMG/M |
| Ga0310147_081359 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 898 | Open in IMG/M |
| Ga0310147_128161 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 588 | Open in IMG/M |
| Ga0310147_133461 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 566 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0310147_000039 | Ga0310147_000039_1653_2333 | F075996 | MLNSSTGQAVGVLLPYWSYIPTDRISPKGLLYPVSQIMKEGNLLRFVMLVGGAKEEVSMAGKDFFWSDVNKMYLPEYCLIDITAKFRDRICTFTGLYAGKDGLFVNGKKAGSFKDISSIFAHYGLSEVWNLITGDIRPSCGVYRYGRLFVPETEVCLGLISERANFPKGYDMKDLGRVERILEDAGVFEIPEEISLADVEEKLSKFAEVEKILDGFADRDEETLLR |
| Ga0310147_000039 | Ga0310147_000039_52673_53434 | F077200 | MVKFTVIEVKTQPNLKRIEEYAKRYGMTTEEALSKKRIGEDYFPIRDQQVVSVGLLNFVSEKEDSVGIMAAIYTGEEKKVLEETAKKLEKVVKATGKPFFITGDGRKYALELLAGRAMAHMIEAKRKDQEIIPEMKEMIKLLSSTKNGYLKPFDVRDSIDLQAAFGLGTDRTPAPDSLRYENKDLPALAEETKRSVLDMMTNYAAYLESQGEKIKPVVYKLNEKIFKTVEIFKFPEEEPDMEKGEINIVDIDR |
| Ga0310147_001095 | Ga0310147_001095_2744_3097 | F022008 | MTLSEIEMLQSALNELSKQLQTLSDENRSSFGHIYDRLTRIETQMTERECRFKKYEKALENHEVRIRTVEGELGTLRDVPERLWRVSMSNSKLTGMIAAAGGMGGLVATLISKILGE |
| Ga0310147_001095 | Ga0310147_001095_3099_3323 | F103496 | MPLDDRIRKLPLTKLIALYFTVFTCVLIVLDVTTLCPLTDKSADLLKWLGSTIIVGYFGKSAYEHKMNTTSGVK |
| Ga0310147_001095 | Ga0310147_001095_342_611 | F033769 | MSAINCGVKTATYTPSTVGVAVGRKYLRLTNESLDMRCRAGVSNISSTAGIIIEPGQTVEFRPLPSENREIYVMSEGKAVKLAYYEVIV |
| Ga0310147_001095 | Ga0310147_001095_4971_5363 | F082560 | MTEKAIVKKIQEWFKAKGGVCHKVHGGPLSAGFPDLIGCIEANAWVVEVKVPNAKPRVPKELRERLPKEMREWMEKGATMLQAKTLYDWQCAGAIAMVATSVEDVERRYQEEYFGSNIGGKHYGGFKRTQ |
| Ga0310147_004430 | Ga0310147_004430_5981_6334 | F022008 | MTLSDIEILQSTLRDLSKQVQKLSEENRVSFNHIHDRLTRIETQMSERECQFKRYEKQLENHEVRIRTVEGELGTLRDVPERLWRVSMSNSKLTGMIAAAGGVGGLVATIIVKMLGD |
| Ga0310147_004430 | Ga0310147_004430_6336_6563 | F103496 | MPLDDRIRKLPLTKLIALCFTVFTCVLIVLDVTTLCPLTDKSADLLKWLGSTIIVGYFGKSAYEHKINTTSGMKQ |
| Ga0310147_004430 | Ga0310147_004430_8207_8650 | F082560 | MYAVIWRAWISLDGGIGMTEKAIVKKIQEWFKAKGGVCHKVHGGPLSAGFPDLIGCIEADAWVVEVKVPNAKPRVPKELRERSPKEMQEWMEKGATMLQAKTLYDWQCAGAIAMVATSVEDVERRYQEEYFGFSIGGKHYDRFRRTQ |
| Ga0310147_029113 | Ga0310147_029113_1795_2187 | F082560 | MTEKAIVKKIQEWFKAKGGVCHKVHGGPLSAGFPDLIGCIEANAWVVEVKVPNAKPRVPKELRERLPQEMREWMEKGATMLQAKTLYDWQCAGAIAMVATSVEDVERRYQEEYFGSNIGGKHYGGFKRTQ |
| Ga0310147_037228 | Ga0310147_037228_800_1192 | F082560 | MTEKAIVKKIQEWFKAKGGVCHKVHGGPLSAGFPDLIGCIEADAWVVEVKVPNAKPRVPKELREGLPKEMREWMEKGATMLQAKTLYDWQCAGAIAMVATSVEDVERRYQEEYFGSNIGGKHYGGFKRTQ |
| Ga0310147_067039 | Ga0310147_067039_464_859 | F028664 | MILIGGQALRNLGSDRYTEDIDFLIDNKESNQAFITGASVDYLNANANKFFKEIYNIEEGNEQATPQSLFELKAYAFVQHCQNFNFAKADACEYDLKFLVRNFDVKPNIVKRYISSGELSEINKIINSVKK |
| Ga0310147_081359 | Ga0310147_081359_1_225 | F088079 | MLTEKAIWLEKVPFRNVDGAIGGQNCPVYFVESMIVGQSCTFCTSILGNLAPNPYIMLTRLDVFIHICEIYKVDI |
| Ga0310147_128161 | Ga0310147_128161_244_504 | F088079 | MLTEKAIWLEKVPFRSIDEAIGGQNCSVYFVESMFVGQSCTFCTSRLGNLAPNPYRMLTRLDVFVHICEIYKADITFFKSNVVSTK |
| Ga0310147_133461 | Ga0310147_133461_336_566 | F088079 | MLTEKAIWLENVPFRSIDEAIGRQNCSVYFVESMIVGQSCTFCTSRLGNLAPNPYRMLTRLDVFIHICEIYKVDITF |
| ⦗Top⦘ |