| Basic Information | |
|---|---|
| Taxon OID | 3300027867 Open in IMG/M |
| Scaffold ID | Ga0209167_10000012 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 277103 |
| Total Scaffold Genes | 263 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 205 (77.95%) |
| Novel Protein Genes | 9 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (88.89%) |
| Associated Families | 9 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000208 | Metagenome / Metatranscriptome | 1593 | Y |
| F001742 | Metagenome / Metatranscriptome | 643 | Y |
| F002102 | Metagenome / Metatranscriptome | 593 | Y |
| F002560 | Metagenome / Metatranscriptome | 548 | Y |
| F002814 | Metagenome / Metatranscriptome | 528 | Y |
| F003000 | Metagenome / Metatranscriptome | 514 | Y |
| F003851 | Metagenome / Metatranscriptome | 465 | Y |
| F004412 | Metagenome / Metatranscriptome | 439 | Y |
| F090964 | Metagenome / Metatranscriptome | 108 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209167_10000012210 | F002814 | GGA | MKVNYGKRTLLNFAMTKARMLTLLIVIAHWAVAVLHLFLAAKVLSAPNNSVSWLAITLITFGHLCVCIAVWKFSDKLAGLVSVIFFFAALGADLYEHFLHASANNIFMVAPTGDWTIGFDASVFFLLGFEILGCLLGILLLGRRTRNSSQPKFANHSVLS |
| Ga0209167_10000012241 | F090964 | AGGAGG | MAGRWKPRKEDRTDAEAVRREQQFQQDLSDILEYGTEDDFVAALKNYKPDIGKEELQESIMQFRVYVREKRGLC |
| Ga0209167_1000001240 | F000208 | GAGG | MRIRVTTKGHLFEKRKQQHIERGYRIEDERPIPVNGLCSFIAVSEIPESDGVGDLVAQALNGMAR |
| Ga0209167_1000001241 | F002560 | AGGAGG | MVEHNPPPSADSIRPPAIPLPIALYVSGELIFELYDGGQPSRKFSIRWDRLVPTKENEPEAQSPQASLRTRQD |
| Ga0209167_1000001244 | F004412 | GGAG | MQDQPSNAEPALPPEQAAAKAAILSECERLAEAGITFVAVHFDGSGDEGVNEDVKCYATEDYAYEESEAQATNFSHLQEHFDALVPYGYENNCGGFGDVILNVKTRKITVERNDRFEDYTTSSYEV |
| Ga0209167_1000001247 | F001742 | GAGG | MPDTETGNRGRNRQKIPRIGPEEAARVFAQCEHLLDSSTVFVENLRQAGFARFLAPLHQQILELAVQPDSWGERTRARLISELFAGLKGIPAEDARVEEIVQVSNVVVPCFLLELGRRRQHIEIEFPANPCDSAARFALRAGPSYPVHSVNSEQLVRLVAEAGEELVGLCYFGDQRSREHIEAQLAFTGATTDS |
| Ga0209167_1000001254 | F002102 | AGGAG | MSKDEKENRSQPRSGALDKFEPGGESGPLNDRNEYQQRRNSLPAEQKELAEESTRFADLCQYFSQQKMDIPPSIVERVGNVSKLPAPGRIRAMKDINRELMEYLNDVGQDPGIRQ |
| Ga0209167_1000001257 | F003000 | N/A | MKANAQLGEMHDRLIECYTQNPKEGFCDALLRLVEDPLKPRDEDGRFRVNPILLLLTLVLALALGTFLLLSFVQS |
| Ga0209167_100000126 | F003851 | AGGGGG | MKRRRLAAATCRCIDEETKRVLRALEEAAHGRANTNDPIMERLRQCPEAFKVGEVLGKAFDSLRYAQDGSEIGDRYLPPSGTVIPPLV |
| ⦗Top⦘ |