| Basic Information | |
|---|---|
| Taxon OID | 3300005538 Open in IMG/M |
| Scaffold ID | Ga0070731_10000104 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 157586 |
| Total Scaffold Genes | 155 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 136 (87.74%) |
| Novel Protein Genes | 8 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (100.00%) |
| Associated Families | 8 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F004485 | Metagenome / Metatranscriptome | 436 | Y |
| F005865 | Metagenome | 388 | Y |
| F007233 | Metagenome / Metatranscriptome | 355 | Y |
| F008958 | Metagenome / Metatranscriptome | 325 | Y |
| F033134 | Metagenome | 178 | Y |
| F037842 | Metagenome / Metatranscriptome | 167 | N |
| F044617 | Metagenome / Metatranscriptome | 154 | N |
| F065986 | Metagenome | 127 | N |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0070731_10000104121 | F004485 | AGGAGG | MSNRERYTLSPASAEQFCQFWSAHMDALERHLSAKTWAIAAFTVVLTAYPIARIATPAILHAVVPDVVRTVLNLIIR* |
| Ga0070731_10000104134 | F008958 | AGGAGG | VKLVRFEDINGRETFVNAERVNHISDYGSGTTEINFGGEKATVYVAMDVAKVASTLLKAAKT* |
| Ga0070731_1000010418 | F005865 | GAGG | MPAPEPGHEWVLSGVNPRSDSVAESELLKAARSAWPKALAHVRQVQSQSPFNDDGLLVAEVWESVLQSVSMTLERMNGRRGEIGDLEAYLLGSFYHRLNRALRKEQRREEMIRLVSSVEELEVLGAENQAEQQDIEGALHAQEILDRMDAWSRKVWVAREYGYTWKHIGRVLGIPGDSVMLRFRRRMKILRARLSGGR* |
| Ga0070731_1000010465 | F007233 | GGAG | MANRKQREHQRQLNQDTFSDGIKGKVLEAVSISNELECRYISLEFEDKTELTISLDTRLTGKLELFDCKSGDHRLVRKIGLIPDDAPVWRPKD* |
| Ga0070731_1000010466 | F033134 | AGGA | MKQEQESKQLVTFSSLHIEVLGPKELGNSQQAIRGVTEFLHRVELDVGKLLRERAEMSPILDGFRLRFG* |
| Ga0070731_1000010467 | F037842 | AGGAGG | LKQKTVKVRGEGVEYPALTGKVIRQVRFVNDEDYTALTIEFDDSTLVSFRFKASIAFSIDPELSTVKAGNIVKWRELKTRPVMRQVARR* |
| Ga0070731_1000010469 | F065986 | GGA | MSNIAKRQGRMIRKGKEVQYGSVRIELIGPTELLHDREAKGAAIDFLRTEILASKDWLKDCARISPSLKRFRLISIRK* |
| Ga0070731_1000010470 | F044617 | AGGAG | MSTSPQPSRQRPTVTAITAEERRKLDERVRDKHQRLRNRYPEIYGKVVDWIHHYIEDGTLYVCIRFQDKSDFALEFSPQIVRVGTELCDSATDDFDVIRNYYQEERE* |
| ⦗Top⦘ |