| Basic Information | |
|---|---|
| Taxon OID | 3300027869 |
| Scaffold ID | Ga0209579_10000107 |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |

| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 147663 |
| Total Scaffold Genes | 135 |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 84 (62.22%) |
| Novel Protein Genes | 8 |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 5 (62.50%) |
| Associated Families | 8 |

| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |

| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |

| Source Dataset Sampling Location | |
|---|---|
| Location Name | USA: Pennsylvania, Centralia |
| Latitude (°) | 40.7999 |
| Longitude (°) | -76.3402 |
| Altitude (m) | |
| Depth (m) | |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000131 | Metagenome / Metatranscriptome | 1986 | Y |
| F000136 | Metagenome / Metatranscriptome | 1961 | Y |
| F000882 | Metagenome / Metatranscriptome | 849 | Y |
| F000883 | Metagenome / Metatranscriptome | 849 | Y |
| F000912 | Metagenome / Metatranscriptome | 839 | Y |
| F002831 | Metagenome / Metatranscriptome | 527 | Y |
| F007996 | Metagenome / Metatranscriptome | 341 | Y |
| F040501 | Metagenome / Metatranscriptome | 161 | Y |

| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209579_10000107106 | F000136 | GAGG | MAKAKLRILYGEGDDEILKQHSTAIEKAGHIVQQAAGRKAVQEALNKSVFDLVLLGPTLTRNDRHHLPYMVKKASAETSVLVMHADGTRHPYVDACTDTGASLETVLQRIEAMKIAGMMPAAAGAAAGR |
| Ga0209579_10000107122 | F000131 | GGAG | MDTIRRQRLLVHFSIILASSCILLPLPLHAEPLSLQGLVTPSTVILKDGHPITFAVHGFIEFQSLSELFPYIESQSQRWKSSPEIDDAARRRLAHDLLRRGIESRIISMTDERPLETLITHTREELQQALAEVKEPTPPGYAEAFLSVQEKWKHSLNCWSAAPSIPARVLSNWYPIDEGISLYGATYDSTEHFWQSVKYHPDVTVGDLTDLLSMIEHTDWTPWLQRLDDSPKIYLPNAYAIEFLRHNLAAERLRWFREELTRQGAQANDHARTIQQRGATAFRFTAYEEKVLWGDLADLFHLVYIFSAPADPIRRELKARHFDGIYLENKKMGFISEEFRSLMLEIWKVKYLEMPRFGEVIRSIPIEIHLSHFLNDGDSPDIPIPVYVSYLNQIRDLARAKAAAHSHRAK |
| Ga0209579_10000107133 | F007996 | GAGG | MRNAVLVILAGGLLVMGGYIAGAGKTAVVHAGTPMPMTSAVPKSYGKLVAAIPDSIGTGLIFEGNDGTIRFVSMTGMKEGELARFDQTPTHGGIPKSYGHLVSAVVNNGSTGLVFEDANGEIRLVTIAGVTEAELTRN |
| Ga0209579_1000010715 | F000882 | N/A | MAQAAGGKEILSVAVWEPVPDMEAASLATIRELTSIVERKGYGRDLLYRDRDAHYVFLRYWNSEEARRAAQEDPDMLRCWAKLGNEIQIVKVYETLTEIPTNPVA |
| Ga0209579_1000010741 | F040501 | N/A | MGQTVPRARLIWIGGGYAAVVAVSAALIFARYVAYVTHPADVAAYGGMWAGGDLALEVLVAGMLLVVTFLAALAIFKYEEAYTTYSKIMVALSLTFPASVGIIAISAISQSDSMLGWVCLFRVFASPFVLMGFGMSRAFARFPVAKRMTNYAMVIEGLTLVFLVVMLFLPFRLHRG |
| Ga0209579_1000010761 | F002831 | N/A | VPDTPPEDDHKHHRHAVYASETTGLLLIAVTLLILTLVRYWHTIHWSLR |
| Ga0209579_1000010773 | F000912 | GGA | MSLDAFVRCTCIRDGRAKPHPFPDRLMWDETGSPSLSGDPTDEEWEAHDTWVQQSCEHEGYLVSEFLGNITRAQHVREFLRGLQGNPGPKFPILLKKVVYDGTHTGDWIPVKESPALLREVDLVLGSSDILTESEKEFFDSMKRLCEASIATGNPILF |
| Ga0209579_1000010784 | F000883 | AGGAG | MSGQVVLYFEDQADALRFALAAGSVMAGEGGKVTDNLVEETTRVSRIRLDAVNAGGSKKSNPVRAA |
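
The RBS column of the protein table can be cross-checked against the summary figure for novel protein genes with ribosome binding sites, 5 (62.50%). The following is a minimal Python sketch, not part of the IMG/M record: it assumes the eight RBS values are copied by hand from the table and that "N/A" marks a gene with no detected RBS. The names `rbs_by_protein` and `rbs_summary` are illustrative only.

```python
# RBS motifs copied from the protein table, keyed by protein ID.
# Assumption: "N/A" means that no ribosome binding site was detected.
rbs_by_protein = {
    "Ga0209579_10000107106": "GAGG",
    "Ga0209579_10000107122": "GGAG",
    "Ga0209579_10000107133": "GAGG",
    "Ga0209579_1000010715": "N/A",
    "Ga0209579_1000010741": "N/A",
    "Ga0209579_1000010761": "N/A",
    "Ga0209579_1000010773": "GGA",
    "Ga0209579_1000010784": "AGGAG",
}


def rbs_summary(rbs_map):
    """Return (genes with RBS, total genes, percentage with RBS)."""
    total = len(rbs_map)
    with_rbs = sum(1 for motif in rbs_map.values() if motif != "N/A")
    return with_rbs, total, 100.0 * with_rbs / total


with_rbs, total, pct = rbs_summary(rbs_by_protein)
print(f"{with_rbs} of {total} ({pct:.2f}%)")  # prints "5 of 8 (62.50%)"
```

Under these assumptions the count agrees with the figure in the Scaffold Components table.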