| Basic Information | |
|---|---|
| Taxon OID | 3300027857 Open in IMG/M |
| Scaffold ID | Ga0209166_10000005 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 428464 |
| Total Scaffold Genes | 364 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 273 (75.00%) |
| Novel Protein Genes | 9 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 7 (77.78%) |
| Associated Families | 9 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000426 | Metagenome / Metatranscriptome | 1154 | Y |
| F000534 | Metagenome / Metatranscriptome | 1044 | Y |
| F001318 | Metagenome / Metatranscriptome | 724 | Y |
| F001511 | Metagenome / Metatranscriptome | 680 | Y |
| F002898 | Metagenome / Metatranscriptome | 522 | Y |
| F004902 | Metagenome / Metatranscriptome | 419 | Y |
| F005617 | Metagenome / Metatranscriptome | 395 | Y |
| F040900 | Metagenome / Metatranscriptome | 161 | Y |
| F060288 | Metagenome | 133 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209166_1000000511 | F002898 | GAGG | MADGEKQSRFRLLIEFFKRLLGRKAAPPGDPYAYAMAPIRRGPKGRSGAAVAEIEDDSFRSFPPRHGR |
| Ga0209166_10000005171 | F004902 | AGG | MKKPTIRVTKWLGDIPVEAGCSACPGVVFRAKGSSHRPNREEFQKSLQAQFDEHCKAVHL |
| Ga0209166_10000005192 | F000534 | AGGA | MPQEISVSYQAIKSKVYRLIDALVVGEKSEAEVAESVRRWWSLIHPADRPIAQKYLLMILGRSNSALDAMGAELLSVSGCEIALPLADPSLPSKRMRLVQRLVKESSVRTAV |
| Ga0209166_10000005193 | F001318 | GGAGG | MKVLAWMLAVPVLISLAGAQISPSLNRQSPSQEPSSQRSWHAADPGDRMFFPRDMFWGWAQFDLSPPHNEIDPNLCAGNAGQYGGANAPCSMFARYMLSGILEVRPFGRGPLRRFMVYGAPTFLFGKNVPQYLYTWSPDAIGIEHSWGAGIYLNKGFEFRVTQHFLFDRLGARNRSLGVADLGNNGPWGRYMSLGVRKTFGTRRW |
| Ga0209166_10000005197 | F040900 | N/A | MPAATILVMFYAGPAADDALKLLLLALIGGGLLVLGRWSSRLGGARSHDAFPDVPTPDVADLLPGSTPKLWPPSAEEVAASLPFDPALGKIRIKKFFFEKTDAIPGPTDRDVFADELHVELYDPDSDHSWWQSYFVATPQGLAKILHDKSWRYLHAPDVLVFSRYDLEEIRRAVVSRIMADHEYFKDKEQEEEEPL |
| Ga0209166_10000005221 | F000426 | AGGGGG | MKDIHEVLRQKQAKYAQLGKQIEMLQQAAEKLREVAPLLAESDEEDNVVLAEVDDAIGQTDSMAAKAGAGSGSAAAPKGSRPTAPRWP |
| Ga0209166_10000005250 | F005617 | AGAAG | MKVQIAAQAVTEPLTCEGETIVVNLHGALISTAVPLRVGMKIEVHVLLTGKRARAEVAYIDPDRPRLCGIGLEKPSNIWGVSLPPEDWIEMDPR |
| Ga0209166_10000005347 | F060288 | N/A | LDNRDRKVTPTNVVARGGTRIPCEIPVLLTNSDPQQRFTETCKIILANLRGCALRAPRPITTGTEVQLHGLPGKPQVAARVVNCISLGEFEKMWLLGLALNEAGNVWGIANVPDDWSQS |
| Ga0209166_100000058 | F001511 | GGAG | MFEDVDGMEVFVNPDRVIWIREYPNQTTVISCGHEDKFAVRLAPAHAVAALGKAIR |
| ⦗Top⦘ |