Basic Information | |
---|---|
Taxon OID | 3300027869 |
Scaffold ID | Ga0209579_10000014 |
Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) |
Source Dataset Category | Metagenome |
Source Dataset Use Policy | Open |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Sequencing Status | Permanent Draft |
Scaffold Components | |
---|---|
Scaffold Length (bps) | 540412 |
Total Scaffold Genes | 472 |
Total Scaffold Genes with Ribosome Binding Sites (RBS) | 319 (67.58%) |
Novel Protein Genes | 10 |
Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (80.00%) |
Associated Families | 10 |
Taxonomy | |
---|---|
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |
Source Dataset Ecosystem |
---|
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
Source Dataset Sampling Location | |
---|---|
Location Name | USA: Pennsylvania, Centralia |
Latitude (°) | 40.7999 |
Longitude (°) | -76.3402 |
Altitude (m) | |
Depth (m) | |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000508 | Metagenome / Metatranscriptome | 1069 | Y |
F000595 | Metagenome / Metatranscriptome | 999 | Y |
F000822 | Metagenome / Metatranscriptome | 876 | Y |
F000969 | Metagenome / Metatranscriptome | 818 | Y |
F001431 | Metagenome / Metatranscriptome | 696 | Y |
F002981 | Metagenome / Metatranscriptome | 516 | Y |
F003708 | Metagenome / Metatranscriptome | 473 | Y |
F011701 | Metagenome / Metatranscriptome | 288 | Y |
F048481 | Metagenome / Metatranscriptome | 148 | Y |
F089720 | Metagenome / Metatranscriptome | 108 | Y |
Protein ID | Family | RBS | Sequence |
---|---|---|---|
Ga0209579_10000014113 | F001431 | AGG | MWSIQDVSPEELAMLFHHYQGALAHGCDEGEECRAAWERTPQGERKRLVAAARLALLELAAAATHRDAERPYFAKPGEAEWGC |
Ga0209579_10000014122 | F011701 | AGGAGG | MRSLRLSRAWLFCALLTSLLLVTVGASAKNGRYFTGYYNLSGVQEQGDLIQVTLHLRLFNHSNSDLSSVVVTLLDSAPALNFRGNYEPVKVWKKQQFIEMSQEFTVPKLEFQQWTQAPAQPNLVILFQDSTGKTWQEGAQLSRREIAK |
Ga0209579_1000001417 | F000822 | GGAGG | MQRILKLCAATILAAAGAAHGQEAKVFHGEVSDSQCALNVHSLTRSHQEMLKSKSMGGTANTCAVYCVEHMGGYLVLSAGKDVFRLDRADLVHGFEGQRVKITGTLDTKLNLIHVLKIDLDEKE |
Ga0209579_10000014263 | F000508 | AGGGGG | MTNLNKVVTSLQNEYARLEKEMGRVGKALDALGHAGGKKLKKTGRILSKEARKRIADAQRLRWAKVRKQAAKLVKP |
Ga0209579_10000014298 | F003708 | AGGA | MKISTILIVIAVLLSFAVLSLHTRATPLGAASQADSNAPGFMQTLIEPASSVGPLKLGDSEEHALELFPKKDEDQRWEDSCGTTLDWIDATNPTGRGDLFIRLKKDKVFQIESATTRFHTAEGITTFDPPEKVARVYKDLRAYTLLTPPLSSLGDRPLVFWIDKKKGIAFTFAYYPDQHKRYLYKITVFGSNKTFCPEQETMNSPKWQSIPSYSLEPPPEMAPNRE |
Ga0209579_10000014334 | F048481 | N/A | MGLPPEGALNMAAWDMIRDLVSSEPSLNEAAWESLRGVIGLLGSGEEPESTGRYRRAG |
Ga0209579_10000014358 | F000595 | AGGAG | MEGLTGLAAIVMIFGMPTAAIAMYTFYRVRKLRTEERLAAIQRGVNVPMEPDLSEAAHSRRYGILLIAGAVGYMLTFTILARYEPDAMMAGAFGAIPFALGLGFFLDSALIRRDARAA |
Ga0209579_10000014412 | F000969 | GAGG | MINDDVVKCPLCGGFTHIEKADLRDALSNPRLREQVERYITELLKSPVEELSFVGATQAGRDFQKDVHSWNPCVPMWRRSPKE |
Ga0209579_10000014416 | F089720 | GAGG | VITPQDGLPTAEVCSQIKQFGYAMSQKIRIYGEEFEVLSDPFASDGGIAIHVRSKRTSQTRVLQLPATIVHRVRQDMTRIA |
Ga0209579_10000014430 | F002981 | N/A | VPAGPTINIGEEYGTAKKNLPPAKIVLIAVAAVIVVVIFASFLKRAKPQASGSLDNVAAAEIPGQNSTMVALTFTLHNTSDKILYVHNIQASVKAPDGDATADAVPAVDFDRYFQAFPALKVGAQPAIPPETKIQPGETVSSSVIVVFPKTLDAFNHRQSVSVIIWPYDQQLPVTMTK |