Basic Information | |
---|---|
Taxon OID | 3300005529 |
Scaffold ID | Ga0070741_10000002 |
Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 |
Source Dataset Category | Metagenome |
Source Dataset Use Policy | Open |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Sequencing Status | Permanent Draft |
Scaffold Components | |
---|---|
Scaffold Length (bps) | 1190824 |
Total Scaffold Genes | 1062 |
Total Scaffold Genes with Ribosome Binding Sites (RBS) | 754 (71.00%) |
Novel Protein Genes | 11 |
Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (72.73%) |
Associated Families | 11 |
Taxonomy | |
---|---|
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |
Source Dataset Ecosystem |
---|
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
Source Dataset Sampling Location | |
---|---|
Location Name | USA: Pennsylvania, Centralia |
Latitude (°) | 40.7999 |
Longitude (°) | -76.3402 |
Altitude (m) | |
Depth (m) | |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000072 | Metagenome / Metatranscriptome | 2651 | Y |
F000793 | Metagenome / Metatranscriptome | 889 | Y |
F003242 | Metagenome / Metatranscriptome | 498 | Y |
F003894 | Metagenome / Metatranscriptome | 463 | Y |
F004194 | Metagenome / Metatranscriptome | 449 | Y |
F004531 | Metagenome / Metatranscriptome | 434 | Y |
F005716 | Metagenome / Metatranscriptome | 392 | Y |
F006572 | Metagenome / Metatranscriptome | 370 | Y |
F012297 | Metagenome / Metatranscriptome | 282 | Y |
F030722 | Metagenome | 184 | Y |
F045172 | Metagenome | 153 | Y |
Protein ID | Family | RBS | Sequence |
---|---|---|---|
Ga0070741_1000000211 | F006572 | GGA | MTHFPQCNPAHRPTRIKLQTTPALIRLNDGHRAKGKLQVVSTTGGLLQLANAIAEGDFVEVAFQAQSGMVNGMAEMLNPVHKSPGSVFQPFRFIALADDDQDHLRRLVQDCGDQSFRGLRSSKWNHET* |
Ga0070741_10000002130 | F000072 | GGA | MDVERTISDIERLERLFATPDTRPLTATDISAANRRHDEKQANSPWFRLWQAYGLCCRTDAPILQLPEGRR* |
Ga0070741_10000002192 | F030722 | N/A | MKLLGFLLLLSGWAIVIAALILLHGGAVSAFLIAGVAVEVVGLVLVARAHLPA* |
Ga0070741_10000002244 | F004531 | AGGAG | MGQEISVSYQAVKSKVYRLIDSLVEDAKNQDDVQESVKRWWKHIHPADRPIARKHLLSVLSKSSASLEAISGGLLDLQDFEIHQVHTDAPRIHTMPHPHVERMKASV* |
Ga0070741_1000000225 | F012297 | GGAGG | MNSMHYLYAAYAATWVIHGVYLAILTRKYAKLRREMEGLKKS* |
Ga0070741_10000002467 | F003242 | GGA | MWLEKLTSGVLRVLTPLGPRYLNPSFAQRLYLIWIFRHFQTLPVKVLNSRQRHMIESMCANNQFVPLSVGFDEAPVLGTLEQRPPVPPPSLPRRPTSSVRDSVSPLAADVQR* |
Ga0070741_10000002564 | F045172 | N/A | MLWRQKLKTLSFALLTMSIVSAMLVAGADAQSLTSAPHPSAEIVALTEALEGRWVINVKFEPNSSATSGLAYTGEETWRPGPGGFTLLEEEHMPTPEGDLYLLGILWWNTATKSFHGMECQNLLPYTCDVKGAQNDITMAWNGKEFVIDEIETSKSGKKSVWHEVWSDITPDSFTQTGEYGDPGGPRKRLFTIHATKVGNSQIKNDQSQSNISEPAPEMQSLAKALKGNWSTTYEFAPGGISRSGGTGTGEENWKAGPGGYVLIEEEHVHTPSEEMFLIAFHWWDSTTNSLRGMLCNNSGPAACDFNTYSNSSLNWDGKKLTIDLEFPQAAKKMTWHEVWSSITSRSFTQTGDMGELGGPLKRAVTIHGLKEQ* |
Ga0070741_1000000270 | F005716 | N/A | VGHPQILNKGVNQLQFKVKEQDYFLAFVEQERRWYVFAPTAQGVHRIPVYVDAEKYGSLATLGNETTLSS* |
Ga0070741_10000002809 | F003894 | AGGAG | MQHTNIVLLQSDPKIAQTLAALLSNSFHRVHVAGSVDELRHAAAKHHPSAIVLDLESAPITEVEALTREFEGVRVICNHRVPDEEMWTRTLSLGAEDCCPSSDTRAILSAATRAEQLSRHMAA* |
Ga0070741_10000002891 | F000793 | GGAGG | MHIPLIFRSTSFTLVVEYIPLIATVTTTILSIVLARATLRYTEATDKSLALAREEFERQWSPELHVKLEKIGTRQAKIVVTNLAKTSVLLQLMQMRRLSMGVPSLRSFMNEPLVGGCIWTEELGKRFFACTGDDYEGQIATAVTFYASGRLFRTDWFRFQVQVSGGEIRRLDPVNIAARKVRVVEGAKESRRELVRDVVGVTSAVTAD* |
Ga0070741_10000002933 | F004194 | GAG | MKSPASLVARILGAALIVLAGIPAADGELHWHSVHKHEIEVRLIALAEAYPRSSVFANDEVFVAEQELSKEESRFIKLIYDFLPYQPSLSDSGLDYSYVHKVIAVRDTSCDENLWQMRSLMQQRSQAKQPNSSWKYAEESPISDLDRRQARLRCYRTSSDDYEQALHEPTAETPY* |