| Basic Information | |
|---|---|
| Taxon OID | 3300005533 Open in IMG/M |
| Scaffold ID | Ga0070734_10000011 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 658267 |
| Total Scaffold Genes | 628 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 474 (75.48%) |
| Novel Protein Genes | 12 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 9 (75.00%) |
| Associated Families | 12 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000655 | Metagenome / Metatranscriptome | 958 | Y |
| F000725 | Metagenome / Metatranscriptome | 918 | Y |
| F000771 | Metagenome / Metatranscriptome | 897 | Y |
| F000949 | Metagenome / Metatranscriptome | 823 | Y |
| F001520 | Metagenome / Metatranscriptome | 679 | Y |
| F002926 | Metagenome / Metatranscriptome | 520 | Y |
| F003445 | Metagenome / Metatranscriptome | 486 | Y |
| F005420 | Metagenome / Metatranscriptome | 401 | Y |
| F014777 | Metagenome / Metatranscriptome | 260 | Y |
| F050313 | Metagenome / Metatranscriptome | 145 | Y |
| F059309 | Metagenome | 134 | Y |
| F087516 | Metagenome / Metatranscriptome | 110 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0070734_10000011201 | F000771 | GAGG | MKKVLSVSALALSFVAGAALSAQVRDWHDLDGVHKHVNEAIREMERARAANHYDMDGHGVKAEEHLRAAERELDMAVQAARR* |
| Ga0070734_10000011228 | F003445 | GGAGG | MAFPENEPTIQRGDPTIQLIDLLVARLEECLGEVLPLETDDLLKDYAQNARNSTASAIEQLRLARARKEQQLGGRTK* |
| Ga0070734_10000011230 | F000655 | N/A | MIWWFLILGVSTLVVVCVAIALFVHLRRHLHKAHDGTTDMERQTNQLKS* |
| Ga0070734_10000011299 | F005420 | N/A | MRCGIGLALGIVLLVPALRAQAPEPTIYVESFRKGATHIAEDKFEAHLSPADANYRERIKDAAGNDRYELTISPQGPAGDNKITSWRVQLRDLRHNLYSNLLVADQQPSEDAKDNLWWLNPNPFGPVPLHARRIVKVDGFYVIFHVKDIHFTPLDSPYVDSLVVNFEVTNTAPKISH* |
| Ga0070734_10000011357 | F059309 | N/A | VATFRSKADMVTKRYVVIGTAWLLHATSWFLPAIKGFLGSRLDHSLPGWEVFLSQTCALRPCGTESADPWYGTAISAAGVVTTVLFVLASPGIVWRGSRRLRRVAAFAATAAFVVNCLWYVFYVPDRSDLGVGYFLWCSSFGLLAIGLFLLAGSNNELESTRQQSVLT* |
| Ga0070734_10000011375 | F014777 | AGG | MSAGNKTLIQILAASFALCVTASAQCPLNTLIVKGHVVENANAHSKVRVQLVYPKEKPGEAGEATVEDGSFQIPIEFVTMQSSIFTNLPKRCGRKPKTVVITLLENDQQSDQVFLDFLKNFKMTDPSAYALRSEVVLNGPH* |
| Ga0070734_10000011382 | F087516 | AGGAG | LTFCLANATGQAKVLTNDPLTGLPLIPATVLFKNVGNEPDKLSDVQVCKSKGQGNFYSLSNIMNPASGLKMDAAAAWYASHLSGFKKVQGYESGRSQIAFYNSDKTIVILLTGQLGAEGENANAYGVSYERFQPGISEKAIGPNTGQVHLQLIGAERC* |
| Ga0070734_10000011390 | F050313 | AGG | LDAIPNGFKVAPVNRTLTHAALLLLVSPVMVCQTQGHAPDCSTLKYSRHKVSCLCGTVQVCSGDICGPPSVYELDDDIAVELRAKNGTILDTQKLVVETREMQGATQDGTKTSYKQTERRFCFEGKQDGDYVLAFVLHKKGIPQPAVIFPTNYAHKRRKSCDSIYMVEPLCPR* |
| Ga0070734_10000011465 | F002926 | AGGAG | MSRAKVTSGMWRIDPKKWTRVVDRGGVPVASAEGKADGNANASLIAAAPDMFELLESLEDNKLIPEDVVDEIRMVLRRARGELGLAVFSRNTHRILRGRTREWLKFMEGLWGSDSDQE* |
| Ga0070734_10000011503 | F000725 | GGA | MSRASKTLRAVQWALLASILLYVVVGEMIAPRATRIDQTLSYILSSLAVGIVGTIFVVRRTLVLPATASLSSSSEEELTLSQWKTGHIATYALCDALAIFGLLLRLRGSSLQQSLLFYVGGFVLLMFFRPNVPAEPTAT* |
| Ga0070734_10000011508 | F000949 | AGCAGG | MSGQDATRSDIWPYCLAAACGIGTGIADVAIDDLLFTALLVLASCMLLGLLRARWPWRWVVAVGAFVPLTELVAYLVLTVKPTRAQIYGSFLAFLPGIAGAYGGAVMRRVVDDLRAGK* |
| Ga0070734_1000001189 | F001520 | AGGAG | VNAIKFLVAAYIATWAIHFFYMGTLVTRFRRLQKQRKELGKE* |
| ⦗Top⦘ |