| Basic Information | |
|---|---|
| Taxon OID | 3300027857 Open in IMG/M |
| Scaffold ID | Ga0209166_10000190 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 79922 |
| Total Scaffold Genes | 101 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 82 (81.19%) |
| Novel Protein Genes | 10 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (80.00%) |
| Associated Families | 10 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000208 | Metagenome / Metatranscriptome | 1593 | Y |
| F001742 | Metagenome / Metatranscriptome | 643 | Y |
| F002560 | Metagenome / Metatranscriptome | 548 | Y |
| F003202 | Metagenome / Metatranscriptome | 501 | Y |
| F003851 | Metagenome / Metatranscriptome | 465 | Y |
| F004412 | Metagenome / Metatranscriptome | 439 | Y |
| F006318 | Metagenome / Metatranscriptome | 376 | Y |
| F015496 | Metagenome / Metatranscriptome | 254 | Y |
| F041893 | Metagenome / Metatranscriptome | 159 | Y |
| F064394 | Metagenome / Metatranscriptome | 128 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209166_1000019017 | F003202 | GGAGG | MTLRNLAMAILITLVLAAGVGLHNYESARNAAVVRQLSDQLEQTKSELVDATTQLSEANKKLGFLESSKARVQVTAYALTEDFGPDPVFSNNAPAKSAYAVPKHTLPAEKVLNVALSPMAERKLHASLNDTIVLMSGDRARNHLARFVDRTAQTETRSVVDILFADAHEARIWGRRSFFAVNISRPDSPFQQ |
| Ga0209166_1000019043 | F015496 | GGAGG | VYNTDLVRKLCCEIAQEKDPEKAQDLMSLLQSVMKDDQEDIRIRMAFLAKKYAFVSDSKAAD |
| Ga0209166_100001905 | F003851 | GAGG | MNRRKLAATSSRTSDKNAKRVLRALEDATHGHAIWNDPIVERLKQCPEAFKVGEILGKAFDSLKYAQDGGEVGDRYLPPAGTVIPPLV |
| Ga0209166_1000019068 | F041893 | N/A | MSSAPNRSQARRGTRIPCEIPITLTSLDPIHPFSEPCLAILVNPQGCAVRFRRPLEIGAAVRLEGLPARTNVTARVVNSISIGEYEQFWLLGLALDEPGNVWDIKTPPEDWAR |
| Ga0209166_1000019075 | F000208 | GAGG | MRIRVTTRADLFEERKQQHIERGYRIEDQRPIPVNGFCSFIAVSEIPDSDRVGDLVAQALNGMAR |
| Ga0209166_1000019076 | F002560 | AGGAGG | MVEHNPPPSADSARPPVIPLPVALYVSDELIFELYDGGQPCRSFSIRWERLVPAMENEPEAAPQSSQTGLRTHQD |
| Ga0209166_1000019080 | F004412 | AGGA | MENEPSNAEQALPPEQAAAKAAILAECDRLTEAGITFVAVHFDGSGDEGVNEDIKCYATEDYAHEKSEVQVANFSNLQEHFETLVPYGYEDGCGGFGDVILNLKTRKITVERNDRFEDYTTSSYEV |
| Ga0209166_1000019083 | F001742 | GAGG | MPDTETGDRRRDRQKIPRIGPEEAARVFAQCEHLLDRSPVFVENLRQVGFARFLAPLHEDILKLAVQPDSWGERTRARLISELFAGLKGIPAEDAPVEEIVRVSNVVVPCFLLELGRRRQHIEINFPANPCDSAARFALRAGRSYPAHSVNSEQLVRLVAEAGEELVGLCYFGDQRSREHIEAQLTFESPTTDS |
| Ga0209166_1000019094 | F064394 | N/A | MKHQEQIATVARLRICHAKDQEERLVDALQRWIEDPIKPKTDKGTLRINPILVLLAAVAVLAGGTFLFFSLVQL |
| Ga0209166_1000019099 | F006318 | AGGAGG | MWRQPPIWEPQGTKFRPYSIPTLIFFYIGKFFGRLFGHLYGARH |
| ⦗Top⦘ |