| Basic Information | |
|---|---|
| Taxon OID | 3300027869 Open in IMG/M |
| Scaffold ID | Ga0209579_10000004 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 890648 |
| Total Scaffold Genes | 980 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 657 (67.04%) |
| Novel Protein Genes | 10 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 8 (80.00%) |
| Associated Families | 10 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria | (Source: IMG/M) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F003606 | Metagenome / Metatranscriptome | 477 | Y |
| F005991 | Metagenome / Metatranscriptome | 384 | Y |
| F006152 | Metagenome / Metatranscriptome | 380 | Y |
| F009499 | Metagenome / Metatranscriptome | 317 | Y |
| F010182 | Metagenome / Metatranscriptome | 307 | Y |
| F015635 | Metagenome / Metatranscriptome | 253 | Y |
| F041385 | Metagenome / Metatranscriptome | 160 | Y |
| F052789 | Metagenome / Metatranscriptome | 142 | Y |
| F064123 | Metagenome / Metatranscriptome | 129 | Y |
| F064962 | Metagenome / Metatranscriptome | 128 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209579_10000004219 | F009499 | GGA | MRDDDRPPICPRCGVTMVPAALSAGGRHEGDWVCLECEERNEDED |
| Ga0209579_10000004241 | F041385 | AGGAGG | VRISTLVVVVGLIAAATAAIAMAAAGLPNLGPAKMTSEPVYKGYYDHHLDTYVLTDVSSKAQATAMHINFSAAIGKVKGLPLQYFVKGRAAAGQVAVFGSEPGESDYNPLWEEIWVTWKPGVKPVLLYRDDQINSLQSKGKLTETDAHIVLNAPILTVGK |
| Ga0209579_10000004300 | F006152 | N/A | MAERRPTNSELRLKRLTAQEPDREQRGILELFDDGGEELSDEDAPLARETDEHDG |
| Ga0209579_10000004383 | F064123 | AGGAGG | MARKTVFVSDLSGKEISNERDAVTITVKFGDARRGQYSVDAHPDDAEVKRLISAGTQQARRGRRPKAASS |
| Ga0209579_10000004408 | F010182 | AGGAG | MDVIGTDFLAGSILSLVLPVGLLVAVGIWWLTILRRRDRDV |
| Ga0209579_10000004464 | F015635 | GGA | VSQSTPLAGGATLLFSPRMQGTRGSSFQPVEAGAVLAGTVGMCGGAGALAGWAAGNAGYGALAGIVVGIPAGVYAVYHRFKGYFS |
| Ga0209579_10000004607 | F003606 | N/A | VRPRSANEWRDFWRDGGERDLHAQLGEFAPYSVRLATLLGSNAPLRALVAELGRIREHELRTPPDPDADAELAARIYAWFESASRAR |
| Ga0209579_10000004721 | F064962 | AGGAGG | MRRRTVLLAVAALGAALAAGGALSATTSVTITTPKAGQKVSLHQNPYLAVAGKVTFADTSAGTTRFFLRRDGCGTSADNPHLSLTAGNPDAGDGCGLIVDQVGLVGDAAPEAAFTDYPAVDGMPLAFDGTKPITGQVSLSGAQVGVAEVTVDVQALVGGSAVDLGSTTATAVLDPTGASTPVPFSIPAEPSLDGSDVQALDLRVTIHGPNVYSGFTALSGASYMDVPSYAASVNKTVLVSVDDSSFANAVPARLSGSTWSVAVPTPAVGKHTLYAESTQGYTTSAPTSLTFTVTK |
| Ga0209579_10000004766 | F052789 | AGG | VSTALGLLAFAAYALVVIGAAAAITWMVVKLTPPGRRPGS |
| Ga0209579_10000004836 | F005991 | GAGG | MRSYGMYDMTRGLTLALFAGLVGVALWGAAQVGTQTSSRFWIAMAIVAAAGLLLTLANHVGTWTKGLRMRTSPGTFVLAFLPVLVCVGWILMASQPGHGWQEGRIDSWSSSIGILGLVHSVGLWRGVLAFGFGVMLGLSLDGVPEPMVAPDTPAYAPIPTRTEGTATADEPVTAERRWAVRRRESGTPAGTTRVPERTRND |
| ⦗Top⦘ |