Basic Information | |
---|---|
Taxon OID | 3300027706 Open in IMG/M |
Scaffold ID | Ga0209581_1000002 Open in IMG/M |
Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes) |
Source Dataset Category | Metagenome |
Source Dataset Use Policy | Open |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Sequencing Status | Permanent Draft |
Scaffold Components | |
---|---|
Scaffold Length (bps) | 1458151 |
Total Scaffold Genes | 1560 (view) |
Total Scaffold Genes with Ribosome Binding Sites (RBS) | 947 (60.71%) |
Novel Protein Genes | 11 (view) |
Novel Protein Genes with Ribosome Binding Sites (RBS) | 9 (81.82%) |
Associated Families | 11 |
Taxonomy | |
---|---|
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | (Source: IMG/M) |
Source Dataset Ecosystem |
---|
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
Source Dataset Sampling Location | ||||||||
---|---|---|---|---|---|---|---|---|
Location Name | USA: Pennsylvania, Centralia | |||||||
Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F002967 | Metagenome / Metatranscriptome | 517 | Y |
F002992 | Metagenome / Metatranscriptome | 515 | Y |
F003246 | Metagenome / Metatranscriptome | 498 | Y |
F004844 | Metagenome / Metatranscriptome | 421 | Y |
F005031 | Metagenome / Metatranscriptome | 414 | Y |
F005922 | Metagenome / Metatranscriptome | 386 | Y |
F007034 | Metagenome / Metatranscriptome | 359 | Y |
F009094 | Metagenome / Metatranscriptome | 323 | Y |
F010130 | Metagenome / Metatranscriptome | 308 | Y |
F014125 | Metagenome / Metatranscriptome | 265 | Y |
F033149 | Metagenome / Metatranscriptome | 178 | Y |
Protein ID | Family | RBS | Sequence |
---|---|---|---|
Ga0209581_10000021046 | F003246 | GAG | VASTPRDSFRPADAVAGLMAAGALFLGVFELFYRPFRLAPAAVILILIATIMSKEQQRLIALALAVVGICFVVGTALQVITNHPLY |
Ga0209581_10000021075 | F004844 | GGA | MHTMTRPGERVSVFLFALALLALFVVLTFAAGYALGRILL |
Ga0209581_10000021108 | F002992 | AGG | MSLRTAPVVEHAALPGGGTVTVWVGVPDDPYIEDKSQLTTVDLQLHEGRSVVASVSTVLDPDQDGEGRQLARDVKEALEAGRIGLHAHELEPFADRLR |
Ga0209581_1000002130 | F009094 | N/A | MAMCHVCRRSLLAGERYRTWRWARRDRTVCAVCEPQARDAGWIRVVDGFEQVRVTGLTQTVRRVA |
Ga0209581_10000021338 | F002967 | GGA | MSAEPTVVWLSPSATWFACLCEHCLEAARLEGALFAEALRSASVRGPIAAEASVSSVRCPAGHEVVLRRVERPPALAHPDERQLQLA |
Ga0209581_1000002144 | F005031 | AGGGGG | MAGVRRGARSVLKWFAPLYVVAIVVQVFLAGEGIFGMKNVSDSDHCNKHGVQCIANSKDLDAHRALGFILTMPGALLFLIVALLAWHPNTRVRAVSIVVPILTFVQMILAGAGRWAGGLHPVNAFLVLALYGWLTYRLRQEEPATSQAAAPVTVPAA |
Ga0209581_10000021534 | F033149 | GAGG | MRSRAPLGLGGFLALPVFFGALMAASLAVEKPRVVEWSRPHGHLARIFHDPTASNEARIWLLALVPPLLLVLAGWFASYLPFGIYLTCAAAIIDAVALTLRLHRWEVHHTARFPYGEDLLADQTNSSSLLKGEWEHDAAQTVRSFEHYTIGLAVAAALISLFLTYRRRRAPVLGVPSALQQTGGAPTSTGV |
Ga0209581_1000002696 | F010130 | N/A | VQNPELLLAISTGIAVEVAHLRWIVLTQWRCRACGVPHLHCACKPAWLRRLL |
Ga0209581_1000002772 | F005922 | GGAGG | MSRDGGVRADDRLVTLLSQWLVGTLGNDELRREVEAADGDELAPGQRRAVEELLAELRAALPGERAGLQVAVREALEALAYGE |
Ga0209581_1000002950 | F014125 | GAGG | LQFQIGKRGGRPPGVRPEFTTVGDDRIIGSTLWLDEDGKRIERFQVLTTRDGKIVDMQGFETRRQAERFARRRTF |
Ga0209581_1000002951 | F007034 | AGG | MVDAEMIEQLRFAAEALGRGDPGPFASLFAEDAEWRGVSRGHLWWKQTPS |
⦗Top⦘ |