| Basic Information | |
|---|---|
| Taxon OID | 3300027965 Open in IMG/M |
| Scaffold ID | Ga0209062_1001036 Open in IMG/M |
| Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2 (SPAdes) |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 58207 |
| Total Scaffold Genes | 62 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 52 (83.87%) |
| Novel Protein Genes | 8 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 7 (87.50%) |
| Associated Families | 8 |
| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria | (Source: UniRef50) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | USA: Pennsylvania, Centralia | |||||||
| Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001527 | Metagenome / Metatranscriptome | 678 | Y |
| F013999 | Metagenome / Metatranscriptome | 266 | Y |
| F017111 | Metagenome | 242 | Y |
| F018762 | Metagenome | 233 | Y |
| F050488 | Metagenome / Metatranscriptome | 145 | Y |
| F055628 | Metagenome / Metatranscriptome | 138 | Y |
| F085449 | Metagenome | 111 | N |
| F092401 | Metagenome / Metatranscriptome | 107 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0209062_100103614 | F050488 | N/A | VCGVAERSEGEGNRVVEDSWNTHIDYLEDEVKKIDERAGRLEVTIADLENEQKKHALRELVQHLRDAAKEHRKYLALVKRK |
| Ga0209062_100103615 | F092401 | AGGA | MPFSGNCHSFDSSTLDSSTLKALNDVGMVYGLFKEDLPFRPDHYTCLFVGQTNNLRARLLEHYSNRSIAGVTHFFAETSATEQQQKLREKELIAEFNPSGNKN |
| Ga0209062_10010364 | F001527 | AGGAG | MIVTSVDILGRHLKPLCSRDNRVMKYESGGSKANTGDRASYHCGVEGCSVRYSSTDGYYMLIGMPDHANPVAEPGVNTARCPIHGRWLYRRLNVDAGPGVGWSCGVEGCDYGHNANTKGDWVRS |
| Ga0209062_100103644 | F055628 | GGAG | MLNKRLWLLLPILSLAIVSAAGCLSRSNVSGVWKGSIESTDKRGHKWQGPAELTLNQNGGAITGTLVFTPPQAGRVQVPITSGVVSKDSLTFSGQNNLPMASVEITFHGSVSGTTLSGTADMTSRSVILGPATETASLSLQKQ |
| Ga0209062_100103645 | F085449 | GGA | MVSGAAPSSFRLNSVDRHGRQIDPSVLAAAETIFPKALDYGQSLLGDLAVIANTLEEVAANVSQRMARRDSSGEPEAIRNLPGYVFRAFVREVNRLKNKELAVLDAAVEGQTLAQRLADPARQLEMKVLVKECLARFDFTERDMCWRRLEGFTWDEIGPVHELSAHAAEVRFRNAVRAVKAKLVRSRKPLPPTAQTAQNEQLMPAMEADDDERKT |
| Ga0209062_100103653 | F013999 | AGGAGG | MFRIVSRKRQIELRRFGRPIGSAVATLLLSSPAFAQFGGDRVTSFLSNALGYAQGLGIFGAGFLVIWAIANIARERPSGKQWAGAGGALLLSSVLQLLRTFAG |
| Ga0209062_100103656 | F018762 | AGGA | MRQHSRKFVFLALTTALALIVPITAHAFIGTMPVIDWTAVVRIGRQIGISQETLNTLGLYVQQYNRVNAGVQEGIRLSRGRQLQGVLNQVVGSQFPQFQQLQRDFNGVLVDPSILRGDLELTYGATPSTDFPISRKKRIDAADATATLGLLEASRAEMVSQQEELDADDIESRAALSSPGGAAKLSAAANGAMLRSQAYDQRLLARLMRLQALNIARDNSLEKEQEQVRQAQLTTVSNMVGGMRLSYGIGDKVGQ |
| Ga0209062_100103658 | F017111 | AGGAG | MTTIPLTLEETQKAKSELGDYTEFYSHLAMQTRRASRVALVACAVALISIVSAILAQLRPPILLRVQDGKVSSLDGSDVQVAQTAVQQQPDNAEKLSFVNTFLSRFVNIDPLTVKRDTTLALNQMTYTLRQQILAQLNQENFVDTVRQNNVTSTLAVKSAELVSGDPYTAIVFGRKRLTTLINGQENEKNLLVKYTIRLAPVARSAANGWAGLEIADYKEEVLQP |
| ⦗Top⦘ |