| Basic Information | |
|---|---|
| Taxon OID | 3300023112 |
| Scaffold ID | Ga0233411_10000083 |
| Source Dataset Name | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_2_MG |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Restricted |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
Note: Use of this dataset is restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the sequences below requires obtaining a license from the dataset's corresponding author(s).
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 32577 |
| Total Scaffold Genes | 55 |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 8 (14.55%) |
| Novel Protein Genes | 13 |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 4 (30.77%) |
| Associated Families | 13 |
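
The percentages in the table above appear to be the RBS-bearing gene counts divided by the corresponding totals (8 of 55 scaffold genes, 4 of 13 novel protein genes). A minimal sketch that reproduces them under that assumption; the helper name `percent` is hypothetical and not part of IMG/M:

```python
def percent(part: int, total: int) -> str:
    """Return part/total as a percentage string with two decimal places."""
    return f"{100 * part / total:.2f}%"

# Counts taken from the Scaffold Components table.
rbs_all = percent(8, 55)     # genes with RBS / total scaffold genes
rbs_novel = percent(4, 13)   # novel protein genes with RBS / novel protein genes

print(rbs_all, rbs_novel)
```
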
| Taxonomy | |
|---|---|
| Not Available | |
| Source Dataset Ecosystem |
|---|
| Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater → Methane Metabolizing Microbial Communities From Different Methane-Rich Environments From Various Locations |
| Source Dataset Sampling Location | |
|---|---|
| Location Name | Canada: British Columbia |
| Latitude (°) | 48.6569 |
| Longitude (°) | -123.4875 |
| Altitude (m) | |
| Depth (m) | 100 |
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F018478 | Metagenome | 235 | Y |
| F025412 | Metagenome | 202 | Y |
| F027524 | Metagenome | 194 | N |
| F028650 | Metagenome | 191 | N |
| F036312 | Metagenome | 170 | Y |
| F049288 | Metagenome | 147 | Y |
| F059442 | Metagenome | 134 | Y |
| F076108 | Metagenome | 118 | N |
| F079336 | Metagenome | 116 | Y |
| F099320 | Metagenome | 103 | N |
| F100003 | Metagenome | 103 | N |
| F103288 | Metagenome | 101 | Y |
| F105502 | Metagenome | 100 | N |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0233411_1000008311 | F100003 | N/A | MKIKRKHYKALQYASLIQRWKYLPSNFIFQVVQNSKVDETMLNRNRIEQNDKRI |
| Ga0233411_1000008312 | F018478 | GAG | MIKEFEEMDWSKEYTYKEKKIYISYETKKYILCSFYESGKGTFKLDKTEFHG |
| Ga0233411_1000008319 | F028650 | N/A | MRIPKSLKEVLVKDYIQINKIRSAEYDNPFTRTIDLLCIFNDRKDVLKCKPAELAVDLSHLLEEPSRVLKQYFTINGKRYGIVNHVNDLEAGQYMSFTTYLKGFADNPNVHIDQMPDILASVIFPVDKNNKVMAIEPSYFRNLAEDIRNTMSIEDAYPIAVFFCNLSRSLTKCTQDYLNTKLEKMTEESRNAILEVAKDLESDGVGLPHSITSAMEILQKDPTTRK |
| Ga0233411_1000008321 | F049288 | GGA | MSEGLVTPHSTITEVLKAFGLEMQKDLRAELVKDHAYVSGDLAEQIEFTTVVEGTAFVSSLRLKDYYDYVNKGVNGTRSVKNNTPYSYMQSSKIPFYFAKQWMNAKGLFVDKGTTLTSLAGNKYKAGSKDSQAFAMARSWKEKGTEGNHFYDKVVTQARLDKLSKDLASAAAGDLKIALTDTFKRLK |
| Ga0233411_1000008333 | F105502 | N/A | MLSSYQRLKFKKDLAVKVAEELKEDIEFVILYPHTARAKNIRREVKEKNNG |
| Ga0233411_1000008344 | F076108 | N/A | MIRIELSDNEIEYSTYFPIDDPHDIMYSFEEMVRMYTKADLEVDSYILQRAKEIKLKNSN |
| Ga0233411_1000008346 | F036312 | N/A | MKQKLTHSLYEAKKVQKDLYEVCTTNYWNNGTYTVKDISHHSTEREAQEQKEINKHKNQI |
| Ga0233411_1000008348 | F079336 | N/A | METILRSELETADKRIIVLESTIDTYKRILDAQDERIELMEKSHKFELENYYTKNTEL |
| Ga0233411_100000835 | F027524 | N/A | VNEDFIKEKRQVIETACKNICKHSDIWRDLSQEVNIYFLTNALPENLNKIDGFIFVVAYKMFHLSGSEFNRLHFDNVLKESTELDYLKADDIPYISNNVYKEYLEQVKQLDEMERIWVEEIVKRNLSIKLFSDHTGIHRTTAKERMNSIYEKLRKNNK |
| Ga0233411_1000008350 | F059442 | N/A | MRKEDRYKPYIYKTRNKQGKLEEYSRYYKTKREAVYWYKTQGKWLEKHLNRKLILIDTDINLFTYVPRALLNR |
| Ga0233411_100000836 | F103288 | N/A | MSIIIISIISILGWTDIFKQTFTTTEGFNYKYKAISKILYAIDYKPLNCAYCLSFWVGLIFSIGLLDISYMVIFLYFARTRE |
| Ga0233411_100000837 | F025412 | GAG | MDYRKLKWGALKSYGTKLGIDTKGMTKDVLLEWLDAMPDVAHGIEELKPFTGIKQHHPLFDEIKDYIPYLKAFKKLKAVSPVPEVNKAIATLFLKYIEEQKNIRINLGCGRCKHSYYERMIAGYNRLVDEYGGERI |
| Ga0233411_100000838 | F099320 | AGTAG | MEENVYSYCLEVHEDGNLYMVTEYMNGYITIWAANATIETEGEVYFINLYE |