| Basic Information | |
|---|---|
| Taxon OID | 3300009419 Open in IMG/M |
| Scaffold ID | Ga0114982_1000061 Open in IMG/M |
| Source Dataset Name | Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S input2 FT |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |
| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 54120 |
| Total Scaffold Genes | 123 (view) |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 89 (72.36%) |
| Novel Protein Genes | 15 (view) |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 10 (66.67%) |
| Associated Families | 15 |
| Taxonomy | |
|---|---|
| Not Available | (Source: ) |
| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface → Subsurface Microbial Communities From Deep Shales In Ohio And West Virginia, Usa |
| Source Dataset Sampling Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location Name | Ohio, USA | |||||||
| Coordinates | Lat. (o) | 39.849 | Long. (o) | -81.036 | Alt. (m) | Depth (m) | 2500 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000369 | Metagenome / Metatranscriptome | 1222 | Y |
| F000441 | Metagenome / Metatranscriptome | 1136 | Y |
| F000868 | Metagenome / Metatranscriptome | 853 | Y |
| F000980 | Metagenome / Metatranscriptome | 814 | Y |
| F001125 | Metagenome / Metatranscriptome | 769 | Y |
| F007263 | Metagenome / Metatranscriptome | 354 | Y |
| F007921 | Metagenome / Metatranscriptome | 342 | Y |
| F008814 | Metagenome / Metatranscriptome | 327 | Y |
| F014140 | Metagenome / Metatranscriptome | 265 | Y |
| F021988 | Metagenome / Metatranscriptome | 216 | Y |
| F030095 | Metagenome / Metatranscriptome | 186 | N |
| F031497 | Metagenome / Metatranscriptome | 182 | Y |
| F045755 | Metagenome / Metatranscriptome | 152 | N |
| F051930 | Metagenome / Metatranscriptome | 143 | Y |
| F066776 | Metagenome / Metatranscriptome | 126 | Y |
| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0114982_1000061115 | F051930 | N/A | MSEEGLVEKLLCPVDQSLLFCNQDLEDNIFLYCLECKYLKNIGAATYENIVKRVNLNDNNKQ* |
| Ga0114982_1000061119 | F030095 | AGG | VAAQKNFEVDQNTTFTFEVQYLDEDQTPIQLNFHEAKLQVRDTQGGKKLAFTLTEQDGIQISPTEGKLKISISADRTNKMFYPKSAYDLVIVDPSVNKTRLLEGYMTLNRSVTV* |
| Ga0114982_100006122 | F014140 | N/A | MNKYRIKLDVEVEVEAFNSEDAGEYIHDIFNIDDEIKKVNIVKIQQK* |
| Ga0114982_100006125 | F066776 | GGA | MLNLTEQVVEIFIKKFQSSVQKSFWNNYDLVIWKKDHGGYTDTKGMYYNNTWGKAEKISVDKKGIWKLPKKYVRYFK* |
| Ga0114982_100006134 | F008814 | N/A | MDNKLTVLEEIIKEIGEELYQKWYNALAIEDRTEEASKAMSANAGETAVWVIQTFMNKFNNAADELKGD* |
| Ga0114982_100006147 | F021988 | GAG | MSNSFKKEDGTGMVPPANAGAPAGAVTSTNTPKKYPRQGVKIDTNKHGIRRETSLIPKPTKKPRYKKP* |
| Ga0114982_10000616 | F000369 | N/A | MNEIEPAVHFDRMNKVVEELLKGNSATQIATLTGFSRKEVLEYVDEWKSVVHNDSNIRDRAREAISGADQHYAMLIKEAWKTVEDADTQGALAVKSGSLKLIADIETKRIAMLQSVGVLENTQIASQIAETERKQEILVGILKEVTASCPKCKMDVAKRLSQITGIVEAVIIEDADVV* |
| Ga0114982_100006163 | F007921 | AGGA | MIDWLVNRIFRWDSLRKAVFDEVRMYQSIDRSMWEYEKEGPTNLTWSEGDRWYGWTYNSNAKRYYFDDIGNESLIGLWEDQWLREADAH* |
| Ga0114982_100006171 | F031497 | AGGCGG | MSKTQEIKVAESLVNLMDDHWFNPTIFGRYLAEQPIYTIDRIMEMVVSVISEQAKMYDVYSNQGTYTEGLALAKELNECIKAYQQDNNLVNLKLPSRSYKVKREEPVRERIFGWREEEDPFSQ* |
| Ga0114982_100006173 | F000441 | AGAAG | MSDNYLNDQLNTAQKLLWGGSETENIEAHNIIAKLIKDRIEQADLS* |
| Ga0114982_100006179 | F000868 | GAGG | MSSFLENENEMLIDAIFSEIGEQLVEDWMNSNLDEGQLYADWCVADMSNSNYLKGRFNQFHDLSPTDNYYLQWDENAE* |
| Ga0114982_100006181 | F001125 | AGGCGG | MGDRANFGFRDSKENIVFLYGHWAGHRMLENLADAVQIAHPRWNDEAYATRITISQMIGDEWASETGWGISVNELADNEHKVPIIDWKNKTFTLMEEDLQTVVFSTSLEAFVAKYCTQLSVV* |
| Ga0114982_100006185 | F000980 | N/A | MTNELVSSKYTFVCDPDECDCLIELTSSDGFGFPSGVTELTCPCGRKTTLVSVVNATIAPITQTKEEKMEPTTTQIPESYNSNLLVTYKVIRGFSDAEYATDKITSLEWDLHNGRQSQKTVGVLNSKIDSAKEIICEAYADSQDQDTLREIAEALGIELIKEVEWSASIEVSGTYSYNILENDYDLDLESEITDAIFADSHNGNIEINDQEVCNVREN* |
| Ga0114982_100006197 | F045755 | AGGGGG | MSQIAGMWICDNCDTLAVVSVLTDTIQITQCKCVTNERETNV* |
| Ga0114982_100006199 | F007263 | AGG | MLDTKYIDETEFYFIKDEMKFHCDESQFVYVCKEHGEQMGCYYCEFDYDKKCECE* |
| ⦗Top⦘ |