| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300029199 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0127423 | Gp0192912 | Ga0168647 |
| Sample Name | Deep subsurface microbial communities from shale gas basins in Pennsylvania, USA - 1_Day0 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Marine Biological Laboratory |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 90030480 |
| Sequencing Scaffolds | 10 |
| Novel Protein Genes | 18 |
| Associated Families | 18 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 4 |
| Not Available | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1 |
| All Organisms → Viruses → Predicted Viral | 3 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Deep Subsurface Microbial Communities From Shale Gas Basins In Pennsylvania, Usa |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface → Deep Subsurface Microbial Communities From Shale Gas Basins In Pennsylvania, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → planetary subsurface zone → shale |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Pennsylvania | |||||||
| Coordinates | Lat. (o) | 39.898 | Long. (o) | -79.976 | Alt. (m) | N/A | Depth (m) | 2517 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001968 | Metagenome / Metatranscriptome | 610 | Y |
| F011477 | Metagenome / Metatranscriptome | 290 | Y |
| F013514 | Metagenome / Metatranscriptome | 270 | Y |
| F014004 | Metagenome / Metatranscriptome | 266 | Y |
| F018881 | Metagenome / Metatranscriptome | 232 | Y |
| F029435 | Metagenome / Metatranscriptome | 188 | Y |
| F032252 | Metagenome / Metatranscriptome | 180 | Y |
| F035276 | Metagenome / Metatranscriptome | 172 | Y |
| F048175 | Metagenome / Metatranscriptome | 148 | Y |
| F048267 | Metagenome / Metatranscriptome | 148 | Y |
| F050990 | Metagenome / Metatranscriptome | 144 | Y |
| F057313 | Metagenome / Metatranscriptome | 136 | Y |
| F067687 | Metagenome / Metatranscriptome | 125 | N |
| F071779 | Metagenome / Metatranscriptome | 122 | Y |
| F077031 | Metagenome / Metatranscriptome | 117 | Y |
| F080982 | Metagenome / Metatranscriptome | 114 | Y |
| F081278 | Metagenome / Metatranscriptome | 114 | Y |
| F088534 | Metagenome / Metatranscriptome | 109 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0168647_100926 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 11119 | Open in IMG/M |
| Ga0168647_101050 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 9980 | Open in IMG/M |
| Ga0168647_101054 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 9952 | Open in IMG/M |
| Ga0168647_101171 | Not Available | 9054 | Open in IMG/M |
| Ga0168647_101439 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 7672 | Open in IMG/M |
| Ga0168647_101494 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 7451 | Open in IMG/M |
| Ga0168647_102182 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 5336 | Open in IMG/M |
| Ga0168647_102993 | All Organisms → Viruses → Predicted Viral | 4121 | Open in IMG/M |
| Ga0168647_104277 | All Organisms → Viruses → Predicted Viral | 3045 | Open in IMG/M |
| Ga0168647_105805 | All Organisms → Viruses → Predicted Viral | 2320 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0168647_100926 | Ga0168647_10092610 | F001968 | MFDELWTEIQETPGEIFDILELQEEQEKFDFDGYLAADYDY |
| Ga0168647_100926 | Ga0168647_10092611 | F050990 | MTPDTLNFTGDAVTYLGLIGVISTLIIVVTSFRRYFNSPLKK |
| Ga0168647_100926 | Ga0168647_10092612 | F014004 | MTTLLLHVTEVLFDFDDEDFTLEEQQSVVDSVLGNTYQVEVLNPNDDDEIADVIVEEVTDATGWCVVSLNYRQVTNTHS |
| Ga0168647_100926 | Ga0168647_10092615 | F088534 | MFSTTITNPQSFGNTYQWAILSVLPMDSNKKVGGLNSPEIKEALGLSNAARTSLCLMMKEMAGKGLVQRYDYKVGNRRFVTYKRLLPLRKRERIAQFLGL |
| Ga0168647_100926 | Ga0168647_10092624 | F081278 | MSYTPIRRHATAADFTRWQEMAQSMTVAQLRYAADDCRKVESCWRGYDPMVEGFYSDQAATYGTELFRRRQIR |
| Ga0168647_100926 | Ga0168647_1009265 | F071779 | MTTKTTYRVIGFGTNEHGFFNEFAHCGTIGYACGFYNGHLQDPEMDGVVLIRVNHEDWEVIQEFGTIPYSVSCGPGGTFKVEKAVDMVLV |
| Ga0168647_101050 | Ga0168647_10105015 | F048267 | MTDEEWETSLKSMNEEAKKRVDEEIDKIKYNVVKEIINRELKND |
| Ga0168647_101050 | Ga0168647_10105020 | F035276 | MTKSYISCYNNETKELEHFEVPYSIFVYIRQLEDEIKYASGGVKRLYPFRFGEEE |
| Ga0168647_101054 | Ga0168647_10105411 | F018881 | MTPKLLNSDGNYDEWCEEELLKCYKDAAEYDDVLFGDHDYSYVWLDKGP |
| Ga0168647_101171 | Ga0168647_10117117 | F057313 | MTEKPVRRYELCTERLETLEDVKKVLKALQIRIDTDNPLYEELEYYFSTEVVPLGYLKVYEKVGDKIYEMSMEEIAQMGSELLNQNEETN |
| Ga0168647_101439 | Ga0168647_1014395 | F011477 | MSVAVIWAWVLANEAAVATILLIVSELLGAVPQVKSNGLASFVILQVRKVLENKGAKDPT |
| Ga0168647_101494 | Ga0168647_1014946 | F048175 | MASGSFQFRYNHSNENAAWHTNPDAKFILPKEDVDIICDDPYLNENQFLEMVRRFFIACGYTEKQWKNALKEHLEEVEKTE |
| Ga0168647_102182 | Ga0168647_10218216 | F029435 | MTKKDLQEILTPEFLSTLHTVVECCNWDVDMIETMEFCDWCYNIAGQPKPQYDMGIMMEDEE |
| Ga0168647_102182 | Ga0168647_1021828 | F067687 | MIDKKEVAQAIVDAADEVFSNGGRIRDGFASALRVLADTVVPEDYKCFYGDKEYDTGMEVRNEEIREAILSIATELEVL |
| Ga0168647_102993 | Ga0168647_10299310 | F013514 | MTIEEIQKFLDDNNTTLEEYLRDNMITDEDRKYVDKIWVDAIYRNLGEDTP |
| Ga0168647_102993 | Ga0168647_1029933 | F032252 | MKLYRYKKDGNLYKLYEQKMPYYELQAVPYFPNQGILAKSKRSISMNDFEVVAEL |
| Ga0168647_104277 | Ga0168647_1042776 | F080982 | MTSQSPRIYIYKITFEEVLYYYYGVHKEDVFDEEYWGSPYTNKWVWELYTPKKQILEVFPYTDDGWIEGNKVEKRIIKEFYNDDKWCLNENCA |
| Ga0168647_105805 | Ga0168647_1058051 | F077031 | SIVAVAVASFLAFLTVKCDISEKDVLKYYNELRKLIKWDLPDTMIDEFDNQLNQRINEDPELLQQKIENEVDYAIIDYEIEEQRSRMINMRNQNILEEINKPKYDNLQKLIVKNAIYYEFADGTMGIRGAWVAPDPREIPFE |
| ⦗Top⦘ |