Basic Information | |
---|---|
IMG/M Taxon OID | 3300018405 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0114818 | Gp0116262 | Ga0194138 |
Sample Name | Freshwater microbial communities from Pennsylvania, USA, analyzing microbe dynamics in response to fracking - TARM_MetaG_T2_14 |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 718678970 |
Sequencing Scaffolds | 33 |
Novel Protein Genes | 37 |
Associated Families | 27 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4 |
All Organisms → Viruses → Predicted Viral | 3 |
Not Available | 18 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Halanaerobiales → Halanaerobiaceae → Halanaerobium | 1 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → unclassified Clostridiaceae → Clostridiaceae bacterium | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → unclassified Chryseobacterium → Chryseobacterium sp. | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Watersheds → Freshwater And Sediment Microbial Communities From Various Areas In North America, Analyzing Microbe Dynamics In Response To Fracking |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | aquatic biome → hydraulic fracturing → fresh water |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Pennsylvania | |||||||
Coordinates | Lat. (o) | 41.2451 | Long. (o) | -76.9856 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000166 | Metagenome / Metatranscriptome | 1810 | Y |
F000388 | Metagenome / Metatranscriptome | 1201 | Y |
F000695 | Metagenome | 931 | Y |
F005175 | Metagenome / Metatranscriptome | 409 | Y |
F009516 | Metagenome | 316 | Y |
F011383 | Metagenome / Metatranscriptome | 291 | Y |
F012648 | Metagenome / Metatranscriptome | 278 | Y |
F020909 | Metagenome | 221 | Y |
F023829 | Metagenome / Metatranscriptome | 208 | Y |
F033769 | Metagenome / Metatranscriptome | 176 | N |
F037502 | Metagenome | 168 | Y |
F058545 | Metagenome / Metatranscriptome | 135 | Y |
F065554 | Metagenome | 127 | N |
F065566 | Metagenome | 127 | N |
F069722 | Metagenome / Metatranscriptome | 123 | Y |
F078253 | Metagenome | 116 | Y |
F084270 | Metagenome / Metatranscriptome | 112 | N |
F087432 | Metagenome | 110 | Y |
F088556 | Metagenome | 109 | Y |
F089595 | Metagenome | 109 | Y |
F090615 | Metagenome / Metatranscriptome | 108 | N |
F091594 | Metagenome / Metatranscriptome | 107 | Y |
F095153 | Metagenome / Metatranscriptome | 105 | Y |
F095378 | Metagenome | 105 | Y |
F100683 | Metagenome / Metatranscriptome | 102 | Y |
F104682 | Metagenome / Metatranscriptome | 100 | N |
F105047 | Metagenome / Metatranscriptome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0194138_10000674 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 16185 | Open in IMG/M |
Ga0194138_10024152 | All Organisms → Viruses → Predicted Viral | 2729 | Open in IMG/M |
Ga0194138_10051646 | Not Available | 1797 | Open in IMG/M |
Ga0194138_10053310 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Halanaerobiales → Halanaerobiaceae → Halanaerobium | 1764 | Open in IMG/M |
Ga0194138_10056367 | Not Available | 1709 | Open in IMG/M |
Ga0194138_10068307 | All Organisms → cellular organisms → Bacteria | 1532 | Open in IMG/M |
Ga0194138_10071001 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae | 1498 | Open in IMG/M |
Ga0194138_10078912 | All Organisms → Viruses → Predicted Viral | 1409 | Open in IMG/M |
Ga0194138_10102295 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → Anaerolineaceae → unclassified Anaerolineaceae → Anaerolineaceae bacterium | 1208 | Open in IMG/M |
Ga0194138_10118134 | All Organisms → Viruses → Predicted Viral | 1107 | Open in IMG/M |
Ga0194138_10132678 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1031 | Open in IMG/M |
Ga0194138_10134595 | Not Available | 1022 | Open in IMG/M |
Ga0194138_10138304 | Not Available | 1005 | Open in IMG/M |
Ga0194138_10148069 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 964 | Open in IMG/M |
Ga0194138_10149203 | Not Available | 959 | Open in IMG/M |
Ga0194138_10149754 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → unclassified Clostridiaceae → Clostridiaceae bacterium | 957 | Open in IMG/M |
Ga0194138_10149925 | Not Available | 956 | Open in IMG/M |
Ga0194138_10150027 | Not Available | 956 | Open in IMG/M |
Ga0194138_10165088 | Not Available | 901 | Open in IMG/M |
Ga0194138_10211296 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → unclassified Chryseobacterium → Chryseobacterium sp. | 771 | Open in IMG/M |
Ga0194138_10224023 | Not Available | 744 | Open in IMG/M |
Ga0194138_10263463 | Not Available | 672 | Open in IMG/M |
Ga0194138_10271468 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 660 | Open in IMG/M |
Ga0194138_10272471 | Not Available | 658 | Open in IMG/M |
Ga0194138_10282702 | Not Available | 643 | Open in IMG/M |
Ga0194138_10288319 | Not Available | 635 | Open in IMG/M |
Ga0194138_10320838 | Not Available | 594 | Open in IMG/M |
Ga0194138_10327350 | Not Available | 587 | Open in IMG/M |
Ga0194138_10348824 | Not Available | 564 | Open in IMG/M |
Ga0194138_10352448 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 561 | Open in IMG/M |
Ga0194138_10352824 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 560 | Open in IMG/M |
Ga0194138_10362208 | Not Available | 551 | Open in IMG/M |
Ga0194138_10384529 | Not Available | 532 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0194138_10000674 | Ga0194138_1000067410 | F095153 | VTDLPYGIATISAAVISGVAAIKASKAEKNSRPVSNGFADGIRTDVREIRNLLIDHIKDHNKN |
Ga0194138_10024152 | Ga0194138_100241524 | F091594 | MSENGITHYCLGNGELKCDGCGQEKNWQTLNQMPDALRKTLQAQAQRIDDTDCILSGRPWYVGA |
Ga0194138_10051646 | Ga0194138_100516461 | F087432 | VEETHYANFRTHWEDMVAKHDHLTNLNFEQITFSAEQDARLQEISELSIPQGFQAQVREYVENGNFPEGYEHPLSDLKLKKERLQHQNDIDEAYQMILESEGLI |
Ga0194138_10053310 | Ga0194138_100533104 | F087432 | IIYENGEYTPCTYRVTLQNRGVEETHYANFRTHWEDMVAKHDHLTNLNFEEIIFSAEQDARLQEISELSIPQGFQAQVREYVENGNFPDGYEHPLSDLKLKKERLQHQNDIDEAYQMILESEGLI |
Ga0194138_10056367 | Ga0194138_100563673 | F000388 | MKVIKVTKEYFQTEDEKVYFFEPLEKEISVEDMQKIVDANEKLVKELKDEHSKF |
Ga0194138_10068307 | Ga0194138_100683071 | F100683 | MTDHAAATVEALEKILALLRAGHAPEDLGEAVILLGRLMARRT |
Ga0194138_10071001 | Ga0194138_100710011 | F069722 | CFLRDPKAKRGRTDLKKGAAHEKALGHESGAALQE |
Ga0194138_10078912 | Ga0194138_100789123 | F084270 | MPRQTGETNTQFPDGFRLEISTDGTTGSVWEEVGVLAGGATATLNWTDFYLDAGNYEGLVDKARDPEFALAPSAVWNWDATVIANLFPGMFSTSSATLPTTGTDVEYAGTSNQVTLTRSKIRLTHYTVDASGGAETNADIDWQFTLHNAKIDAGGSFNFKGVNEDGLDEITVSFTGKPDPASSYALFTFFKA |
Ga0194138_10102295 | Ga0194138_101022952 | F058545 | MKMKYVFVLNASLILFLLVACSAINPTPAQPASTPTTVVTPGTGVQYQYVTNTLLLPATRDQTQAYALNIDGDPKQNNENKFGELLSLLVSAAPGLELQSTLDQAINLGQLVTLHMVKTDDFLNDPSVSWFIYLGQPSDAPSFDNSDNFTLDSKTPLNSPIIGSITNGHFSGGPGSVRVRMFILGQLVEVDLIGVRLEADVNQNGCVNGKLGGGVTVDEFRTKLLPAIAIGLNQIIKDNKNTALVLLK |
Ga0194138_10118134 | Ga0194138_101181342 | F104682 | MREIGIDRPRFDIKVSTQTFTLYFIPNIARKKYIDFWARVEKMLEAMKIQNKKKRKEMVDAISSDDEDELLREIIQIIMEANGYTYDSEWWEGHTDAEMQLEFVRAVIENADDSKKKVMAMLKA |
Ga0194138_10132678 | Ga0194138_101326782 | F090615 | MRALQLAAPEPTDPSLIAALRVRKGWLRPDSPIVDKAAAEGLKKLGALLCPPPDADKRSATVKQINEVATEYLKAVPQARLLIEESAVEHQAFWDPDQPYVERTDVQLHGNLVTAIKTANADRLRAAEAKAAAVS |
Ga0194138_10134595 | Ga0194138_101345952 | F090615 | MRALQLAAPEPTDPTLIAALRVRKGWLRPDSSIVDKAAAEGLKKLGVLLCPPPDADKRSATVKQINEVATEYLKAVPQARLLIEEAASELQAFWDPDKPYVERTDVQLHGNLVTAIKMANADRLRAAEAKAAAGS |
Ga0194138_10138304 | Ga0194138_101383043 | F065554 | MIGTVTIGDTPVDVYGTECSSEPDVGIMGNYVEIEDLEVGGISIYEMVANNPIFDQIQEAINDMVNS |
Ga0194138_10148069 | Ga0194138_101480692 | F012648 | MQQNITIKYIDGSETTYQVRPPDYAKWELTTKKVIAQFGGMWDILYVAHSAMKRDAGGKPVKPLDVWMESVADVEVGDESPKVIQEEA |
Ga0194138_10148069 | Ga0194138_101480693 | F011383 | EGTFALSMLADWGKANSVCEALWTAAESAPDTDISVTLTAATGAQFVFPIMPEFPTAGGAGTDAQTVDFTFKVSKGAVVETFS |
Ga0194138_10149203 | Ga0194138_101492031 | F037502 | GPDSQSAHNVDVWGQVLAVDFFVSGVFGREQAEEIVNLMRGLGFTGIGVYSDTRNNRGVEQVMFHGDVRPTAAMGNPATWGRVNGHYTSLIAAIQSLPLGGRS |
Ga0194138_10149754 | Ga0194138_101497542 | F033769 | MSVNAGVKTATSTASTVGVATNREYLRLTNESLTNRCRAGDSSISDTAGVIIEPGQTVEFRTMPSDNKEIYVLSEGASVRLAYYEVISA |
Ga0194138_10149925 | Ga0194138_101499252 | F090615 | PAPAQKPTAAGTLTRSEKLARIRALQVAAPEPTDPALITALRVRKNWLRPDSPIVDKAAVEGLKKLGVLLCPPPDPEGKSIPERRFIDIATEYLKAVPQARLLIEEAAVEHQAFWDADQPYIDQSDEDIHEEMVTAIKIANGERLRAAKAKAAAGS |
Ga0194138_10150027 | Ga0194138_101500271 | F089595 | MFHSDVRPTEDMGDPATWGRVDGKYTSLMAAIQSLKN |
Ga0194138_10165088 | Ga0194138_101650881 | F078253 | VFRHVKIKVMEIRKIYRIENPIDMDGMWYTKNGVERRKIHLLCPDGISKDLPMPFNIELHRKDGLIWNSAGKSIENMREWFTDKDAANLYNNGFRLFEFHTTIYNELEHEILFCRDGIVMQRDIPLETIWKL |
Ga0194138_10211296 | Ga0194138_102112961 | F023829 | MKIRLIRSYRSKNGNPTFVYEVSGNANDLAAFKVAQGEFYREDEKTGSPLWFTTRCVGDNGKLIITTNGKVVPDMSAYDQAASIAAQYGGNFGQELAKQAAMSILGSKASAATPVENTATVNEDAALDDL |
Ga0194138_10224023 | Ga0194138_102240231 | F090615 | VRAAPVPAPAQKPTPAASGTLTRSEKLAQMRALQLAAPEPTDPSLIAALRVRKGWLRPDSSIVDKAAVEGLKKLGVLLCPPPDPEGKSIPAKLFNEIAAEYLKTVPQTRLLIEESAVEHQAFWDADQPYIDQSDEDIHEEMVTAIKMANVDRLRAARAKASAGS |
Ga0194138_10261917 | Ga0194138_102619172 | F088556 | MLKEIRKDIFTQAEYAKKVGKSRAWVNQQIKSGNLKTLTVNGAVLVKI |
Ga0194138_10263463 | Ga0194138_102634633 | F087432 | VEETHYANFRTYWEDMVAKHEYLTNLSFEEITFSAEQDARLQEISELNIPQGFQAQVREYVENGNFPEGYEHPLSDLKLNKERLQHQNDIDEAYQMILESEGLI |
Ga0194138_10271468 | Ga0194138_102714681 | F020909 | KKVRAIAIAVGILLIWQVATNLWWVGIDAPNAEFLGWCWGSMSECVVL |
Ga0194138_10272471 | Ga0194138_102724711 | F005175 | MIDQPLHVEDVIDMYNEKILILQKEIDRLNEEIQV |
Ga0194138_10272471 | Ga0194138_102724712 | F105047 | MRSINFDKVIVEWMDINSCDDAWNTEDHLKDLMPASCTTIGYLYEDTPHFVKTFATFSFNTDDTIDFGDCVVIPKGCVVSIKKLEN |
Ga0194138_10282702 | Ga0194138_102827023 | F087432 | VEETHYANFRTYWEDMVAKHEYLTNLSFEEITFSAEQDARLQEISELNIPQGFQAEVKEYVENGNFPEGLNNPLAGLKYKKEMNDAYKMILESEGLI |
Ga0194138_10288319 | Ga0194138_102883191 | F095378 | MKAILIDELVKDQSTTEQVSVTIEGKKLEGWQIAKPLNYEKKYTRFADRFKSAIKVLAGKAIAVQFFTDLSEKDKIAYVKTKIN |
Ga0194138_10320838 | Ga0194138_103208382 | F065566 | MHDNRKQIRVTLPADKVEQFKKAKAKAEDAAMIKLTDTQFASRLLAKAIEQ |
Ga0194138_10327350 | Ga0194138_103273501 | F087432 | VINLIIYENGEYTPCTYRVTLQNRGVEETHYANFRTHWEDMVAKHDHLTNLNFEEVSFSAEQDARLQEISELSIPQGFQSQVREYVENGNFPEGYEHPLSDLKLKKERVQHQNDIDGAYQMILESEGLI |
Ga0194138_10348824 | Ga0194138_103488241 | F090615 | VRAAPVPAPAQKPTPAASGTLTRSEKLAQMRALQLAAPEPTNPALITALRVRKGWLRPDSSIVDKAAAEGLKKLGVLLCPPPDADKRSATVKQINEVATEYLKAVPQARLLIEESVVEHQASWDPDKVYVERTDADIHEEMVTAIKTANADRLRAAEAKAAAGS |
Ga0194138_10352448 | Ga0194138_103524482 | F009516 | HKDINASLGVRIMARPMIYPMGTLEVGEVATMPATKRGDAKRTSRNASQYGIRNGKAFKCRTVEGVTFITRLR |
Ga0194138_10352824 | Ga0194138_103528241 | F090615 | LRVRKGWLRPDSPIVDKAAAEGLKKLGVLLCPPPDPDGKSIPERRFIDIATEYLKAVPQARLLIEESAVEQQASWDPDKVYVERTDADIHGNLVTAIKIANAERLRAAEAKASAGS |
Ga0194138_10362208 | Ga0194138_103622081 | F087432 | MVKYENGQYTPCTYRVTLQNRGVEEIHYADFRTYWEDMVAKHDHLTNLSFGEITFTAEQDARLQEIAELDIPEGFTSQVRNYIENGEFPEGYQHPLSDLKQEKEKEQYQSDLDDAYQMILENEGLI |
Ga0194138_10384529 | Ga0194138_103845292 | F000695 | MIYAQHYKAEEFRDWSDDMSPRLVTMMDVLRFRIGSLIVISPHPDSLGRE |
Ga0194138_10389437 | Ga0194138_103894372 | F000166 | AHFFALCVKKWQEILNLNDWRIEKGLKPAKQAMASVEFNESARLATYRLGDFGAEKITHESLDCTALHELLHVMLHDLLATAQDPKSSQDDIDKQEHRVINLLERLLSKDSYGIK |
⦗Top⦘ |