| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300025275 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0053074 | Gp0054788 | Ga0208057 |
| Sample Name | Marine microbial communities from the Deep Pacific Ocean - MP1649 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 144170637 |
| Sequencing Scaffolds | 17 |
| Novel Protein Genes | 24 |
| Associated Families | 24 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 1 |
| Not Available | 11 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 4 |
| All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Deep Ocean Microbial Communities From The Global Malaspina Expedition |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean → Deep Ocean Microbial Communities From The Global Malaspina Expedition |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | North of Tokelau, South Pacific Ocean | |||||||
| Coordinates | Lat. (o) | -5.74 | Long. (o) | -170.77 | Alt. (m) | N/A | Depth (m) | 4017.73 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000245 | Metagenome / Metatranscriptome | 1468 | Y |
| F001625 | Metagenome / Metatranscriptome | 661 | Y |
| F002745 | Metagenome | 533 | Y |
| F002874 | Metagenome / Metatranscriptome | 524 | Y |
| F004881 | Metagenome / Metatranscriptome | 420 | Y |
| F005608 | Metagenome / Metatranscriptome | 395 | Y |
| F008720 | Metagenome | 329 | Y |
| F013649 | Metagenome | 269 | Y |
| F014899 | Metagenome / Metatranscriptome | 259 | Y |
| F015491 | Metagenome | 254 | Y |
| F016015 | Metagenome / Metatranscriptome | 250 | Y |
| F024888 | Metagenome | 204 | Y |
| F035075 | Metagenome / Metatranscriptome | 173 | Y |
| F038419 | Metagenome / Metatranscriptome | 166 | Y |
| F038848 | Metagenome | 165 | Y |
| F043979 | Metagenome | 155 | N |
| F066858 | Metagenome / Metatranscriptome | 126 | N |
| F081449 | Metagenome / Metatranscriptome | 114 | Y |
| F089050 | Metagenome / Metatranscriptome | 109 | Y |
| F097512 | Metagenome / Metatranscriptome | 104 | Y |
| F099450 | Metagenome | 103 | Y |
| F099860 | Metagenome / Metatranscriptome | 103 | Y |
| F101326 | Metagenome | 102 | Y |
| F105490 | Metagenome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0208057_1013711 | All Organisms → Viruses → Predicted Viral | 1620 | Open in IMG/M |
| Ga0208057_1022268 | Not Available | 1108 | Open in IMG/M |
| Ga0208057_1026120 | Not Available | 977 | Open in IMG/M |
| Ga0208057_1026400 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 970 | Open in IMG/M |
| Ga0208057_1029612 | Not Available | 888 | Open in IMG/M |
| Ga0208057_1031251 | Not Available | 854 | Open in IMG/M |
| Ga0208057_1031289 | Not Available | 853 | Open in IMG/M |
| Ga0208057_1031846 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 842 | Open in IMG/M |
| Ga0208057_1038210 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Marinimicrobia → Candidatus Marinimicrobia bacterium | 734 | Open in IMG/M |
| Ga0208057_1045238 | Not Available | 647 | Open in IMG/M |
| Ga0208057_1045265 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 647 | Open in IMG/M |
| Ga0208057_1047116 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 628 | Open in IMG/M |
| Ga0208057_1049896 | Not Available | 602 | Open in IMG/M |
| Ga0208057_1051825 | Not Available | 586 | Open in IMG/M |
| Ga0208057_1053396 | Not Available | 573 | Open in IMG/M |
| Ga0208057_1058399 | Not Available | 536 | Open in IMG/M |
| Ga0208057_1062133 | Not Available | 512 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0208057_1003539 | Ga0208057_10035391 | F035075 | LICGYDNAISVKYVNKKNQRSIWVIQKNIEDEEKWV |
| Ga0208057_1013711 | Ga0208057_10137112 | F004881 | MSYTFFEPSGKPYQECTNPNDPEDKSLCVLVQDGSKFDGAIVRYTTFKLLEQELTGGEIACRYEYEIEVPPHAIKHKITDKEGKEFENKLGKWVIEILQKQMDKHAAKNRSTDTEKSNT |
| Ga0208057_1022268 | Ga0208057_10222681 | F002745 | MKSFKGYLTEFAIQSTSDYVFNAGSDSSALKIPISGPMFKRIWPDTIRATVFHTTDLAGLERLKKLEGGKKSISAFFSMMSRYMETGVATGGDVHVVVEMDADVLVSARDDIMSEVDKQGRRWVMMSWFEYQTRERAKFSKIEKDLNTLIANLVKKHIPKDKEIQQTKHFGKDSGAVFEIWGNMKKHLKGDGRKIAAVIKDYFDGVEKIIKKNKEVMGDIFYGYTKSKRMTDNSWDEQIVNNIDIYKIHVIK |
| Ga0208057_1022268 | Ga0208057_10222682 | F000245 | MKTFTRFKEDSFDASDKKMDAFVVKEIKKRKLAKFPVNATDDIKMRMKPNKPAFTFPSPISDMMIQVFLKKMLPSKGQPKGMMAFNYQLEDK |
| Ga0208057_1026120 | Ga0208057_10261201 | F101326 | IKLYFVTDNQFTNLLSKVRKIYPTKQNLPNTTQSITDKSYTLADTARDIFYDFFVGTKKPLRQPKTRKPFLVEPTRYKSEIVIPNWNPFKAISFLASKAVAANQEAKGANFVFYQTLQGFRFVSIETLMLGGFRLFKEKYENGAAVAKDYPHLATNAILETATTDNTSHIPIYKDDPEIEEDMKPFVASYKYMPANMGESKQSSYESVTSFRLVDSFDTMKNVALGMYANRVITHDLIQMKIDRRDFHYVTPPSTISMIEADGSVSSKQNTEEGDKEKTQIDVSVSTEIGRLCSDNADFLGKPEAHISLVPTTFGQAGALNTGPK |
| Ga0208057_1026400 | Ga0208057_10264001 | F081449 | TLSVDRHGQGKPTGKKLTVKIELHKVDPYKIWWIGEKTFTHRGQEETFVRFTIDQEGKQVGGFTYEKKKFVTPFGNMEVVNPNNDEPTGASSFESGTEESRYHPGDRR |
| Ga0208057_1029612 | Ga0208057_10296123 | F016015 | MINVNKFENRIVKFKRIDSEGNISDKEAEVRRMDYDQAEDIPRSVTARLVDPLNFVITLGYDKSKKKFSGPLGTDIWESNFDIDDFITSSKMGTADRYMRS |
| Ga0208057_1031251 | Ga0208057_10312512 | F097512 | DFSPILKDARAEGYEGLSQVARNILKGKYYTPTDEEVRGF |
| Ga0208057_1031289 | Ga0208057_10312891 | F013649 | MSSILEKELSILERLPSSRRMREHDEAGKRSKKFKKLRKEHFREDWDVDLWD |
| Ga0208057_1031846 | Ga0208057_10318462 | F008720 | MKVVQTEDRSFVRDLHSKALINTDRVALENHRKKRQIEIQQAQQWQQMETKVEELNNMRNEILEIKGLLQEVLN |
| Ga0208057_1038210 | Ga0208057_10382101 | F038848 | LIIKDYMDGMEKVIKKNIDTFSSAMLSYAKKRSTDLSWDEQVVNNIKVKTAHFFKLPIPIAQTDDQIEMSEKQQELIEFTESKGWSTKMWDAAIELEAYTREVAKKELGK |
| Ga0208057_1038952 | Ga0208057_10389521 | F066858 | GERSEDMNAPSTVPGTAINPSFQPRESSMRFCLAYTAVDATELLNTANKLLLTASVGENPTNVNTGTIIIPPPRPIIEPNRPATNPSGMSQILSINVLG |
| Ga0208057_1045238 | Ga0208057_10452381 | F024888 | MGTSTIDLKNVHKIVADEVNARIEIDRKGKTPDEWAQFKGFKDWLGYAAWSRHTGGFYEMAVMMIKAHWKEEDPVEFARQKKIQKDQSLREHSYINLPPEAWPTSVPNKNDFASRFIGNKKKKWQPVHKSNFTAEELEVI |
| Ga0208057_1045265 | Ga0208057_10452651 | F099860 | VNAMEAVGAEWDTIQLNSTDGAPASNAGEKVILEDASESDDGDMTFLQMEKQTTGTQMNYYFTIAGSDGAGELSVARQAITLNSSTFKEAGDTETADSGIMQQVSASIANAAVTMA |
| Ga0208057_1047116 | Ga0208057_10471161 | F015491 | VPARPQVTPRKWSKKTARIFDKQFDRFLYDELYVPTKKAWGENDIVKYERNMYGDSK |
| Ga0208057_1047116 | Ga0208057_10471162 | F005608 | MKKATIEVLEEGELIFGSPTVGKYFVRRYEDGEEMGGGFFKTKKEAVTHVREYKKSE |
| Ga0208057_1049896 | Ga0208057_10498962 | F105490 | MKGGQGKTLRRQGAIERIEPMIEKYEKILPAEKELLKLMRKEKDLPPDNIPTMEKKVKQFEQKLERAKITLENTKKN |
| Ga0208057_1050103 | Ga0208057_10501032 | F099450 | PVIRTIDKEFASEVACWNYYEGDVGESKFGTQHLDHQGNPPTKDFHFGKMDHREYPIRTYRGKDGQGSVWLTCDIKGRYKDL |
| Ga0208057_1051825 | Ga0208057_10518251 | F089050 | MVELNHDVVSFNSDTLIKGGHGVLVNVYVTKVGSGSNKVEFRNGTTGSAPIVFTIFT |
| Ga0208057_1051825 | Ga0208057_10518252 | F001625 | TISSSDWTNANVRKTLSWQAALVSKLRVYAIKVTFGSGDNYATNGVSADLKEGRISTLVAVIPTYSNISREVVYDKANEKIKILEVGGSTGSAMTELANNSSATNSKIFEFLVIGY |
| Ga0208057_1052473 | Ga0208057_10524731 | F014899 | MAEEEQLIDFGFSAVTADEYDRDQTDGEDTGGGGSASPEALSAMDAKIEQIMAHLAAQSKEEDDGWSSSEVAQSLDEKLAQISSKVDEVLVLERDEERAQTTADILFQLNNATEATSATSKKSGEVVGKQDEIMKFLQEMAPKVDKILKLESLEALLEQSSGKLDALTE |
| Ga0208057_1053396 | Ga0208057_10533962 | F038419 | MMLVQAIIVTWIMMTTPEPCPQEYLNLGGDVIEYCELTPAGLGRQWSLKE |
| Ga0208057_1058399 | Ga0208057_10583991 | F002874 | MKEKIDLSLNRSLEELESSYWEFVEATEGEDSSGWSKSVQEDLSALEGYLM |
| Ga0208057_1062133 | Ga0208057_10621331 | F043979 | MALTISSSDWTNANVRKTLSWQAALVSKLRVYAIKVTFGSGDNYATGGVSADLKEGRISTLVAVIPTYSNISREVVYDKANEKIKILEVGGSSGSA |
| ⦗Top⦘ |