| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300020237 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117946 | Gp0115933 | Ga0211478 |
| Sample Name | Marine microbial communities from Tara Oceans - TARA_A100001011 (ERX291767-ERR318621) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | CEA Genoscope |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 16383351 |
| Sequencing Scaffolds | 24 |
| Novel Protein Genes | 26 |
| Associated Families | 25 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → Viruses → Predicted Viral | 4 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 2 |
| Not Available | 14 |
| All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | TARA_030 | |||||||
| Coordinates | Lat. (o) | 33.93 | Long. (o) | 32.7322 | Alt. (m) | N/A | Depth (m) | 70 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000107 | Metagenome / Metatranscriptome | 2222 | Y |
| F001756 | Metagenome / Metatranscriptome | 641 | Y |
| F004819 | Metagenome / Metatranscriptome | 422 | Y |
| F013773 | Metagenome / Metatranscriptome | 268 | N |
| F020823 | Metagenome / Metatranscriptome | 222 | Y |
| F025308 | Metagenome | 202 | N |
| F026125 | Metagenome / Metatranscriptome | 199 | N |
| F026579 | Metagenome / Metatranscriptome | 197 | N |
| F027868 | Metagenome / Metatranscriptome | 193 | Y |
| F028201 | Metagenome / Metatranscriptome | 192 | Y |
| F029784 | Metagenome / Metatranscriptome | 187 | N |
| F032678 | Metagenome / Metatranscriptome | 179 | N |
| F033459 | Metagenome | 177 | Y |
| F034541 | Metagenome / Metatranscriptome | 174 | N |
| F036279 | Metagenome / Metatranscriptome | 170 | N |
| F038721 | Metagenome / Metatranscriptome | 165 | N |
| F048369 | Metagenome / Metatranscriptome | 148 | N |
| F051208 | Metagenome | 144 | N |
| F056670 | Metagenome / Metatranscriptome | 137 | Y |
| F057435 | Metagenome / Metatranscriptome | 136 | N |
| F078814 | Metagenome / Metatranscriptome | 116 | N |
| F079191 | Metagenome | 116 | N |
| F082792 | Metagenome / Metatranscriptome | 113 | Y |
| F085576 | Metagenome / Metatranscriptome | 111 | N |
| F089153 | Metagenome / Metatranscriptome | 109 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0211478_100490 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2454 | Open in IMG/M |
| Ga0211478_100514 | All Organisms → Viruses → Predicted Viral | 2374 | Open in IMG/M |
| Ga0211478_100728 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 1817 | Open in IMG/M |
| Ga0211478_100844 | Not Available | 1612 | Open in IMG/M |
| Ga0211478_101241 | All Organisms → Viruses → Predicted Viral | 1245 | Open in IMG/M |
| Ga0211478_101633 | All Organisms → Viruses → Predicted Viral | 1038 | Open in IMG/M |
| Ga0211478_101665 | All Organisms → Viruses → Predicted Viral | 1023 | Open in IMG/M |
| Ga0211478_101680 | Not Available | 1017 | Open in IMG/M |
| Ga0211478_101752 | Not Available | 988 | Open in IMG/M |
| Ga0211478_102231 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → Candidatus Pelagibacter → unclassified Candidatus Pelagibacter → Candidatus Pelagibacter sp. IMCC9063 | 849 | Open in IMG/M |
| Ga0211478_102813 | Not Available | 742 | Open in IMG/M |
| Ga0211478_102877 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 733 | Open in IMG/M |
| Ga0211478_103220 | Not Available | 680 | Open in IMG/M |
| Ga0211478_103471 | Not Available | 649 | Open in IMG/M |
| Ga0211478_103685 | Not Available | 627 | Open in IMG/M |
| Ga0211478_104008 | Not Available | 596 | Open in IMG/M |
| Ga0211478_104129 | Not Available | 586 | Open in IMG/M |
| Ga0211478_104247 | Not Available | 577 | Open in IMG/M |
| Ga0211478_104505 | Not Available | 557 | Open in IMG/M |
| Ga0211478_104562 | All Organisms → cellular organisms → Bacteria | 553 | Open in IMG/M |
| Ga0211478_104638 | Not Available | 548 | Open in IMG/M |
| Ga0211478_105030 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 523 | Open in IMG/M |
| Ga0211478_105316 | Not Available | 508 | Open in IMG/M |
| Ga0211478_105436 | Not Available | 502 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0211478_100490 | Ga0211478_1004902 | F025308 | MISDEDFRFLLQESNGCKKALEIGTGTGKSSAALKLNCEVYSIDKDDIFEYNIDINRFNCESKEYWLNYMHYDFDFVFIDGSIEKIDCEEILKRTKNSFKIVFHDYMPKEDKDPGKNKGWYNMKVFKESALLNYDMKETQGGSHCGMLVLNKDK |
| Ga0211478_100514 | Ga0211478_1005141 | F034541 | MKAVEAMTWEELESALTEYQVNGNGGGMRVKDIQILHSIEDEISWRREQGYTDLLPREIEIELLEQGKIRERYL |
| Ga0211478_100514 | Ga0211478_1005146 | F057435 | MQKNIERQILLFTEATGLTKRAEVVHGTLFVKFNNPQDKWFFRRALRNFCRIFINRDTGINANDVGDEYAYDIVPEKDEKNWNRTEPLASQVDTQLTMMEDK |
| Ga0211478_100728 | Ga0211478_1007282 | F001756 | MLELNRPRKKLCPKLEKNVKINPNRITFKLKLLNIDKNYDF |
| Ga0211478_100844 | Ga0211478_1008443 | F048369 | MKEDRDSFIEDLADNTPNEGQFDKFMEAEVHDAEEDLLAGKTFKKLKDEVKKGDKNV |
| Ga0211478_101109 | Ga0211478_1011091 | F000107 | KGIGGEEDQIVADAEELESGNNESKVSLIECYYQIKGTGTLTISAVSETNNLTFTGRGKYGLRPDQLKFGDDKQILLTTDSNVDSYLLITEFRRNN |
| Ga0211478_101241 | Ga0211478_1012412 | F036279 | MKGTRHSLKTLFIIFSFVGLQACSVPFANGVTGQDIIKIANAGKNIKNITKEGINEELITETKNILRDIQYGGKAQR |
| Ga0211478_101633 | Ga0211478_1016332 | F013773 | MAKRIVLAGDSFGCEWPNGEGWPLMLAQQHGVNNIAQAGVGEYKILKQLWDLSARDAYWVNNYDCVIVCHTSPSRIHTTEHPVHKEGLHKDCDLIYTDIMDKFDWFNPRLRTAKNWFHHHYDDEYAIDIYNMIRAEIKKFINIPYLAVDHFEISNFYAKEDNVLNLSTTWPKYKGKVNHYSDEGNQIVYNQIIDKLDKIC |
| Ga0211478_101665 | Ga0211478_1016651 | F004819 | MQYYPQDKDPRLDERSARFHARVLKEDLATLPFVLDTCNRDINIARASTYVTWDHDKEMWAEVDHLMMNFYVQARTSETRDELEDKINRGVVELLKGPRYYEQAKVYCMIDMDYPEDESIYDIVKVPKKDRKKAGWGISGGEGEIVYHVTLHVQECNNVDLTIYDNEDRGDFFDLNGNNLESPMADLERIVNG |
| Ga0211478_101680 | Ga0211478_1016801 | F038721 | MAYIGNNTKQTAVDTVDERFDEFKETSIDASKVQTIFLGGDESGVADSPTDAFGVSLNVITTDCNHKTFRRIDMGTVVAQVGVVDFGYVANSN |
| Ga0211478_101752 | Ga0211478_1017522 | F027868 | MDYLALKIPADIEQKITSHTVTDQPEPEEGGGYPFKGDNGAYELCGCDMLQIVPAAYTDKKGRLHLEGDLYCDEEGLLKAAPVHNWRASQMRYWYMHPRSEQLVPDWREWCNVAGDACFVVPATDDNLKIMEDILDS |
| Ga0211478_102231 | Ga0211478_1022312 | F001756 | SLILELNSPIKKLCPKHEKNVKIKPKIITFKLKLLNI |
| Ga0211478_102813 | Ga0211478_1028131 | F085576 | MKQWYLRFKAGNWENGYYEIGENFDLVKEVCKAEFIKDCKALGFTPVTAELTLCEEHRI |
| Ga0211478_102877 | Ga0211478_1028771 | F056670 | MDRINTMNIDKFVEAFTFIGKGPCQEFNCPRQQECAEEKVECKAFRYWVNNDSYDTMRKGKKTSIAIDMERLL |
| Ga0211478_103220 | Ga0211478_1032202 | F078814 | AEKQAVKDVFTMLGNTKGEIISKIDGIIKQVAKKRNVKVSAIEDYFDNEILS |
| Ga0211478_103471 | Ga0211478_1034713 | F029784 | MCTQIYAHPVEGYKCFANANKSQGTYYTCCDLDTKEIRYVTYIYDGYFMGYYLVQSAVKATENYGNCQEKIFLSSASNKWPYYKGNDDEYTIDRQYPMQYDIPSQDEIEYSYLDSLLTTGTNKXILLKIKYSH |
| Ga0211478_103685 | Ga0211478_1036852 | F079191 | MTPNPNMKTVCDNCGATYVVRHDLPDEYIEQYCPFCGEEHENIDDMDEVNWDDED |
| Ga0211478_104008 | Ga0211478_1040081 | F032678 | WQRQNSLDVYNWLVKSFPNIEFKRHENFIAPDLEWGSKGPNIIDEYGKLKSGNQIELRSHAEYVAHTEKLDAWYCGVTQNPDKEFDERLADRDVFIDSLGDKTLDRLIKPHMGGYACHPFTYVKKDWIVAQYKKLGIMDLFDLTRSCEGDANIYPDVFGDLDYKTYVPGSPVPTCGKCFWCKEREWGVANSDEE |
| Ga0211478_104129 | Ga0211478_1041291 | F020823 | MITNLILFYLTIYVFFIWGQNIARTPLDTKVFLII |
| Ga0211478_104247 | Ga0211478_1042471 | F026579 | MAISVTQPSFSETISDVKIIFADKEGEGKPTDDEEPDCE |
| Ga0211478_104505 | Ga0211478_1045051 | F051208 | FYGATVSSGMVGASKANVMIGPNVESHGQFADTTGDEGATIDIYIIGSSNQVHMASWGNDNYQVHDVIGDSNILDVHPDAIGSHVRMVQYGDNNYMKTVTSGNNNTIRYYGSGGSNNAQIYLYTSGSIVELKQTGGSNTANLTVNGDSIYDYTLLVDQDGSDTCTYSFNRNDQTSDTTVQLTNSG |
| Ga0211478_104562 | Ga0211478_1045622 | F026125 | MFNYINSWKSRKSEKWNIEVRLGRITLLQLNYDAKKAKFRFMLLNFGLEIGGK |
| Ga0211478_104638 | Ga0211478_1046382 | F028201 | QIFFYPHFKMIEYNQKQSKEIYKALADSKVDLMEYFLGPDPRKTSYYKSAVRRNSILNDF |
| Ga0211478_105030 | Ga0211478_1050302 | F082792 | MTKKIAKPVQSKKYFHEAIEEEQKILDIGLKMSRQHKKERTESEKLQEELEPIDENI |
| Ga0211478_105316 | Ga0211478_1053161 | F089153 | MAISDYSSHDWRKHTDSAVVVDKNKAMLKVNECKV |
| Ga0211478_105436 | Ga0211478_1054361 | F033459 | MNISRQQVDKLIVGTNDVSYVAPDTSPTGTAVLNGPVYVGKTGASPGYEALL |
| ⦗Top⦘ |