| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300019046 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117946 | Gp0217188 | Ga0193546 |
| Sample Name | Metatranscriptome of marine prokaryotic communities collected during Tara Oceans survey from station TARA_011 - TARA_X100000009 (ERX1408503-ERR1336911) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Canada's Michael Smith Genome Sciences Centre |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 356916566 |
| Sequencing Scaffolds | 34 |
| Novel Protein Genes | 37 |
| Associated Families | 32 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → Viruses → unclassified viruses → Circular genetic element sp. | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium TMED175 | 2 |
| Not Available | 20 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Leisingera → unclassified Leisingera → Leisingera sp. M658 | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → unclassified Pelagibacteraceae → Pelagibacteraceae bacterium TMED201 | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales → unclassified Rickettsiales → Rickettsiales bacterium | 2 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → unclassified Rhodobacterales → Rhodobacterales bacterium TMED271 | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium | 1 |
| All Organisms → Viruses → unclassified viruses → Virus sp. | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine → Marine Viral And Eukaryotic Protist Communities Collected From Different Water Depths During Tara Oceans Survey |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | North Atlantic Ocean: TARA_011 | |||||||
| Coordinates | Lat. (o) | 41.6686 | Long. (o) | 2.7996 | Alt. (m) | N/A | Depth (m) | 5 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000200 | Metagenome / Metatranscriptome | 1633 | Y |
| F000379 | Metagenome / Metatranscriptome | 1211 | Y |
| F000953 | Metagenome / Metatranscriptome | 822 | Y |
| F002223 | Metagenome / Metatranscriptome | 581 | Y |
| F005911 | Metagenome / Metatranscriptome | 386 | Y |
| F006699 | Metagenome / Metatranscriptome | 366 | Y |
| F007531 | Metagenome / Metatranscriptome | 349 | Y |
| F007722 | Metagenome / Metatranscriptome | 346 | Y |
| F008818 | Metatranscriptome | 327 | Y |
| F009636 | Metagenome / Metatranscriptome | 315 | Y |
| F009725 | Metagenome / Metatranscriptome | 314 | Y |
| F010768 | Metagenome / Metatranscriptome | 299 | Y |
| F013311 | Metagenome / Metatranscriptome | 272 | Y |
| F019653 | Metagenome / Metatranscriptome | 228 | Y |
| F020468 | Metagenome / Metatranscriptome | 224 | Y |
| F021115 | Metagenome / Metatranscriptome | 220 | N |
| F023874 | Metagenome / Metatranscriptome | 208 | Y |
| F029759 | Metagenome / Metatranscriptome | 187 | Y |
| F037682 | Metagenome / Metatranscriptome | 167 | Y |
| F042354 | Metagenome / Metatranscriptome | 158 | Y |
| F053091 | Metagenome / Metatranscriptome | 141 | N |
| F059061 | Metagenome / Metatranscriptome | 134 | Y |
| F067842 | Metagenome / Metatranscriptome | 125 | Y |
| F068862 | Metatranscriptome | 124 | N |
| F074917 | Metagenome / Metatranscriptome | 119 | N |
| F084330 | Metagenome / Metatranscriptome | 112 | Y |
| F085817 | Metagenome / Metatranscriptome | 111 | Y |
| F089015 | Metagenome / Metatranscriptome | 109 | N |
| F090440 | Metagenome / Metatranscriptome | 108 | N |
| F097143 | Metagenome / Metatranscriptome | 104 | Y |
| F098216 | Metagenome / Metatranscriptome | 104 | N |
| F105327 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0193546_10012650 | All Organisms → Viruses → Predicted Viral | 1103 | Open in IMG/M |
| Ga0193546_10013632 | All Organisms → Viruses → unclassified viruses → Circular genetic element sp. | 1068 | Open in IMG/M |
| Ga0193546_10016130 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium TMED175 | 990 | Open in IMG/M |
| Ga0193546_10016414 | Not Available | 982 | Open in IMG/M |
| Ga0193546_10018394 | Not Available | 929 | Open in IMG/M |
| Ga0193546_10018784 | Not Available | 920 | Open in IMG/M |
| Ga0193546_10018922 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium TMED175 | 917 | Open in IMG/M |
| Ga0193546_10020184 | Not Available | 889 | Open in IMG/M |
| Ga0193546_10021864 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 856 | Open in IMG/M |
| Ga0193546_10022818 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Roseobacteraceae → Leisingera → unclassified Leisingera → Leisingera sp. M658 | 838 | Open in IMG/M |
| Ga0193546_10022984 | Not Available | 835 | Open in IMG/M |
| Ga0193546_10023684 | Not Available | 823 | Open in IMG/M |
| Ga0193546_10026355 | Not Available | 780 | Open in IMG/M |
| Ga0193546_10036010 | Not Available | 668 | Open in IMG/M |
| Ga0193546_10036117 | Not Available | 667 | Open in IMG/M |
| Ga0193546_10038066 | Not Available | 650 | Open in IMG/M |
| Ga0193546_10038152 | Not Available | 650 | Open in IMG/M |
| Ga0193546_10039215 | Not Available | 641 | Open in IMG/M |
| Ga0193546_10040267 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Pelagibacterales → Pelagibacteraceae → unclassified Pelagibacteraceae → Pelagibacteraceae bacterium TMED201 | 633 | Open in IMG/M |
| Ga0193546_10043121 | Not Available | 611 | Open in IMG/M |
| Ga0193546_10044630 | Not Available | 601 | Open in IMG/M |
| Ga0193546_10046234 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales → unclassified Rickettsiales → Rickettsiales bacterium | 591 | Open in IMG/M |
| Ga0193546_10047335 | All Organisms → cellular organisms → Bacteria | 584 | Open in IMG/M |
| Ga0193546_10048743 | Not Available | 576 | Open in IMG/M |
| Ga0193546_10051009 | Not Available | 563 | Open in IMG/M |
| Ga0193546_10051921 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rickettsiales → unclassified Rickettsiales → Rickettsiales bacterium | 558 | Open in IMG/M |
| Ga0193546_10054731 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → unclassified Rhodobacterales → Rhodobacterales bacterium TMED271 | 544 | Open in IMG/M |
| Ga0193546_10056501 | Not Available | 536 | Open in IMG/M |
| Ga0193546_10057034 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 533 | Open in IMG/M |
| Ga0193546_10060102 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Oligotrichia → Strombidiidae → Strombidium | 520 | Open in IMG/M |
| Ga0193546_10060619 | All Organisms → Viruses → unclassified viruses → Virus sp. | 518 | Open in IMG/M |
| Ga0193546_10063022 | Not Available | 508 | Open in IMG/M |
| Ga0193546_10063503 | Not Available | 506 | Open in IMG/M |
| Ga0193546_10065199 | Not Available | 500 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0193546_10012650 | Ga0193546_100126502 | F097143 | MIIETILGVVISAVVGGDTNPDPATMIYKGLMTGNKIIQKEKNKEIDDSIHEETHQLAKDAILKMGGDN |
| Ga0193546_10013632 | Ga0193546_100136322 | F013311 | LAFLGDFGKFFNLGSSEQVLGDVGTAVGSLFGSQGAIVGRELGKATGRATSNLAGDDNISPLSTADQALEQSAVSVSKRGDVPQEINQNLGITGGNTMQAGIGGIAPAIQGIRALAPKIGQFFGGTGGALTTGFGTGVALNALTGTGMSREKPVLTQSRRNKARVRQLVNFMGIEGASNFLSQASGQNVTANDVVMLLLRTFRNDGAYITKAQVRNLRRTTNRFKSLEKQVKEATSMTRTTRRAPMRRASSTTLIKN |
| Ga0193546_10013632 | Ga0193546_100136323 | F007722 | GVATLTGDIGTIKKSIQTLNNRINGLNSPKLNDELLQYAIKQQSNVTDIRKNNIGG |
| Ga0193546_10016130 | Ga0193546_100161301 | F002223 | MKYKQMSQNSWIVEVQKDGKTKELFIEFPTGSLDQVGWDTGDKVIWEELPTGEFSLRKADDGDGEQGNK |
| Ga0193546_10016414 | Ga0193546_100164141 | F029759 | GMFTYIHAFWALLGLNLMVFGWALYAHARVRANEKVLESLDWETLANLTGEVGAMKRSLQKVNNRINGMTAADPVQMLQELPKLQNATQPNGRIGG |
| Ga0193546_10018394 | Ga0193546_100183942 | F006699 | MLKRFDFKCTVCDHIEEQWVDSEDTLATCLECGETTKRIISSVSSQFKGTGWPDADDRWARDHERAARK |
| Ga0193546_10018784 | Ga0193546_100187842 | F019653 | MTTLVANYVFSPLSGLWSSLDRFTQVIGYSRAAAELARMGMIEESKRCMMEIGRLQNDR |
| Ga0193546_10018922 | Ga0193546_100189222 | F037682 | MKQEELVREXXXXGERDFYRAKLDESNARVKTLEFDLAELQKRDVELSKRCSELANKGNYRPKPKRFH |
| Ga0193546_10018922 | Ga0193546_100189224 | F002223 | MKYKQMSQNSWIVEVQKDGKTKELFIEFPTGSLDQVGWDTGDKVIWEELPTGEFSLRKAD |
| Ga0193546_10020184 | Ga0193546_100201843 | F009725 | MLIWERFKNACVSAGYARAASQLATQGHYEEAKHLMLNGVKETVERKKAIRRLERVKKAKASYEPGNYYMKGKKVA |
| Ga0193546_10021864 | Ga0193546_100218642 | F089015 | MKMSKMTAEQIVEMLARMSDAEKQKTANLLVNKWFHLTKSFVSMVDAEMQDQFMNEQAEVFEMQKAAENGTKLF |
| Ga0193546_10022818 | Ga0193546_100228183 | F042354 | REDAGGTKYTGDGGVVNLDGKISSSINYSLSISDGKFDDVVYNNSEGSVTYMVLPNIGVDLMGSQIKLGTVQETDTSLGVSYNLYASSLDMKVFVGSDINNYGKFYTYGTKLNLSVTQGSRLTLSYKTEDRKQKATTMDARFVYDLTSNLGLNLGYKSTETKNAAGTAVVLKGNTTYGGIVYKF |
| Ga0193546_10022984 | Ga0193546_100229841 | F085817 | MSSNLKVNKRSQKSFSVTRFSGFDGSRVQITTARDHRKEIDVSDKFFNSVNLNKQEAQDLVETLMGFINGTDNECFDSVPLVEAKKQWMDDFNKVRPGTSRIVKRDNGDVVVERPTFSKLLQENQKKLSQNDLNYFNSRRMA |
| Ga0193546_10023684 | Ga0193546_100236841 | F068862 | RHTTSFLIFNIMLYKLFIALATLLVVNANAPSERSVHISTNTDVDGLNGLNLNWQVPFKVDDYVVGFRYKLNELKKYPETLFAKKIFDVADGEATVDADVNVDDKSLSANVKWVSDKLVDGMKTTLRLNGNSNDKITSVGAEVNKNVDGRDVELKGTYNLADSRLDANGKVVVDKTTAEVSYNSGDEDIRLQLSHDLDDHNTPKGSYSTKTGQVAYGWTRKWEGGELDGTYHPDNGGRAVLEWTDKGNQGDWKTRAEVPLANNEIGNSRVTIKR |
| Ga0193546_10026355 | Ga0193546_100263551 | F005911 | MLKNKILNFLNSKKGNALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSIKAKIDGADINQARLGAGDEDQIEAPVNELFHNGNFIKLSEMVKKAVIIVSDDPTATARKGYDIGYDAENSGVLIKFAAVDGNIIQPDQGDADDTIVADVHLFVNLAGTADTDSNSPYVDGTPFYYILMDSTTAGLAASLETIDSTIYASGILATNGGGPQAEVSVVLPQDSEE |
| Ga0193546_10036010 | Ga0193546_100360102 | F098216 | VPKLIIGMLVAGVLLAINTTSETQDQVKFCDQYLLVAEHMNKTYSISNYMKKNGVGGVLETNHCKKKQGRI |
| Ga0193546_10036117 | Ga0193546_100361172 | F053091 | MNFNSYYLWPKKFVEDTVNTEEMAERPFVDEYTEEILNKEIYDLEEEDLKSLINQLSYTASFYARLRRTVMNHTGRIMLERMM |
| Ga0193546_10038066 | Ga0193546_100380661 | F090440 | GGNEISGLKLATGLRLKRIYHCRITDVGEGHDLRIIIHKSIGG |
| Ga0193546_10038152 | Ga0193546_100381521 | F068862 | GGLNLNWQVPFKVDDYVVGFRYKLNELKKYPETLFAKKTFEVADGEATVDADVSVDDKSLSANVKWVSDKLVDGMKTTLRMNGNSNDKVTSVGAEVNKNVDGRDVELKGTYNLADSRLDANGKVVVDKTTAEVSYNSGDEDIRLQLSHDLDDNNTPKGSYSTKTGQVAYGWTRKWEGGELDGTYHPDNGGRAVLEWTDKGQQGDWKTRAEVPLANN |
| Ga0193546_10039215 | Ga0193546_100392151 | F105327 | RMSNYIRTINMDGPAGNAMNLIATAKHMAKDHGEDGKKIVKEMMDTGEYDILVQTLLFYFGDYLRLVNSSGKDITDNYIGADNG |
| Ga0193546_10040267 | Ga0193546_100402672 | F009636 | MGYFYQDWKEKKMFVENSDGXXXXGNLTEGASDEKRMAIGYMEYLADCLKKGKLEVKWNI |
| Ga0193546_10043121 | Ga0193546_100431212 | F085817 | RGFDGSRVQITTARDHRKEISVSDKFFNSVNLNKQEAMDLVETLMGFINGTDNECFDSVALVEAEKQWMDDFNKVRPGTSRIVKRDNGDVVVERPTFSKLLQENQKKLSQNDLNYFNSRRMA |
| Ga0193546_10044630 | Ga0193546_100446301 | F074917 | MRFAKTKPQKPTSFLATETKDKLDTIRQNPIDQKKKQCGDRCHYKHSQCSSAGFTPRRPRNPVHFLTDLSNELCGGHFSHRYLPIQVLINHDMTVSTLRFPWVFDQANVQHP |
| Ga0193546_10046234 | Ga0193546_100462343 | F023874 | GVDGFGDVIESNLSWNKEDMRKNVGHLFLFMKGTWQYSDNGIDWWPAEEMLKEVA |
| Ga0193546_10047335 | Ga0193546_100473351 | F021115 | MRNHDISNELDDPLSSGKIGAYSGSVKARFYKDGKLYKDIPPVDIHSELRSVLCNSKIKIKSYNVSYDKGCLVINLPDNIKGSLANLIIVKTLTTV |
| Ga0193546_10048743 | Ga0193546_100487431 | F007531 | MQKKIEKQIVEFAKDTGLTERREIVHGTLYVKLHEEKECDTFHRAIRTFYKEQINNEGGVNMYAVGDEFAFDFVPADRETPVFTDEEEKGIWSEFAEEGLEKDMPDDVDTLLNLEXXXXAESKRGK |
| Ga0193546_10051009 | Ga0193546_100510091 | F006699 | GGDHIEEQWVDSEDTLATCLECGETTKRIISSVSSQFKGTGWPDADDRWARDHERAARK |
| Ga0193546_10051921 | Ga0193546_100519211 | F000379 | ATKHGIMVGYIDGSQNVVAVESGWKITPEEMAKFLTEKFPSNEKAREVVDSPVLIPTFKKDDYCYVDGFGDVIESNLSWNKEEMRKKVGHLFLFMKGAWQYSDNGIDWEPADEALPMPREVA |
| Ga0193546_10054731 | Ga0193546_100547312 | F084330 | MASRLATFLKNIFGKTFYSNIWIATSLGLVSQVIGFTKYDGYFDLYWIIMKDGYLYFNLYPSFDELSFVLFIRVFLAIFVFLTIRDKLKGKNDLDA |
| Ga0193546_10056501 | Ga0193546_100565012 | F067842 | MHEYQTCIXXXXGAVTLSVVECGYSEKNSTDNAFDTFKCLALNPNEILKIETKGI |
| Ga0193546_10057034 | Ga0193546_100570341 | F010768 | VRPVPPPNSEITLIIAKRYERKRIKIEKLFTLGYTTSGDPXKIGTK |
| Ga0193546_10058295 | Ga0193546_100582951 | F000200 | GIYDFRCTNGHVFEDMVESGVTTSRCGCGANATKMVSAPQCVLEGHSGDFPGRHMKWVREHEAAGRKKSPQ |
| Ga0193546_10060102 | Ga0193546_100601021 | F000953 | GDDIFMRSMIEQYALEERTKVHEADDGLKTGGEPSGKFWMNEAAANAAAREVLTTHKGLTGKLLDDYMTTYFAKAWGHFDVNQTGFVEVIKMPQMXXXXMRFLCSDQYMSLGESG |
| Ga0193546_10060619 | Ga0193546_100606191 | F059061 | GIAPFFGKSVQQQPAITTTRNIGGQETQGSGSIQAGAGSLLPTITGLGRQFLKSPGMQIGTGLLGLAAGELLGGQPTGMRITRKMKSQARTVLNMVGGDISAASQILGVDEQTLVMILLKRFRNDGPVVTKAALRKTKQTVRRLKSMCDMYDDLRPRAATRRRTPMKRASTT |
| Ga0193546_10063022 | Ga0193546_100630222 | F098216 | VPKLIISITLVAGFLLAINTTSETQDQVKFCDKYVLIADQMNKTYSISNYMKENGVEGVLETNHCKNKQG |
| Ga0193546_10063503 | Ga0193546_100635031 | F020468 | GDQILAGTTYEYVGPNTRLVVAAAADAAGTKLNFNVNNAEFARDAEVSEKVTGEPFGWKGGYVMNDMVTTAVERNRPIITFTNNSGASRTIDVAVFIATLTCRHKALRTTAVSRTKP |
| Ga0193546_10065199 | Ga0193546_100651991 | F008818 | GEKSDSETNIAALAKAIPAIEKGMSGFLQTSAASVLRSLSVSMDMSSVDREMLASFLSSKTGYAPASGEIVGILKQMDDEMKADLASATANEEAAIKAFDGLVAAKTKEINALTKSIETKTARVGEIAVKTAEMENDLDDTKEDLEESKKFLADLDVNCEKKKKER |
| ⦗Top⦘ |