| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300009346 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117984 | Gp0126428 | Ga0103839 |
| Sample Name | Microbial communities of water from the North Atlantic ocean - ACM14 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | University of Georgia |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 48730475 |
| Sequencing Scaffolds | 15 |
| Novel Protein Genes | 18 |
| Associated Families | 18 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 8 |
| All Organisms → cellular organisms → Eukaryota | 2 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Aokuangvirus → Aokuangvirus SCBWM1 | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → surface water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | North Pacific Ocean | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F002457 | Metagenome / Metatranscriptome | 557 | Y |
| F004358 | Metagenome / Metatranscriptome | 442 | Y |
| F005505 | Metagenome / Metatranscriptome | 398 | Y |
| F005911 | Metagenome / Metatranscriptome | 386 | Y |
| F006548 | Metatranscriptome | 370 | Y |
| F018020 | Metagenome / Metatranscriptome | 237 | Y |
| F020014 | Metagenome / Metatranscriptome | 226 | Y |
| F020468 | Metagenome / Metatranscriptome | 224 | Y |
| F028825 | Metagenome / Metatranscriptome | 190 | N |
| F031118 | Metagenome / Metatranscriptome | 183 | N |
| F042640 | Metagenome / Metatranscriptome | 158 | N |
| F047358 | Metagenome / Metatranscriptome | 150 | N |
| F063725 | Metagenome / Metatranscriptome | 129 | N |
| F064686 | Metagenome / Metatranscriptome | 128 | Y |
| F065416 | Metagenome / Metatranscriptome | 127 | Y |
| F083374 | Metagenome / Metatranscriptome | 113 | Y |
| F086309 | Metagenome / Metatranscriptome | 111 | Y |
| F093788 | Metagenome / Metatranscriptome | 106 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0103839_1002833 | Not Available | 967 | Open in IMG/M |
| Ga0103839_1004531 | All Organisms → cellular organisms → Eukaryota | 803 | Open in IMG/M |
| Ga0103839_1004673 | Not Available | 794 | Open in IMG/M |
| Ga0103839_1005148 | Not Available | 763 | Open in IMG/M |
| Ga0103839_1005626 | All Organisms → cellular organisms → Bacteria | 735 | Open in IMG/M |
| Ga0103839_1006141 | Not Available | 709 | Open in IMG/M |
| Ga0103839_1007708 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae | 646 | Open in IMG/M |
| Ga0103839_1008240 | Not Available | 629 | Open in IMG/M |
| Ga0103839_1008264 | All Organisms → cellular organisms → Eukaryota | 629 | Open in IMG/M |
| Ga0103839_1008869 | Not Available | 610 | Open in IMG/M |
| Ga0103839_1010208 | Not Available | 576 | Open in IMG/M |
| Ga0103839_1012136 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 540 | Open in IMG/M |
| Ga0103839_1012709 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 530 | Open in IMG/M |
| Ga0103839_1014014 | Not Available | 510 | Open in IMG/M |
| Ga0103839_1014099 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Aokuangvirus → Aokuangvirus SCBWM1 | 509 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0103839_1002047 | Ga0103839_10020472 | F028825 | MSNTPLQGALESFTKLTDTLQECIKSHDIEGAMTLAKERHDALVNLLEDANIDQTQRANCADTTLGHLLKEQLLAKSNSDQNRSDFIARKSAYRAYALKAA* |
| Ga0103839_1002833 | Ga0103839_10028331 | F047358 | MNKKYIIIGAVVAAIALFTMCGKCEAQEVVKEKNWELNTEVGYYEKRISGGLYGAQDSAYVKASTKLGNFQGLAFVGSLEYVNTEDYQLHGTIGTYLNTVVGGIDTRLVVHTGENADTTFELNGAYDLNWFEFVDTSVTVAFEDGSEAGTSTDSIITTPAFNVSKTFDAKYVDITVGGEYGQSFGYDEDFEYLHGYVRVKSTINNVIPVFVQFNALKNDLGIVNDISSEGTDGDFDTSVTVGLAYSF* |
| Ga0103839_1004531 | Ga0103839_10045312 | F018020 | MTEKVAMILSGYSSLILEMRRVPIPDPVPPPREWVT* |
| Ga0103839_1004673 | Ga0103839_10046732 | F064686 | MKNKITHEEYQKMWFALNDGIITEDEWRVFCDALFAQTLEENKDVMVRLKNR* |
| Ga0103839_1005139 | Ga0103839_10051392 | F083374 | ASTLDDGTAPEDFDVDAPASGEGAIGAQSKQMYEELSGWIGEMDRFSQYLNGTTDSIQTSLNAAEPDTIFDSISNAETKKIARVAMEISSLSEILKGYLAGANDPKYKFN* |
| Ga0103839_1005148 | Ga0103839_10051481 | F005911 | KKGETMCLKDRLLQMLKSKKGNALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSLKGKIDGADINQARLGSKTEDDIEAPIDDVFHNGKFISLKTMVKKAIIIAADDPTATARGGVDTPYDLVNSGVLIKYADAGGQVITPTTTGSGTATIVADVQLFVNLAGTPDADSNSPYVDGEPFYYILMDANTAGLADSEHTVDLEQFPAGILATNDGGPQAEVSVVLPQDDED* |
| Ga0103839_1005626 | Ga0103839_10056262 | F031118 | VVTQSRCEALAETLAEAEKLKMYRSDGTTYCEKCDSLVVRIQSALADVHVEDLTADEIEIIKSIKEKVISEEVFAKSSSILRNPKEWLGTLNE* |
| Ga0103839_1006141 | Ga0103839_10061411 | F020468 | MPLVKKDITLAAGATSEQILAGTTYEYVSQNTRLIVAATAGDGAGGIAADDSTGVVMNFNVNNAEFSRDAAVSPAITGEAFGWKGNYVMNDMVTTAAERNRPVITFTNNSGAARNVSVAVFIGG* |
| Ga0103839_1007708 | Ga0103839_10077083 | F093788 | DSDPLNLSITTMTEFNPRSYILTQLAYAEEQLMIADDMYSKITWGNRCDALEAALADLEAA* |
| Ga0103839_1008240 | Ga0103839_10082401 | F006548 | AELRETLQGLAVDLLALHPVDGMSSVRDGMRTVSGNMDLKTALKAIDHQKLPTDVQAMVKTSSSSKGAFSEESMAKARIALNDLVEKAWVELDDKIIECKEYQEMNRATFDQVVTDISRLVEQITDLERVETESLEGISKMEMQIKDVETEMSKETKIYNYNYAKNSEELTRRQNDLDVFQFILTLTRCPDATSLLQSGLNETRICAVK |
| Ga0103839_1008264 | Ga0103839_10082642 | F020014 | SYKDDPRPEYAPKKNSVSKPRAASPPTVATIQRGILPISLILIVVIKK* |
| Ga0103839_1008610 | Ga0103839_10086101 | F002457 | AARKAMFDAVDSKQGAARGWIGAAQFTDWATTHIAGKIAEIDTASEVDFYHVANYAEADFLKAIEVAVTNKNSREYASLYEFLLTAFVETDATCRGEITYAEFNKLIERAAAVPRTFGLAPPEASKEVRKQFFDSMEDKQMGGVTFRLLLAWTIEHSKGKIAAQKAGKGYKK* |
| Ga0103839_1008869 | Ga0103839_10088692 | F065416 | MSPTRLGYKELRLELVKPLQDQPHKS*QPPIADEKKDEGARDPIKILLEEALERQRNAMMDNFA |
| Ga0103839_1010208 | Ga0103839_10102081 | F086309 | MNVKQAKAMAFGSDPSGTLDDYYQAWQWLYDHKVELSEADENYMDKLIC |
| Ga0103839_1012136 | Ga0103839_10121361 | F004358 | ITMLLTTLGATGMGSMLKIVAGTIQSINDSRQQKAQRELARDLAMSNANAAFQKAVFEGGSEQESMFTRGTRRIIALIGMLNFATISILCTIWPSTTLVTFTPPENKESISILFGLVKFPSGADVTTAITTGHISLVSIATLGAIIGFYFTPGGKN* |
| Ga0103839_1012709 | Ga0103839_10127091 | F005505 | DKSGTYVS*FYDAFLKEIQDAWY*VLYIYVYFFLHHFNGSTVNYFFFER*NIAELDEIRFYGVAPH*YFRPYMGILVISPTHYEGLM*MGLFLGSLACLPLIYNVYNTFNKYVSTIPMQNSILQTTTFTLFMMSLYCANSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLC*LL |
| Ga0103839_1014014 | Ga0103839_10140141 | F042640 | LNLSRCTTTTREKSMSSEARKDFDEWFVNHYEELVHTARRFHDFPRDLVHHTYLQCINALESNENILDNLPGYFHTSMWNQAQNLFRKLYEIHETPTSNVVSDYDISEAIKKEEAMIMTNHLAWFDRTVLSLYLDGWSMAQIARESGINVSVLYESISKSKNKLRHVIRQ |
| Ga0103839_1014099 | Ga0103839_10140991 | F063725 | PAFAYTYQLTGTPAVRPEYYIRERRVVRAEITVERAINMTGLGGTGLIGSAFYIDNVFSD |
| ⦗Top⦘ |