NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300021907

3300021907: Marine eukaryotic communities from Monterey Bay, California, United States - M1_20Mar14CPVII9sort6BwellE17



Overview

Basic Information
IMG/M Taxon OID3300021907 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0111469 | Gp0242137 | Ga0214480
Sample NameMarine eukaryotic communities from Monterey Bay, California, United States - M1_20Mar14CPVII9sort6BwellE17
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size51995206
Sequencing Scaffolds32
Novel Protein Genes39
Associated Families10

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana6
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta6
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira2
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica2
Not Available10
All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED883
All Organisms → Viruses → Predicted Viral2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameMarine Eukaryotic Communities From Various Locations To Study Complex Ecological Interactions
TypeEnvironmental
TaxonomyEnvironmental → Aquatic → Marine → Unclassified → Unclassified → Seawater → Marine Eukaryotic Communities From Various Locations To Study Complex Ecological Interactions

Alternative Ecosystem Assignments
Environment Ontology (ENVO)marine biomebaysea water
Earth Microbiome Project Ontology (EMPO)Free-living → Saline → Water (saline)

Location Information
LocationUSA: California
CoordinatesLat. (o)36.746Long. (o)-122.0257Alt. (m)N/ADepth (m)0
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000866Metagenome / Metatranscriptome855Y
F010163Metagenome / Metatranscriptome307Y
F019658Metagenome / Metatranscriptome228Y
F026012Metagenome / Metatranscriptome199Y
F041807Metagenome / Metatranscriptome159N
F051162Metagenome / Metatranscriptome144Y
F057890Metagenome / Metatranscriptome135N
F064770Metagenome / Metatranscriptome128Y
F094948Metagenome / Metatranscriptome105Y
F100378Metagenome / Metatranscriptome102Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0214480_100041All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana45124Open in IMG/M
Ga0214480_100112All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta34523Open in IMG/M
Ga0214480_100155All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta32007Open in IMG/M
Ga0214480_100532All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira19064Open in IMG/M
Ga0214480_100708All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica16329Open in IMG/M
Ga0214480_100971All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana13529Open in IMG/M
Ga0214480_101183All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta11738Open in IMG/M
Ga0214480_101276All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana11078Open in IMG/M
Ga0214480_101292All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana10971Open in IMG/M
Ga0214480_101507All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta9402Open in IMG/M
Ga0214480_102061All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana6621Open in IMG/M
Ga0214480_102452Not Available5357Open in IMG/M
Ga0214480_102760All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana4592Open in IMG/M
Ga0214480_103033All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira4032Open in IMG/M
Ga0214480_103383Not Available3450Open in IMG/M
Ga0214480_103639All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED883062Open in IMG/M
Ga0214480_103822All Organisms → Viruses → Predicted Viral2842Open in IMG/M
Ga0214480_103976All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta2661Open in IMG/M
Ga0214480_104140All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica2510Open in IMG/M
Ga0214480_104289All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED882368Open in IMG/M
Ga0214480_106615All Organisms → Viruses → Predicted Viral1131Open in IMG/M
Ga0214480_106841Not Available1068Open in IMG/M
Ga0214480_107482All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta917Open in IMG/M
Ga0214480_107724All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED88873Open in IMG/M
Ga0214480_107874Not Available843Open in IMG/M
Ga0214480_108309Not Available767Open in IMG/M
Ga0214480_109014Not Available671Open in IMG/M
Ga0214480_109038Not Available668Open in IMG/M
Ga0214480_109450Not Available623Open in IMG/M
Ga0214480_109662Not Available601Open in IMG/M
Ga0214480_110093Not Available561Open in IMG/M
Ga0214480_110706All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria513Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0214480_100041Ga0214480_10004130F000866MGYVISLLFHIPFIADWHKIGEYRQIQTDNNTKRENKSCIDFDYVVGGELLLRKEGILRKSESRYTGPWTITQVHTNGNIRIQCGNKSERLHIRRVIPYY
Ga0214480_100112Ga0214480_10011235F094948TAAWVLDCHYNIGHVLSSPQTTTRGVGVAKCWADLEILAQSEARCRPTHGCKSATDTSTKVLQVFINLLGTQAMPIHQYIC
Ga0214480_100155Ga0214480_10015516F019658MKINRIKSQNYKLFKYNLLKLQIYSNESIFDLSNFSNSTLEQIEAYLKQVLKIIFEYHVXQFKILFIGFPVVCKMKQMKLINFTNHNFISEKSXVSGVFRNRFSILTYLKLIQLQSFSKSLKLLLTIKTKPHLVVVFNQKVELNTINEFYKSGIPILSFNXNSFDTLKVAYHTLGNFNFIEKNIKLTFFFLLYSLLKKTPLKKRRNQNF
Ga0214480_100155Ga0214480_10015533F064770MDFNLKTYKRSRIKHYFKRINFFFFFHGTSLNNESXIKTEQVFVNYGLKYFRVLNKLMINTLKNSIFKNLVVLIHGPIILLSNPNTKLTFKESKSLNPLISLLSFRLNNKIYSKKQIKNLNRMSYLENISIFHNSMKTCIKLPYYKFKNKRTPVVSK
Ga0214480_100155Ga0214480_10015536F010163MLLLKIANKSSFSQNLINHNYIKYLFHNIKVLKFIKKDAFSYYTQXLFLFISTNLYNLKIKTSCLMSYISFFNSQKLISYVININLSSTNTLINVNSIKGNPKFFYSAGMFSLQKNQKTRQPKAIITILRALLAKSKIFKVKPVAVHFNNVFFNHQSYIFKKLKQKIFMKLVTSYNFRSHNGCRLKKKKRIKIRTRTRKL
Ga0214480_100532Ga0214480_10053214F100378MSDAYFSVDIPKSLPIHGMALTSTGRKIEHSKVVVDSTQTYKLNGSVSLKNLGFLVFEV
Ga0214480_100708Ga0214480_10070811F094948MTAAWVLDCHYNIVHVLSSPQTIATGVGVAKCWADFEILAQSEARYQPTHGCKSATSTSTKVLQVFTDLLGTQAMPIHQYIC
Ga0214480_100708Ga0214480_10070812F094948MTAAWVLDCHYNIGHVLSSPQTTTRGVGVAKCWADLEILAQSEVRCRPTHGCKSATDTSTKVLQVFINLFGTQAMPIHQYIC
Ga0214480_100971Ga0214480_1009713F057890MVLSSEGLHLLSNVAAWRHDFETINDKNVALWHRYYTDVINYYIMKETHREWIDNKNKEKKMKKEYSTSDGWEVFRFQISEEHRINRTFNTLNDFLDYWFHEYHNRQESSNRIEPCYMKKDTKKQSGQGPTGVQQEAVERYQKLCYTDKAIYRAIGKEYFRKKVVYDNN
Ga0214480_101183Ga0214480_1011832F057890MVLSSEGLHLLSNVAVWRHDFETINDKNVAMWHRYYTDVINFYIMKETHREWIENKNKEKMMKKEYSTSDGWEVFRFQVSEEHRINRTFDNLNDFLDYWFHEYHNRQESRNRIEPCYMKNHTKEQPGQGPTGVQQEALERYQNLSYTDKAIYRAIGKEYFRKKVDYSHMSNVNN
Ga0214480_101276Ga0214480_1012765F057890MVLSSEGLHLLSNVAVWRHDFETINDKNVAMWHRYYIDVINYYIMKENDCQLIISNNDKKEYSTPDGWEVFRFQVSEEHRINQTFNTLNDFLEYWFHEYNNRQESSRLEPVYMKKQTKNQSGECPTGVQQEASERYQKLSYTDKAIYRAIGKEYFRKKK
Ga0214480_101292Ga0214480_1012928F057890MVLSSEGLHLLSNVAVWRHNFETINDKNVAMWHHHYTDVINFYIMKENHREWIDNNNKKKKEKKEYSTSDGWDVIRFQVSEERRINRTFNTLNDFLDYWFHEYHNRQESSRLEPIYMKKHTKKQSGHPTGVQQEALERYQKLSYTDKAIYRAIGKEYFRKKSDLLTSDIVDTTQSNRLT
Ga0214480_101507Ga0214480_10150710F094948MTAVWVLDCHYNIVHVLSSPQTITTGVGVAKCWADLEILAQSEVRCWPTHGCKSATSTSTKVLQVFINLLGTQAMPIHQYICQPTSLGGARQPNIAQMLVRF
Ga0214480_102061Ga0214480_1020617F100378MSDAYAIVHIPKSLPTHGMALTSTGRKIEHNKVVVDTQMYESNGSVSLEKSGFSPLRGVAVLL
Ga0214480_102452Ga0214480_1024524F057890MVLSSEGLHLLSSVAVWHHDFETINDKNVALWHHYYTDVINFYIMKENHREWIDNNNKKKKEKKEYSTSDGWDVIRFQVSEEHRINRTFDTLNDFLDYWFHEYNNRQESSRLEPMYMKKQTKKQSEQGQTGVQHEASDRYQKLSYTEKAIYRAIGKEYFRKKK
Ga0214480_102760Ga0214480_1027605F100378NSTMSDAYAIVDIPKSLPIHGMALTSTGRKIEHNKVVVGTQTSESLSWISLEKSGFSRLRGVAVLL
Ga0214480_103033Ga0214480_1030334F094948MTAAWVLDCHYNIVHVLSSPQTITTGVGVAKCWDYLEILAQSEARCRPTHGCKSATSTSTKARQVFINLFGTQAMPIHQYIC
Ga0214480_103383Ga0214480_1033831F100378MSDAYAPTHIPKSLPTHGMALTSTGRKIEHNKVVVDTQMYESNGSVSLVKSGFSPLRGVAVLL
Ga0214480_103383Ga0214480_1033832F100378MSDAYAIVHIPTSLPIHGMALTSTGRKIEHNKVVVDTQMYESNGSVSLEKSGFSPLRGVAVLL
Ga0214480_103383Ga0214480_1033833F100378MSDAYAIVHIPKSLPTHGMALTSTGRKIEHNKVVVDTQMYESNGSVSLEKSGISPLRGVAVLL
Ga0214480_103639Ga0214480_1036392F057890MVLSSEGLHLLSNVAVWRHDFETINDKNVALWHRYYTDVINYYIMKETHREWIDNKNKEKKMKKEYSTSDGWEVFRFQVSEEHRINRTFNTLNDFLDYWFHEYNNRQESSNRIEPCYMKNHTKKQSGQGPTGVQQEALERYQKLSYTDKAIYRAIGKEYFRKKVVYSHMSNVNN
Ga0214480_103822Ga0214480_1038223F057890MVLSSEGLHLLSNVAVWRHDFETINDKNVALWHRYYTDAINFYIMKETHREWIDNKNKEKKMKKEYSTSDGWEVFRFQISEEHRINRTFNNLNDFLDYWFHEYHNRQESSNRIEPCYMKKDTKKQSGQGPTGV
Ga0214480_103976Ga0214480_1039761F051162MKKSIPLSALQSKLCNIWGTKPETTKIILQRVEDWTADSFNEDVYIDIRAYGKEERTRDFVLDGMKDVQKAFGEHDLVANVRLETYDGERYFHVPPPK
Ga0214480_104140Ga0214480_1041402F057890MVLLSEGLHLLSNVAVWRHDFETISDKNVALWHRYYTDVISFHIMKENHHREWIINNNNKKNKKKEYSDSDGWDVFRFQVSEEHRINRTFNTLNDFLKYWFHEYNNRQESSRLEPMYMKKQTKNQSGECPTGVQQEASERYQKLSYTDKAIYRAVAKEYFRKKK
Ga0214480_104289Ga0214480_1042891F057890MVLSSEGLHLLSNIAVWRHDFETINDKNVAMWHHYYTDVINFYIMKENHREWMDNNNKKKEKKKEYSTSDGWDVIRFQVSEEHRINRKFNTLNDFLDYWFHEYHNRQESSRLEPIYMKKHTKKQSGQGPTGVQQEALERYQKLSYTDKAIYRAIAKEYFRKRK
Ga0214480_106615Ga0214480_1066151F100378AIVHIPKSLPIHGMALTSTGRKIEHNKVVVDTQTYELNGSVSFKKWGVFSRLRGVAVLL
Ga0214480_106841Ga0214480_1068411F057890MVLSSEGLHLLSNVAIWRHDFETINDKNVAMWHRYYTDVINYYIMKENDCQLIISNNNNKKEYSTSDGWDVFRFQVSEEHRINQTFNTLNDFLEYWFHEYHNRQESSRLEPVYMKKQTKNQSGECPTGVQQEASERYQKLSYTDKAIYRAIGKEYFRKKKSG
Ga0214480_107482Ga0214480_1074821F051162TKSIPLSSLQSKLCDIWGTKPETTKIILQRVEDWTSDSFHEDVYIDIRAYGKEERTRDFVLDGMKDVQKAFGEFDLLANVRLETYEGERYFHVPPPKK
Ga0214480_107724Ga0214480_1077242F057890MVLSSEGLHLLSNVAVWRHDFETINDKNVALWHRYYIDVINFYIMKENHREWIDNNNKKKEKKKEYSTSDGWDVIRFQVSEEHRINRPFNTLNDFLDYWFHEYHNRQESSRLEPMYMKKNRQRNNPEFNKKPWKDIKNLATLITQYIEQLQRSTSGRRSSICTIIKVTIKVTSNQLGFRTYL
Ga0214480_107758Ga0214480_1077581F026012LLQALYSIVISLQNNLSINFFDMWIYEIYINKVSIHNKFMNQKFQNLEPSEYITIKLAYGNNVSQEKK
Ga0214480_107874Ga0214480_1078741F100378MSDAYATVHIPKSLTIHGMALTSTGRKIEHSKVVVGAQTYESNGSVSLKNWIFSRL
Ga0214480_108309Ga0214480_1083091F100378KSLPIHGMALTSTGRKIEHSKVVVVTQTFESINSISLEKSGLSHL
Ga0214480_109014Ga0214480_1090141F100378STMSDAYATVGIPKSLPIHGMALTSTGRKIEHNKVVVHTQTYESISSISLEIWDSVGFEV
Ga0214480_109038Ga0214480_1090381F100378MSDAYAIVHIPKSLPIHGMALTSTGRKIEHNKVVVDTQTYELNGSVSLEKSGFSRFRGVADLL
Ga0214480_109038Ga0214480_1090382F100378IPKSLPIHGMALTSTGRKIEHNKVVVDTQTYESNGSVSLEKSGFSRLRGVADLL
Ga0214480_109450Ga0214480_1094502F094948MTAAWVLDCHYNIVHVLSSPQTITTGVGVAKCWADLEILAQSEARCRPTHGCKSATSTSTKARQVFINLLGTQAMPIHQYIC
Ga0214480_109662Ga0214480_1096621F094948MTAAWVLDCHYNIAHVLSSPQTITTGVGVAICWADLEILAQSEARCRPTHGCKSATSTSTKVQQVFINLLGTQAMPIHQYIC
Ga0214480_110093Ga0214480_1100931F100378MSDAYAIVDIPKSLPIHGMALTSTGRKIKHSKVVVGTQTYESLSSISLENVGFLVVEV
Ga0214480_110706Ga0214480_1107061F041807KHNVKEKLNMARLELTLNFPKNFEIKTFNVKSEKKLSPLAKLILQSVRFKHFYYV

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.