NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006694

3300006694: Metatranscriptome of deep ocean microbial communities from South Indian Ocean - MP1239 (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300006694
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0053074 | Gp0092423 | Ga0031689
Sample Name: Metatranscriptome of deep ocean microbial communities from South Indian Ocean - MP1239 (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 54615677
Sequencing Scaffolds: 23
Novel Protein Genes: 31
Associated Families: 29

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 12
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Novosphingobium → Novosphingobium aromaticivorans | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Coleoptera → Polyphaga → Cucujiformia → Tenebrionoidea → Tenebrionidae → Tenebrionidae incertae sedis → Tribolium → Tribolium castaneum | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 1
All Organisms → cellular organisms → Eukaryota → Sar | 1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales | 1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Deep Ocean Microbial Communities From The Global Malaspina Expedition
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean → Deep Ocean Microbial Communities From The Global Malaspina Expedition

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome, marine water body, sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: West of Perth, Australia, South Indian Ocean
Coordinates: Lat. (°) -31.13, Long. (°) 110.21, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000021 | Metagenome / Metatranscriptome | 6082 | Y
F000048 | Metagenome / Metatranscriptome | 3365 | Y
F000049 | Metagenome / Metatranscriptome | 3277 | Y
F000068 | Metagenome / Metatranscriptome | 2731 | Y
F000073 | Metagenome / Metatranscriptome | 2639 | Y
F000685 | Metatranscriptome | 937 | Y
F000879 | Metagenome / Metatranscriptome | 850 | Y
F000981 | Metatranscriptome | 814 | Y
F001583 | Metagenome / Metatranscriptome | 668 | Y
F001833 | Metatranscriptome | 628 | Y
F005448 | Metatranscriptome | 400 | N
F007220 | Metatranscriptome | 355 | N
F012012 | Metatranscriptome | 284 | Y
F013304 | Metagenome / Metatranscriptome | 272 | Y
F014852 | Metatranscriptome | 259 | Y
F016401 | Metagenome / Metatranscriptome | 247 | Y
F018528 | Metatranscriptome | 234 | Y
F023830 | Metatranscriptome | 208 | N
F027881 | Metagenome / Metatranscriptome | 193 | Y
F040651 | Metatranscriptome | 161 | Y
F044435 | Metatranscriptome | 154 | N
F055666 | Metatranscriptome | 138 | Y
F055741 | Metagenome / Metatranscriptome | 138 | N
F059045 | Metagenome / Metatranscriptome | 134 | Y
F088751 | Metagenome / Metatranscriptome | 109 | N
F090278 | Metatranscriptome | 108 | N
F090441 | Metatranscriptome | 108 | N
F099150 | Metatranscriptome | 103 | N
F105006 | Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0031689_1082686 | Not Available | 511
Ga0031689_1117570 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 557
Ga0031689_1127159 | Not Available | 518
Ga0031689_1151166 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 638
Ga0031689_1162513 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Novosphingobium → Novosphingobium aromaticivorans | 587
Ga0031689_1163013 | Not Available | 615
Ga0031689_1165385 | Not Available | 644
Ga0031689_1168588 | Not Available | 572
Ga0031689_1170817 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 532
Ga0031689_1173537 | Not Available | 563
Ga0031689_1175148 | Not Available | 1485
Ga0031689_1178316 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Coleoptera → Polyphaga → Cucujiformia → Tenebrionoidea → Tenebrionidae → Tenebrionidae incertae sedis → Tribolium → Tribolium castaneum | 818
Ga0031689_1180587 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 802
Ga0031689_1181738 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 561
Ga0031689_1182180 | All Organisms → cellular organisms → Eukaryota → Sar | 751
Ga0031689_1182419 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales | 774
Ga0031689_1183381 | Not Available | 712
Ga0031689_1183571 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 665
Ga0031689_1184697 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 701
Ga0031689_1184734 | Not Available | 570
Ga0031689_1188252 | Not Available | 694
Ga0031689_1189983 | Not Available | 1182
Ga0031689_1191672 | Not Available | 528

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0031689_1005399 | Ga0031689_10053991 | F007220 | GEDNTPGVYASVSKGVCWIDYAMTCQFGQQSGSYSSYWGHSAQQCQTWMDVELSRLNQEVADMQNAGSLTGRKKAAALAKGLKAQETLNKYSQCNVFWQPIDAAPLTTGGNGYVDGGDVDISNFERDNYPAAPLTDGTDGYSEPKTVDSAPLTDDSYSETKTVDTAPLTDDSYSETKTVDAAPLTDGSYSEPKTAPQTEGSYEDAPPVKLTQGAYTEDTADLTGEVKAPGPVY*
Ga0031689_1040382 | Ga0031689_10403821 | F055741 | ARGAKFYALMHRFDGFTGELNMGEQSWVKHPVGSQTLVTQTNVKSGEAYQHPDNSWWFAHDATAVGDHGMAWTNGLACDETFKFTAAYCPAVACKMSAKCVQIAPLMPKLLHTTAKAMTQDEIASSNEVRISLYGATTKMDISPKLVPFFGEMRTTVISKPRNAKNYQDHVSRCKIAQKSVMAKGINIDAQQLDDIARFSFFIDFEDQYGADKAMFDGNYVRTLCADPFYKNGSGALQTGTLSL
Ga0031689_1082686 | Ga0031689_10826861 | F005448 | EATGIALKDSFRCGLFFPDPADKSPDKSALPYLNLYIFNATFDAKAECDSGTPNVERYNTFCAEVWDKGGKGLVLTSPSLNKKRAADGISIGDDICNWVKTEGKTPFVGKNSKKFPNGLEFGMYSNSCGNKLWDWAGRIHHEIVCCAKGLHTPCDF*
Ga0031689_1117570 | Ga0031689_11175702 | F059045 | IPKEDVQKLLAELCDPEDEDGMFPYTPFVDRLTGKA*
Ga0031689_1127159 | Ga0031689_11271591 | F105006 | MSMNKFTLLNSNGVARPGFYIEILEFPSGNTPGQAVYVDGRGTDDMINFIYNPETDLLDAWHIRSPDNKFTMTPITVSNRVVGLDVRFTHIPNFVAHFALDLSTDEEEVVYDVLAPPVHLAELTSVNF*
Ga0031689_1151166 | Ga0031689_11511661 | F000685 | LLEVIMELLEAYKEGNQAKVAKAQVDKLIDLAAPPPKGTGVQNIRECIIESKMTFDVSADDWQAYLKAKIMNNIERYFYMIVFAMYIREVGPKGFPQTFKQYMDANSGLRTMIEEGRSKLEWERKIPDEKLAELKDILSSADFKDNMPKVIKRIYELSWDMFGDLPRGHHKNNSMHKLASKTMIEILPENLATYVEKKCGSLAGTPDFFDVT
Ga0031689_1162513 | Ga0031689_11625131 | F044435 | QGQTDELLTWLSKVQQKKATIRSTRARDFRKEALLEAARVSAQTELEKKQRERLQRLRDVRPTICRPPSPPIVERDENLCDCESCFLCQRHRSSMYGKCYCNVIFSPRPIFMDFSPPVLLASEAFAGDEQVSQEADNFLASLGSKRRIDSELGGAVASKRAKA*
Ga0031689_1163013 | Ga0031689_11630131 | F090278 | SLWRSKREASEGLVDVDEDDFYEFLQQVNDNRHFRHNAMGNLTCVLHKCGQLTEDMEVNMDFYTTALRAEEPETGFTWDVEGSAAKDPEWREKIATAYEDCYDLAESWPATSLNRRPMTRMFGRQMIFFKCADRTERRVCTEAQLLENLETFYGSESDEETAERIAAIGLPEDKYDAAAISIAVIQNAQSEEEKFFERFMWNLN
Ga0031689_1163363 | Ga0031689_11633631 | F000068 | TEVEEKCEDETSGYTTNTKCSKWPKEVCSVSKKNVKKYTPITGCTKEPRELCAPAGCGFKQGAEECYDKTQTVVQDAPKEQCSLEPQRTCKHVTKLVPKLSPTEECVDVPKEVCTRSRTNPRKVKKPVVKKWCYVPSEESGLA*
Ga0031689_1165385 | Ga0031689_11653851 | F000981 | IPRRYGLAPLDVGDAVSRAVNHKILFDSLDKTGRGALTLDQFVEWANAHVVSAIPKIPTGDVGLYHVEDYSEEQYIGFVEKAVNNPGSYEHASFYNFILNCFVEADEQCQGRITYDQFDKLLSRAATVPRHFGLAPPESSTEARKKMFDELELKRGGQGTGYVTARTFWEWTVVHVRGMIELQKAGKGWRENH*
Ga0031689_1167634 | Ga0031689_11676341 | F000073 | KKAFSLTGGKVSTPADQLKLNKSFVDYMSEHSKLRTIVDEGKGKLQWERDIPPEALANLESLAASDFKGNLGKIIHDIYQTAHGLFKDLPQGDHKKRAKYRFASKTLMRVLPANLKTEVEGLISSKTMTLDLYEILGQCTWGQAK*
Ga0031689_1168588 | Ga0031689_11685881 | F023830 | PTLRMLKFVALALCLASLSQAAPEPRIACQECMDEMHMLGYIVKQGAAEIEEYLKTNYCPTLDEQFRPQCEQNLADNYVSMLQMIVNHFFVDGAGHICLAWGVCQPKEVAALIGNKQPRPFTCPECLEGMELVGAYMTDPLWISEYTVYLEQNFCVGHNDHHCVDMVQRHFPPMHSMAMAQFFVPQEICD
Ga0031689_1170817 | Ga0031689_11708172 | F012012 | EPEYYGNMAQKFKNGFQYAKYLEVLSLPDDMRELLDEYLDANSRADQNKACEEFFQCPYSIKDSAKRNLSGNNL*
Ga0031689_1173510 | Ga0031689_11735101 | F000048 | AEGLSKPSDLNEVKALNEKVSTFDKTCLTYAKVLDAANGAAQKMTTHGEADAEVAALKERYNKVKAVSDEWVKNVDVLLKEWTLLDNTVTELNSWVAKDKSSEGENQFSLEKMESTLGELKNIFKEKEKLVEGL*
Ga0031689_1173537 | Ga0031689_11735371 | F018528 | GNLPSRKMAAMFVVVLLAIATTCQGAALDRMKREAIVGKMDKTFDFLGVHMGLKYKDAAQPLKGGKIQLKVDDLKKIFKGAHSNKVEVDAEFDGGVAVNDGLFKMVAHYSMVHSDGDGEEKGEVLIERKQTGGMWTTTIKTTASPFGGKPLFPAAINNLKIMVESDRQTKFHAMYVNPTKNRDMHIN
Ga0031689_1175148 | Ga0031689_11751482 | F001833 | MTLDTEMTLSSNSKVYEFLSEKYPYGAFNTRNNKVKLFVNKKNKNLLAPKFKIEVHLEKDGEKVVDLTADTTGSPYVFKLTAPNFFKRWGIKQSSIDITADHKIGSSLVIDANILGGLLLEGKRGDNAKNGRDVSLMVQKGGVQMFKITWSTEKINNANEFRFILHDTLDVNSESMLYKNIISQYKILTPFNSRIGEFEIYVNKKDKNLVLNKFYAKGNVMKDGNKRFDFLLSTNEKPYKFELFAPSLLGKLKPGMTEAKISADHNPGQSLEIKTNFEKFTGLKIYKSGSGNERKVEVNGKELAKGDYTLTDNSFSTKITLGNDYLEPKITWEGKLPNSRQEAEAFMLKNNVDVKVTGSKRNLDLNLNWKMTKPDFDFGTPENGKISMNAKGNNPRWGDYSLSRDINWKAENKVIELNLKGMAQFAKGGLATSTPIETAVNFKVLVDKKDLI*
Ga0031689_1178316 | Ga0031689_11783161 | F000049 | MRHNKMAEWMENVEKAIARIMADKVYTSAEFKRERDTFHALCKDLERTEVKKWLNQILEILMAERAKSQQETENEKLTILIQKHEQLIPTVQKTAVMVDLYWKCYAYGDELKPHIEFLDGIMLSSTREIAPSCVENVDELIERQEKSITQLDSKRGVVTDLITKGKGILEHPDKPKFLEGNVRRIQEGWDDTKTKAQERLKLLIDTKDACVGYAANNETIASEVDVAEAEIKKVKKK
Ga0031689_1180587 | Ga0031689_11805871 | F000049 | PFLQETKKAKNEALCKDLERSDIKKWLVNILEILMAERSKDEKTMQNQKLEALIKKHEDLIPTVSKTQVKVDLYWKCYAFGDELKPHIEFLDGIMLSSTRDIAPSCIENVEELIERQEKSLTQLETKRSIVQDLITKGKQLMENPDKPKFLDGHVGRIKEGWDETKNKASARLELLYNTKAAWEGYATGLETIVVEFEKGEEEIKKVKKRFNLDAAKDDLAKRKGIFNDTKNTIEGMFGDIQHNYDVMTMTLPEEKKDFVKKEVKA
Ga0031689_1181629 | Ga0031689_11816291 | F000049 | ENVEKSIARIMADKVYTSSEFKRERDTFNSLCKDLERAEVRKWLQQILEILMAERAKEQKNTEFGKLDALIKCHEELIPNVQKTAVMVDLYWKCYAYGDDLKPHVEFLDGIMLSSTRDIAPSCLENVDELIERQEKSLSQLDSKRNIVTDLIAKGKVILQNPDKPKFLEANVKRIQDGWEDTKNKATDRLQLLNETKAAFIGYAENSETIATDFETAE
Ga0031689_1181738 | Ga0031689_11817381 | F000879 | MKCLLVCAVLSAVSAVPQVYLGGLHGYPYALGSALTYTPHVIKPVVKEIEIPVKTITYGIKETGCKNSFGFSVPCLAEGEARRKRAADEEAAEAAPAAAVLPYAGLPLAYGYGLGAYGYGLPYTYAAPTVTVGEPKVTEVEVPQYVYKAVNEKVELAPLCHNGLGFAVPCA*
Ga0031689_1182180 | Ga0031689_11821801 | F016401 | PGRFYPCSGVSIVERQGFVQRTLTANGETYLENIYDDEQTSEIVYRKLVNGSETDVERVVVVRTNPLQIEFHMRNKADGFRVEWDMPKSAALSAVDAFVREARRMEGSTPTTIGYGITSDPIRDVTFDHLFAATELAIKEPWRVIEVDQSSCHVQDCAGFLTRKMRLSASGEMVTERITISEEKGEVTYNKCDASGRPSDVERVLAIRTPLRLEFYERSARSGMRVHWTAPYNMARDTFSNIVQLAKKIK
Ga0031689_1182419 | Ga0031689_11824191 | F040651 | GWTSPAVTTCTSDGLWAAMVYKARNPQKFMDVSNVTVADRQGFIARSMTINPTGKRVEEHIYANERTGEMVYRIVDASTKQETDDERVMAVRDGPLRIEFFHRHKSDGYRAYWQAPVDTVQKMIQELIDYASKNDGQGGDVGLGVRSAEIKGVSHDSVWRSMMESIREPARFFPCSGVSIQEKPGYIQRTLTANGETYTENIYDDEASCEIVYRKLSNGAETDLERVVALRTHPLQIEFHLRNKNDGFRVTWSMPKS
Ga0031689_1183381 | Ga0031689_11833811 | F055666 | NAAQFDALCENVAKLPRRFGLAPTWEIEYDGDIAKRTAARQKMFDGIDGMHGPARGWIGCAQFIHWATDHVASKVADAKATGVDFYHVTDYSKQDFLKAIDNAVHDPTSQDHQNFYEFLLTIFVEEDHQCKGVVSKEAFARLVDRAALVPRHFELAPPSADAARVAELYAAMEDSRLGGVTFRKFLQWTTEHTKAKIEAHKQ*
Ga0031689_1183571 | Ga0031689_11835711 | F013304 | EDKKDFVKKEVKAIQDKLEVVSRFKEKVDKIDEFVASLDNFDKTLKMVDSWMKEADNQLNDIKNNSDTMTPEDRVSCTMELQEDVTAKVDIVVAAIKTENDLLPQGDQVPKDAQDFKDELKRINDYVTDLQKRVMTECEHFSEDVKYWAEYKTGIRGFKPWLEASETRTGGGLSKPQTLDEANAMFANVNDFDQGCLKHLTILLNAETAANKMTTHKEADV
Ga0031689_1184228 | Ga0031689_11842281 | F000021 | KNECDKVIDKNGTPKTGGNGIKQLRENIAESKLSYEIMDDAAQAFLKVKIMDNIHKYYYMIAFTAYMREMAEAARGLVSDEQKAALTLPGGKSAIPGNQLKLGKTFVTFMDAQPELRGLIDQGKGNLQWERDIPAAALANLESLANSDFKANLGKIIHDIYQTAHQMFSDMPQGDHKKRAKYRFASKTLMRILPSSVKGEIEGLIEKKAISLDLYEILGKCTWKPV*
Ga0031689_1184697 | Ga0031689_11846971 | F001583 | AESDG*LLGGYAFF*FHYIIALGISLSATHLSDLTLTIIANIF*SVLNFTYKTYYIIFTNKHLNTDQLTRLMVLHYFTP*YYLYLVQLHVMFCHES*DSDSGENVYEDKSGSYVS*FYDAFLKEIQDA*Y*VLYVFCYFTIHHFAPGTVNYFFFER*NISELDEIRFYGVAPH*YFRPLMGLLVVSPSHYEGLM*MGL*FVLLAALPIIYSFYNTHHNYLPIIPMQHSYLQTC
Ga0031689_1184734 | Ga0031689_11847341 | F099150 | MKGAILACVLLGLAASAYAEEFEVADNEARFLYFNTSSTATSLTLLGALILLGVIGYLVYVGGLLGTSSYNRNDYYDPAAYDPAYAQAYQQQAQYRSEESSSPFTAENILKAISMVQEIYEASQS*
Ga0031689_1188252 | Ga0031689_11882521 | F090441 | LALGALITRRVIVGRQFPDAVLYRKNDSDTFRFHSGT*
Ga0031689_1188317 | Ga0031689_11883171 | F014852 | LALLPLALALPAREKKQAGYGYAPAPYHPAPAYPAHYGYGYEQPKHNCSVIDVVEPAEVCTPVIETECNDVELPIKIIVEVPFTYTVVRTVCTETIEVVPQEVCSYSYTQTEEETNAKTVEVTFEKQEKVQMVTVCQPGHHGYGGYGHGGYGHNYCKEVAQTTAYNVPVVTPIDVPVTVAYPTPVKTCVDKPIDLPVVTCADVTEERTIQVPAVEDSSVTTQKCVSGLGAPACQAVELTLPKQV
Ga0031689_1189983 | Ga0031689_11899833 | F027881 | MSLTVARSERVRGVLTHANCTQFFLKLSGARVSNTLLTFSKFI*
Ga0031689_1191672 | Ga0031689_11916721 | F088751 | FSC*KH*SVRLQLAARGENVLF*PQNLDIWGQKSIFCLVIAIFVDGTNDHYTRGSNFPIGTTPKKFSVSELGVIFWGSPLFLAVFGH*RVRRATTLNFGPISTKLGGIVRAIKKMTQKDNGPSPGRNYGETGVFTLGRKVVFGLKMGLTLKNHPK*HFPP*LFGQRQLFSLNNFF

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice