NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007610

3300007610: Marine microbial communities from the Southern Atlantic ocean - KN S15 Surf_A metaT (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300007610
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114292 | Gp0125886 | Ga0102778
Sample Name: Marine microbial communities from the Southern Atlantic ocean - KN S15 Surf_A metaT (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 129798274
Sequencing Scaffolds: 40
Novel Protein Genes: 44
Associated Families: 31

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 25
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2
All Organisms → cellular organisms → Eukaryota → Sar | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium TMED42 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 3
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Synurophyceae → Synurales → Neotessellaceae → Neotessella → Neotessella volvocina | 2
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Oomycota | 1
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Synurophyceae → Synurales | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Nilusvirus → Nilusvirus ssm2 | 1
All Organisms → cellular organisms → Eukaryota | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine → Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome, marine water body, sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: Southern Atlantic Ocean
Coordinates: Lat. (°) -28.2362 | Long. (°) -38.4949 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000020 | Metagenome / Metatranscriptome | 6197 | Y
F000075 | Metagenome / Metatranscriptome | 2622 | Y
F000216 | Metatranscriptome | 1562 | Y
F001145 | Metagenome / Metatranscriptome | 765 | Y
F001504 | Metagenome / Metatranscriptome | 681 | Y
F003068 | Metagenome / Metatranscriptome | 509 | Y
F005911 | Metagenome / Metatranscriptome | 386 | Y
F007756 | Metagenome / Metatranscriptome | 345 | Y
F010164 | Metagenome / Metatranscriptome | 307 | Y
F010476 | Metagenome / Metatranscriptome | 303 | Y
F011083 | Metagenome / Metatranscriptome | 295 | N
F013897 | Metagenome / Metatranscriptome | 267 | Y
F014386 | Metagenome / Metatranscriptome | 263 | Y
F018725 | Metagenome / Metatranscriptome | 233 | N
F019484 | Metagenome / Metatranscriptome | 229 | Y
F025306 | Metagenome / Metatranscriptome | 202 | N
F029472 | Metagenome / Metatranscriptome | 188 | N
F030122 | Metagenome / Metatranscriptome | 186 | Y
F030783 | Metagenome / Metatranscriptome | 184 | N
F042354 | Metagenome / Metatranscriptome | 158 | Y
F053816 | Metagenome / Metatranscriptome | 140 | Y
F068862 | Metatranscriptome | 124 | N
F070901 | Metagenome / Metatranscriptome | 122 | Y
F071278 | Metagenome / Metatranscriptome | 122 | N
F082717 | Metagenome / Metatranscriptome | 113 | N
F084269 | Metagenome / Metatranscriptome | 112 | N
F090441 | Metatranscriptome | 108 | N
F090442 | Metagenome / Metatranscriptome | 108 | Y
F092219 | Metagenome / Metatranscriptome | 107 | N
F098676 | Metatranscriptome | 103 | N
F100444 | Metagenome / Metatranscriptome | 102 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0102778_1004180 | Not Available | 678
Ga0102778_1006313 | Not Available | 616
Ga0102778_1009133 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 988
Ga0102778_1009487 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 618
Ga0102778_1009726 | Not Available | 962
Ga0102778_1010583 | Not Available | 717
Ga0102778_1051240 | Not Available | 947
Ga0102778_1061581 | Not Available | 887
Ga0102778_1175262 | Not Available | 694
Ga0102778_1200640 | All Organisms → cellular organisms → Eukaryota → Sar | 534
Ga0102778_1201590 | Not Available | 668
Ga0102778_1204049 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium TMED42 | 604
Ga0102778_1221248 | Not Available | 517
Ga0102778_1234750 | Not Available | 578
Ga0102778_1237623 | Not Available | 727
Ga0102778_1239548 | Not Available | 781
Ga0102778_1247118 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1106
Ga0102778_1252371 | Not Available | 528
Ga0102778_1276911 | Not Available | 655
Ga0102778_1278203 | Not Available | 502
Ga0102778_1278926 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Synurophyceae → Synurales → Neotessellaceae → Neotessella → Neotessella volvocina | 822
Ga0102778_1292715 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Oomycota | 567
Ga0102778_1293673 | All Organisms → cellular organisms → Eukaryota → Sar | 807
Ga0102778_1295945 | Not Available | 574
Ga0102778_1301230 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Synurophyceae → Synurales | 1254
Ga0102778_1313570 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Synurophyceae → Synurales → Neotessellaceae → Neotessella → Neotessella volvocina | 948
Ga0102778_1318530 | Not Available | 577
Ga0102778_1322430 | Not Available | 719
Ga0102778_1329800 | Not Available | 717
Ga0102778_1337318 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 705
Ga0102778_1337566 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Nilusvirus → Nilusvirus ssm2 | 508
Ga0102778_1343856 | Not Available | 1672
Ga0102778_1346515 | Not Available | 843
Ga0102778_1364053 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 792
Ga0102778_1366972 | Not Available | 1532
Ga0102778_1375333 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 774
Ga0102778_1378796 | Not Available | 903
Ga0102778_1391294 | Not Available | 940
Ga0102778_1391596 | All Organisms → cellular organisms → Eukaryota | 1843
Ga0102778_1391812 | Not Available | 786

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0102778_1004180 | Ga0102778_10041801 | F005911 | KGGRKTMSLKTRLLTMLNSKKGSALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSIKGKIDGADINQARLGSGTEADIESPVSELFHNGNFITLKEMVKKAVIIVSDDPTATARSGVDTAYDIDNSGVLIKFADADGTVIADGTGGDTIVADVHLFVNLAGTADANTNAPYTAGEPFYYILMDDVTAGFNDATKRTIDLTQFPTGILATNDG
Ga0102778_1006313 | Ga0102778_10063131 | F010476 | MFDYKIVAYNKLGKIQETENLFCAPDEINDVMYTMSEQYGYAEALDTMDTHMGEYGERPLSLGERRYF*
Ga0102778_1009133 | Ga0102778_10091333 | F010476 | MFDYKITAYNKLGKVQETENLFCAPDEINDVMYTMSEQYGYAEAVDTMNTHMGEYGERPLSLGERRYF*
Ga0102778_1009487 | Ga0102778_10094872 | F013897 | MNLTKDQLLALMNTIDFATDNDASYEEYTIIKSGTSDLELIKDILYNEYIHQTQ*
Ga0102778_1009726 | Ga0102778_10097261 | F090441 | GEVWELATLSFLDSLALGALITRRAIIGCQFPDAVSYRKNDCNMFRSHSET*
Ga0102778_1010583 | Ga0102778_10105831 | F071278 | NAYQMALSVKGKINGDSMQANKLDGTNVEDDIEDSLDPLFHNGSFITLREMVIASIIIVQNDPTTTSERGKKIPYDVDNSGVLIKFANSTDAVIAPSDTNRAIEDRANLAKVHDLHLFVNLAGTTDIDSRPNGPYAVGDPFFYVVMASDSDTGRTDGLTDALKTVNLTQFPTGMLATNQGGPQAETSVILPQDFD*
Ga0102778_1012524 | Ga0102778_10125241 | F000075 | KFFALVAAVSATQYDNMSEDELLVNLESTLSSALSSEARGDGDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK*
Ga0102778_1051240 | Ga0102778_10512401 | F090441 | ELATLSFLDSLALGAPITRRVIIGCQFPDAASYRKNDSDMFRPHSGT*
Ga0102778_1061581 | Ga0102778_10615811 | F090441 | ELATLSFLDSLALGAPITRRVIIGCQFPDAASYRKNDSDMFRSHSGT*
Ga0102778_1130116 | Ga0102778_11301161 | F000216 | CKGVVMFCPMLSILFVGTRMRALMLTDWKGAPQGWCQDGMYMATWSLLIQFVMVLVTPCATGVPAQVDEDGNIKWEPDNKILFYCVVTIRALGFILLYGGIITVITGLYTMTPETANGRGAVPLVGDGKVPGVGAQVPGYEGIKEPYGVNDVPGTPIEQKF*
Ga0102778_1175262 | Ga0102778_11752621 | F053816 | ISDAPPQAQQIAMPITFEFYHGTSMANARNIMDNGFQASSGGYLGKGVYGAYEDKARDFASLGGRHKGDEGALIKCRVTVDENKVKTRTHATNSTQLNGQTADCVWYPGGGSVKRPEVCITDPSKIEILGMERVANR*
Ga0102778_1200640 | Ga0102778_12006401 | F014386 | HMRAFWGATCSFFIAFVGWFALAPVALDVIHSIGLCENQLYEPEETPTRPAFIAYKNIKTGKSFCVHGKTDDGKDCAEIPAESEIEACKDDAGSEECKFEKATKYDFDTLNAVKCVCGKGTECKNTIANAGIASVGSTVFVRIILGTGLEVLGPVNVQCLLLTFGGTMVALSSQINGP
Ga0102778_1201590 | Ga0102778_12015901 | F098676 | FCQMLCDEFPDCMAFEVEDGGVTLYESGMNSFEGKMAACYFKSAFTQDDEGGIADASAGGGKVVSGNGLRGRDQVFRGEDPRYDCYSNVCRQNAAFPNAYQLDAKIGDIDPMANLDGTQYQAGAMTRRMSEGYPGGSSAASRAEFRRMMETTGDTEGLKHLSAREKYIEMMKGAPISMHQYNIDHGLN*
Ga0102778_1204049 | Ga0102778_12040491 | F018725 | AGVSFSDTRTFRGVKQADDTLGVNVGLGTSLAEKVSLGVSLDSFTALEAGQTNELRTGVSLGYDVSETVGLSVGYTNYDYQGATSNDEIGFGVSVDTLLNPSVLYATDSDNDSDVTELSVCHELSLSEKFGLTLCGSLGNVDAAADYTYHSLGATVTSSLGGADTFAGVALVDNDNAGSDSETVFSVGFSLKF*
Ga0102778_1221248 | Ga0102778_12212481 | F082717 | KIEKIILSASVSFKPQQVRGLAATSTSISIGKASVSKSKTPTYAGKKKNSGKSRTSGIIPNFGHGLEDCTLEVRKALILLSGQALDSKTFRVGRPHLGIRKGRLAAYQVTLRNQTMYLFLERLLTEVIPKIITTDPAALITSAADRRLQKTSMKVANLYTLDTKKVPLKNMW
Ga0102778_1234750 | Ga0102778_12347502 | F030122 | MEFYPKDKNPMLDERTARFHGRVLKEDLATLPFVLDTCNRDINIARATNYVTWDHDKEKWCEVDHLMMNFYIKAKTSETRQELEDKINRGVVELLK
Ga0102778_1237623 | Ga0102778_12376231 | F005911 | MLKNKILNFLNSKKGNALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSIKAKIDGADINQARLGSGDEDQIEAPINELFHNGNFIKLSEMVKKAVIIVSDDPTATARKGYDIGYDTENSGVLIKFAAADGNVIQPDQGDADDTIVADVHLFVNLAGTADTDANAPYVDGTPFYYILMDATTAGLADSLETINSTIYPTGILATNGGGPQAEVSV
Ga0102778_1239548 | Ga0102778_12395481 | F005911 | KKIKFNKEGGKLTMSLKIKLLTLLNSKKGSALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSIKGKIDGADINQARLGGSTEAELEAPIDDLFHNAQFVTLKDMVKKAVIIVSDDPTATTRVGTDTGYDLDNSGVLIKFADVDGTVIDDGTGGDTLVADVHLFVNLAGTADATTNAPYTAGEPFYYILMDDVTAGFNDASKRTIDLTQFPTGILATNDGGPQAEVSVVLPQDSEE*
Ga0102778_1247118 | Ga0102778_12471181 | F030783 | KNKTPFLLGLLFLAFVTAHEVEHISEAFEVQDEGFELSCDYCEETQSQDLVNSKTNITFIDFDIEDSKLVSLTDQSLSKNYHQRAPPKI*
Ga0102778_1252371 | Ga0102778_12523712 | F084269 | NKKQRENSLIVMRAISRIKKYEMNENSSYGLHNPLLREQAN*
Ga0102778_1276911 | Ga0102778_12769111 | F003068 | MRKILTLISVLSLVSFNAYAVDVSQLSVTAGVAHNSAVYGASAKETNRNESNVVKTVDKESGVFTESHQSFFMELNAGEFISLGFEHTPDSITTPENQRITNTNATTKVKVDFNDLNTAYVKLNVPGGLYVKAGVVETDLDIKESMASGSTYNNVSTEGTLLGVGYSRDLGDSPFSIRVEGSYMELDDVTTSNGVSATGGTIANGGRNQIDASNLEG
Ga0102778_1278086 | Ga0102778_12780862 | F000020 | LKEFYEKAAEATALLQGKQKPEVFDEPYTGMQSESGGVVGMLEVIQSDFARLETDTKAAEAEAQKAFDEFTSESAVNRAANAKDVEHKTTKKTNQEAALTAKKADLEGTQKELDAALAYYDKLKPSCVDAGVSYEDRVARRKEEIESLQEALRILNGEDIAFLQK*
Ga0102778_1278203 | Ga0102778_12782031 | F100444 | LGHKMFKYSFIKSLAFGTLGFLSLALFSISSLVLAKEEMILVADNSFNTSHLFSKFEKLSTSLNSVKVVRNYKSQVDFGSDPITNEIWYPHKSATINYYVDCKISKLSIKSWKLFEGSNASGEIVWADQIFGNLSFYSPGTDEELNAVLNTCNNSFNFAAKEF*
Ga0102778_1278926 | Ga0102778_12789261 | F010164 | QSLFWLQSWKNLYEKDKVEIVETKYKLVKTGLTNTFSTTENLIKNFENKAFVALQRYMVLITASRILRKFLFLFEVEQLKLIDATISKLGGFKK*
Ga0102778_1292715 | Ga0102778_12927151 | F011083 | DALRALPNEVLEGVTVKARTSDTVSLCTRFHDGVQHLGGYSETTDGKFKNSKMTTNFCETTYTMATGSNKMDFTIEFADKPGQTGVQYLFEVDINKRGAGSFPVSGGITGTGAYSVAEMNFNVNLGNLSELAECSDRGLDDGDGQCECFDGFRGLACEEQEALV*
Ga0102778_1293673 | Ga0102778_12936732 | F001145 | QLNPNFFETNVINIGLLIGILIYANKTSFSVGLENRQKEIIQTIENAQKDVVNASNYYYLAEKGFTQSLFWLQSWKMLYEKEKADLVTNKYKIIKSGLEETFDTTENLIKNFENKAFISIQRYMIFITASRILRKFLLLSEAEQSKIVEITIAKLGGNK*
Ga0102778_1295945 | Ga0102778_12959451 | F042354 | LIMKKSNPLFILIALLFSAGAVFSQASLSTGLSYGTFNYDISGTKYTGDGGVLSLDGQLSPSLSYSLSMSDGKFDDVVFSDSQGSITYNVYPNIGIHFMGSQIRLSTVQETDTSLGVSYNINTSSLGLKVFAGSDINNYGKFYTYGTKINLGVAQGSSLILGYKTEDRKQKVTSMDISFVYDLTSSLGLNL
Ga0102778_1301230 | Ga0102778_13012303 | F010164 | QKDVLNASNYYYLAEKGFTQSLFWLQSWKVLYEKDKIDLVTNKYNLVKNGLLETFLTTENLIANFEKKAFITLQRYIIFVTASRILRKFLFLSDEEQSKLIEVTISKLGGV*
Ga0102778_1313570 | Ga0102778_13135702 | F010164 | EIIQTIENAQKDVVNASNYYYLAEKGFTQSLFWLQSWKNLYEKDKVEVVETKYKLVKNGLTNTFATTEELIKNFENKAFVSLQRYIILVTASRILRKFLFLSQNEQSSLIDSTISKLGGFKR*
Ga0102778_1318530 | Ga0102778_13185301 | F003068 | KETNRDESNVIRTVNKESGVFTEDHQSVFGEVNLGEFVSLGFEHTPDSITTPENRRTVQADGNSTAGTTTVSVDFNDLNVAYLKFNLPGGMYLKYGYVDVNLDIKETMASGSTYANVGTEGTIAGLGYSRPLGDAGLALRVEGSYMSLDDVTTSNGVSASGATVANGGRNQVDASNLEGLNGKIALTYTFGG
Ga0102778_1322430 | Ga0102778_13224303 | F025306 | MPKSILFTEAELETIERAMDDYMCYHDPNTPASDLIGGLPVAERVNDIMVKITEAYANL*
Ga0102778_1329800 | Ga0102778_13298001 | F090442 | MNEPNVAIEFHAEYESGKSGILRGIPANPKKCIGKNVTLTPINVVQK*
Ga0102778_1337318 | Ga0102778_13373181 | F092219 | ANTPPTFNVVREEILPAMNKAVASMPVTFSDVYGDIVYHEEVVPMSAPATFSDVYGDIVYHEEVVPVADADSFSEHFVGGLGPQLPYDPMRGVDYGFVQALLQQNADPFAEQIVGGLGPQSDLDPMMTVMTDWVQPASNVDSVSEQIVGGLGPQPFNAITYSEVFVGGQLHIIPATDVETFSVAGN*
Ga0102778_1337566 | Ga0102778_13375661 | F029472 | MLSRKTIENLSDKLMVEVANYVTEDPRFTELLNELVPEAIDLELGQVDDYSVVQIITSIQQHLRCSPNHSQIHYPRCPL*
Ga0102778_1343856 | Ga0102778_13438561 | F019484 | YFLTILGGSLKKIAKKITIKVPISTKTYTNASIFFFT*
Ga0102778_1346515 | Ga0102778_13465151 | F070901 | PDQVFIDPDTGLRVLTAVFTICDDRSLNGVALDSLGFDHQNDIDIGLTYVETGPGCWLTLYDGPNFNGKTAVISPQTSMHLKHVGARNWNDLTRSITTRSSTGDGVALYLSHRVALGSVNDHCVAMYNANPEIYPKADGLYLCGDKNVAKTWHFDLNDVIEQGFNIDGSHFGIKYIMTGRKVTLKVFDGPTRNTVNHGPLEMSGHETLDLETVKYGDDHWDSKAKSFTLVEV*
Ga0102778_1364053 | Ga0102778_13640531 | F001504 | MTQDREHFYAVQSFLEDDELHKIWNIIEIAMEREGYDVQNAELSMRLYDPELT
Ga0102778_1364053 | Ga0102778_13640532 | F007756 | MSKLNTLKRYYVNVKFEKYGTYTIEATSKEHALEIYNDGDYGWSDYSEDFGEFNEVVEDVEEEVFADTQLSLAGVLQ*
Ga0102778_1366972 | Ga0102778_13669725 | F090442 | YTKKSVATVVMIEPNVAIVFQDAYESGKSGIRRGIPANPKKCIGKKQTFTPTKVAQK*
Ga0102778_1375333 | Ga0102778_13753331 | F092219 | MKKFITALAATFATVVIGSAASAATPTFNVYHEEVLPALVETNAPATFSVYHEEVLPALVAANTPPTFNVVREEILPAMNRAVASMPVTFAEVYGDIVYHEEVVPVADADPFSEHVVGGLGPQLPYDPMRGVDYGFVQALLQQNADPFAEQIVGGLGPQGDIDPFAEEIVGGLGPQ
Ga0102778_1378796 | Ga0102778_13787961 | F090441 | EVWELATLSFLDSLALGALITRRAIVGCQFPDAVSYRKNDCNMFRSHSGT*
Ga0102778_1391294 | Ga0102778_13912941 | F090441 | ATLSFPDSLALGALITRRVIVGCQFPDAALNRKNDCDTFRFHSGT*
Ga0102778_1391596 | Ga0102778_13915961 | F090442 | KTKKAVEAIVIKLPNEATVFQAVYESGKSGTRRGIPAKPKKCIGKKHKLTPINVVQKWILPNTSG*
Ga0102778_1391812 | Ga0102778_13918121 | F068862 | VGFRYKLNELKKYPETLFAKKVFDVADGEATVDADVNVDDKSLSANLKWVSDKLVDGMKTTFRLNGNSNDKLTSVGAEVSKNVDGHDVELKGHYDLADSRLDANGKVVIDKTTAEVSYNSGDEDIRVQLSHDLDENNSPKGSYSTKTGHVAYGWTRKWEGGELDGTYHPDNGGRAVLEWTDKGNQGDWKTRAEVPLANNEIGNSKVTIKREWKY*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice