NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300023203

3300023203: Combined Assembly of Gp0238866, Gp0238878



Overview

Basic Information
IMG/M Taxon OID: 3300023203
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0130338 | Gp0238866 | Ga0255812
Sample Name: Combined Assembly of Gp0238866, Gp0238878
Sequencing Status: Permanent Draft
Sequencing Center: University of Toronto
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 1202075960
Sequencing Scaffolds: 35
Novel Protein Genes: 39
Associated Families: 33

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division CPR1 → candidate division CPR1 bacterium ADurb.Bin160 | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Candidatus Anammoximicrobium → unclassified Candidatus Anammoximicrobium → Candidatus Anammoximicrobium sp. | 1
All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → Candidatus Velamenicoccus → Candidatus Velamenicoccus archaeovorus | 2
Not Available | 17
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → unclassified Ignavibacteria → Ignavibacteria bacterium | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon ADurb.Bin009 | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Candidatus Methanofastidiosa → unclassified Candidatus Methanofastidiosa → Candidatus Methanofastidiosa archaeon | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → unclassified Bacteroidales → Bacteroidales bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Metagenomes From Anaerobic Digester Of Solid Waste
Type: Engineered
Taxonomy: Engineered → Bioreactor → Anaerobic → Unclassified → Unclassified → Anaerobic Digester Digestate → Metagenomes From Anaerobic Digester Of Solid Waste

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline)

Location Information
Location: University of Toronto, Toronto, Ontario, Canada
Coordinates: Lat. (°) 43.5479 | Long. (°) -79.6609 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000769 | Metagenome / Metatranscriptome | 898 | Y
F005744 | Metagenome / Metatranscriptome | 391 | Y
F008824 | Metagenome / Metatranscriptome | 327 | Y
F018746 | Metagenome / Metatranscriptome | 233 | Y
F022996 | Metagenome | 212 | Y
F023249 | Metagenome | 211 | Y
F026449 | Metagenome / Metatranscriptome | 198 | Y
F029922 | Metagenome / Metatranscriptome | 187 | Y
F030486 | Metagenome / Metatranscriptome | 185 | Y
F041155 | Metagenome | 160 | Y
F045728 | Metagenome / Metatranscriptome | 152 | Y
F047698 | Metagenome / Metatranscriptome | 149 | N
F051554 | Metagenome | 144 | Y
F053724 | Metagenome | 140 | Y
F053881 | Metagenome / Metatranscriptome | 140 | N
F054061 | Metagenome / Metatranscriptome | 140 | Y
F066219 | Metagenome / Metatranscriptome | 127 | Y
F067453 | Metagenome | 125 | Y
F069433 | Metagenome / Metatranscriptome | 124 | N
F069913 | Metagenome / Metatranscriptome | 123 | Y
F070092 | Metagenome | 123 | N
F070633 | Metagenome / Metatranscriptome | 123 | N
F071210 | Metagenome / Metatranscriptome | 122 | N
F071272 | Metagenome / Metatranscriptome | 122 | N
F072819 | Metagenome | 121 | N
F076867 | Metagenome | 117 | N
F080801 | Metagenome | 114 | Y
F080820 | Metagenome / Metatranscriptome | 114 | Y
F086459 | Metagenome | 110 | Y
F098918 | Metagenome | 103 | Y
F099327 | Metagenome | 103 | Y
F103328 | Metagenome / Metatranscriptome | 101 | N
F103499 | Metagenome / Metatranscriptome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0255812_10011594 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica | 3956
Ga0255812_10023071 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division CPR1 → candidate division CPR1 bacterium ADurb.Bin160 | 625
Ga0255812_10038387 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 10257
Ga0255812_10077340 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica | 2248
Ga0255812_10097466 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 783
Ga0255812_10100136 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Pirellulaceae → Candidatus Anammoximicrobium → unclassified Candidatus Anammoximicrobium → Candidatus Anammoximicrobium sp. | 4218
Ga0255812_10124951 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → Candidatus Velamenicoccus → Candidatus Velamenicoccus archaeovorus | 2472
Ga0255812_10134916 | Not Available | 1239
Ga0255812_10153404 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 884
Ga0255812_10166398 | All Organisms → cellular organisms → Bacteria | 3987
Ga0255812_10175781 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → unclassified Ignavibacteria → Ignavibacteria bacterium | 1196
Ga0255812_10210070 | All Organisms → cellular organisms → Bacteria | 592
Ga0255812_10230150 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon ADurb.Bin009 | 1795
Ga0255812_10246848 | Not Available | 1296
Ga0255812_10317601 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Stenosarchaea group → Candidatus Methanofastidiosa → unclassified Candidatus Methanofastidiosa → Candidatus Methanofastidiosa archaeon | 2125
Ga0255812_10318128 | Not Available | 881
Ga0255812_10416537 | Not Available | 1013
Ga0255812_10460319 | Not Available | 670
Ga0255812_10461840 | All Organisms → cellular organisms → Bacteria | 988
Ga0255812_10480173 | Not Available | 2282
Ga0255812_10562970 | Not Available | 883
Ga0255812_10609061 | Not Available | 1844
Ga0255812_10613846 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → Candidatus Velamenicoccus → Candidatus Velamenicoccus archaeovorus | 679
Ga0255812_10643215 | Not Available | 906
Ga0255812_10710864 | Not Available | 662
Ga0255812_10763383 | All Organisms → cellular organisms → Bacteria | 3184
Ga0255812_10797444 | All Organisms → cellular organisms → Bacteria | 2091
Ga0255812_10819067 | Not Available | 1249
Ga0255812_10836311 | Not Available | 837
Ga0255812_10838425 | Not Available | 2014
Ga0255812_10856889 | Not Available | 3056
Ga0255812_10914163 | Not Available | 615
Ga0255812_10937986 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → unclassified Bacteroidales → Bacteroidales bacterium | 1063
Ga0255812_10969483 | Not Available | 2055
Ga0255812_10997421 | Not Available | 2738

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0255812_10011594 | Ga0255812_100115944 | F076867 | MDEKNRNLVEIAKKKRYIALVEKLGRGSLSSKELKELEEFEKSEQRPAGVIDGTVDLPTLCVYLEKSPRMIRRYVQQGMPVFRDAVGEIARFKVGDVFKWFYKKQGSEEDNGKDYWDKEYRKNRAKLSEIELKQKEGEVIPFEDHVSIVKNQIRGIKAGFLRLPKQIAP
Ga0255812_10023071 | Ga0255812_100230711 | F071210 | MGLGGLNMKEKRWLGTQKYEIHCFYCGGFHITGNCPQIVKAMKGWKYDRSCPETGHIKIVPNGDEYPTVLAFNGYHYRVVGLWGIPGRLLWLELERFYGDTIVAATFCPDELMEMDLGMSDDEQLSAWLGGLPFLSVSPSEFNGSEAEVEVKTTTGESL
Ga0255812_10038387 | Ga0255812_100383879 | F098918 | PNYTNREFMNALIIFQTALMDKMWDNQDFDKMEMQDRENMAVQCGLDLRKLIHTYTGLDTHKIEEFL
Ga0255812_10077340 | Ga0255812_100773401 | F069433 | MAHKLPPKQCSSNTPAWTDPVLTDLSTKVRKVHIDELRSFLNTEFVRRGLTQASFTDPTITALVTEIRKVHVDQLRTELAACKSGRGESGYCPQDSSGCMDFTDPTITALSTEVRGVHFREMTQKVQALMTGCICETEQCQYCADCGYHYTTCSHAGVACDDHKYSECHHSINHYWMCARINLPTATEHPYNSSNPPVAWDGYVPWDWCVYTPPGLNWGTCEYSGGHNHSAWNCKC
Ga0255812_10097466 | Ga0255812_100974661 | F051554 | MTFQLKYYVPFKAAEGINTDLALKEGILPIEGTAIDTSVNANKWQVPPEDLDFFTASLKGAQLRIDHAESVLSIVGKVPEAIRSGNQVFFQAEVGEPSIIPKILRGYVNHVSVQVDSDDTECSVCGKQTRIEGILTHLCSGAWEIVHKPRVRELSIVASPAYKNTAFHPVGFAAAMALSQLSRGNKDVGS
Ga0255812_10100136 | Ga0255812_101001364 | F067453 | MITRSLGIAPGVPWGGPLPGGGYEGKTKRLDRAVQRFAEWLHRRIAGPVQVRFNSHRLSGGAFVYPDAAPDARITFDAPEIGLGARLVTPASVQRRRDRAWNKGEGSSLPALEWEDCWTERTPLSYSVLVYPQYLRDPGNLGEHKPGDGFGYWSVATPQQAYRLVRQLVSFPVVR
Ga0255812_10124951 | Ga0255812_101249512 | F072819 | MEMFQDQAKAQEVSFRIARLEGENAISELVNWCRNNLDELTVQCFTHKRFMSLQTLVDVLCEVYRDLGVEGDKGNVSVFVLFLAGKHRDKIYASHVVDLNDTHRQILRDKLGLDIEEIEPGLSKLDWRTDAGI
Ga0255812_10134916 | Ga0255812_101349162 | F047698 | MQQATAIVGVLIGFLILTQIGIFVCDAMIEASSVNESSQLYEAQTEAINTFVQCLSIVRILLIVAIVAVVFQYLQGAGLIPGFGGPRQGGY
Ga0255812_10153404 | Ga0255812_101534042 | F069913 | MTLEEYTQLERDFSTSETDIKTFLGIKGVSIHQYYYWKRKSRDLQEASSSTEGQFLPLNVISGGSINPGKRGKNLKHPLITQGEIEIELRTPSGSELRIRGVMDSIMVSTIIASSSVRRN
Ga0255812_10166398 | Ga0255812_101663984 | F008824 | MYQISVYSIAPKLWRWELRCGGALLRCGTAPTKVAAEREASLILN
Ga0255812_10175781 | Ga0255812_101757812 | F022996 | METIKFNISKIKSDIKILVELQKFYKNQRKTETLVGDRQMHAGEAAWKHRVNREKLRLMYAAYGQARGKSYSQIESQYPEDQHPLKAFQSGIENILCEYLQEVEVA
Ga0255812_10210070 | Ga0255812_102100701 | F023249 | PLSQRLEAKMGGNKFKANTVVAIKLARAVYFMLKNKTVFDPSRLVAALAKN
Ga0255812_10230150 | Ga0255812_102301502 | F018746 | MPPRSEALNDFEKAVLQAIEQSPEHTIVASDIVREFAHRSPRTSCIARIETMAREGKVRISRFAGRILVHSPSEA
Ga0255812_10246848 | Ga0255812_102468483 | F067453 | MITRSLGISPGVPWGGPLPGGGYEGKTKRLDRAVQRFAEWLPARLAGPVQVRFNSHRLSGGAFVYPGLVPEAPVTFDTPEIGLGARLVTPESVERRRDGAWKKGDGYSLPALEWEDCWTERTPLAYNVLIYPEFLRDPANLGEERPGSRFGYWAADTPQEAYRLLKRLVNFSMRLTTA
Ga0255812_10254606 | Ga0255812_102546061 | F103499 | HDRIAMEGCVCETCDTKGCHRPATWEIECRGAGVSGRLIYSCDEHCPDPAILSPQDEMRRLVEE
Ga0255812_10317601 | Ga0255812_103176012 | F070633 | VLDVSHSNSYDGFTYSVRQHIPLGPKRCASSYNLVVVPLSDGIMTSIERENFNIAMDTEGRLILTRKDRKVQLVFDYGSGFMTGVLGGVLYETFEIPKKEETDYHG
Ga0255812_10317601 | Ga0255812_103176013 | F053881 | MKAISKISKDSLGEKEGKLYSKIYRHIIQNFEIDEIAADQIAMAIVNQKCILLPRLLSGVDVDISLASESIRKWLSEYRLTPKSKEKDKEVTINLSAIIEEIHREREDKS
Ga0255812_10318128 | Ga0255812_103181282 | F041155 | RAESVQVGVHRGLRADGDMSTVGFGLSALLSLGRLNLVESII
Ga0255812_10416537 | Ga0255812_104165372 | F005744 | MDDETFEIIKAGAPDAPPEQALYRIQQTYPDGSGGRLNVDWDGLRRLHGLIHDRITMEGCVCETCDTKGCHRPATWEIECRGAGVSGRLIYSCDEHCPDPAILSPQDEMRRLVEE
Ga0255812_10460319 | Ga0255812_104603192 | F053724 | MREIRVLVRLQVEECEPDAKFDREMMEDAAVEAVENVMRHATGVGFPHTYEEELSVCFVDAVLYEEPGDDDLDEE
Ga0255812_10461840 | Ga0255812_104618401 | F030486 | MRHNVRRYVALAQSRGGARFCTRGAMQDYAFITLF
Ga0255812_10480173 | Ga0255812_104801732 | F099327 | MATEGFFQGTSGEEISLLVDGVKIMALQNLSWKASQSKSPIRGAGYRKPHAMGRAFKEYEVDFEVKELNKAVIEEGVNSRRSREVQIKTFKIGDQEFSDLLDLRNCTILIVYPPKNNATRIIRFLGFEFTDVEGGFSIDDESVGRKLSGIALDAEGMV
Ga0255812_10562970 | Ga0255812_105629702 | F086459 | MIYAKDRILSELDSAGAQLASLAADLKASRAVLQARLKRPPPFMPLEEYDCHFANAFTHIEKAKMAVRAMHPVCHQADFLKKKTEKKGHRQ
Ga0255812_10609061 | Ga0255812_106090613 | F103328 | MFKKDDGAIDMVSLILTVVIAAVTMMVGLVVVANMESSMPDISGSALSTSLTNVMENTGTAFNFLALGLFVLAAVFIIGIVAGVLGGGQG
Ga0255812_10613846 | Ga0255812_106138462 | F072819 | GTAMCRGIGACIRRPDPTGDRVSIQADMIIRRGIANVTLIHGEDMLQDQEKADKASFKAALLENENTVKELIYFCRNNLNDLTVQCFTHKRFMSVQAFIDALRLVYQELGVEGDKGNVSDFVLFVAGEHRDNVYASHVCAITDEHRQVFAVRLGMDIEEIEANLSKLTWRIDGGI
Ga0255812_10643215 | Ga0255812_106432152 | F029922 | LEETIGFMEKCQTGILPLLSREELYSFGIIELSEMIMSIEHAISYTEGYRFLFLCFGSEETSDKAKAVMKGLEDYLFLVKDVYRFKVTEKKKRENFLKGVNA
Ga0255812_10710864 | Ga0255812_107108641 | F053724 | MREIRVLVTLQVDECEPDAKIDRETMQDAAVEAVENAVRFAYDNGFSHAYTDELCIGFVDAVLYEEDDEDDLGEE
Ga0255812_10763383 | Ga0255812_107633831 | F041155 | DIDDRAESVQVGVHRGLRADGDMSTVGFGLSALLSLGRLNLVESII
Ga0255812_10785887 | Ga0255812_107858877 | F066219 | LEANMKKVFLVLLILCIGCLIHANIVEFFESIPPVLRYAVGTVITVYALSWAYSWFFDPIIVVADTKYGAFNFGPFIVVEPCIWYSNDVEWRNTVLNHEYTHYVQHAVYGPILSVTYPILALYSNIKSGNQWDDNYWEIQAMQAPDTAPSWKPLAVWVW
Ga0255812_10797444 | Ga0255812_107974443 | F080801 | MADEVQIPELVMKTTKSQFRLRCHAIAWTKEYGGRLKMQRRHGQELPGMLLLSITGPDTAVKSIRATLYQPDVEAEFVLEGGDESQQMVKARIGCDGGPFVYGAAMAKLAPGVIHMVALAKIPGLMPNMSDDHLWAELTSPRYTTPLLRSWIP
Ga0255812_10819067 | Ga0255812_108190671 | F070092 | MAINDRLKPVMELLETNRQKIDLMISQGMASDLAIEKRKKELYDEVMAAKEKAFKDELADLEGEIRKIEDSYREPKKDPTTKLLEFEQIKAKIRSTPSKELKELTHKFQNTGAIPGIPWERPDHVDVLVAELRNRGLDEEADLTWDYAYNKLKVDRPWENNPLYKQLKSQHNKVSVLAGFKDMLKYLDGTNNAVYISETLKYKEE
Ga0255812_10836311 | Ga0255812_108363112 | F026449 | PASASASNSISKSMMISRRQHPLPPPAGDSGDEAYYPYVEEADDDANKDSA
Ga0255812_10838425 | Ga0255812_108384251 | F000769 | MRLTYRPLELERLQGFFDARAELSSGLDPPPGVVEAGCLQRAWDRFLTRRDVLKNAFGQLLPSELHTGQSVWSELRRGVGSTEQMLRFLAQSCKRSLLGDYACLHPAP
Ga0255812_10856889 | Ga0255812_108568894 | F054061 | MDDETFEIIKAGAPDAPPEQALYRIQQTYPDGSGGRLNIDWEGLLWLHELIHDRIALEGYVCETCDTKGCHRPATWEIECRGVGVSGRPI
Ga0255812_10914163 | Ga0255812_109141631 | F080820 | GPFPFLRGRFSGRLGAFGATRGLCTTRRFAMDGKQKDARYERWLARAAAAYERMFCDKNQKELVTLTEREEMAVALSKELAAFLLEEHVAADPAKAPAEASLGCCPKCGQPGTPAPPKGGKRGAGMPERMVRTRAGDIDIRRERWKCGRCRIVFFSARRSPEVGHGRV
Ga0255812_10937986 | Ga0255812_109379861 | F045728 | MSLASVPRESSGQKERPLMAAQAAPAPAGDSLLVSRFLP
Ga0255812_10937986 | Ga0255812_109379863 | F045728 | LRTLTGRYMSLASVPRESSGQKERPLMAAQAAPAPAGDSLLVSRFLPTRE
Ga0255812_10969483 | Ga0255812_109694833 | F053724 | MREVRVLVTLQVDECEPDAKIDRETMQDAAVEAVENAMRFAYDNGFSHTYADELSIGFVDAVLFVAEELDGE
Ga0255812_10997421 | Ga0255812_109974211 | F071272 | KAAVDQLADSMAAGAWQDGPAPKDGSWILGLFHGLPYVVWYDSWEVGGEMLPDGSGSPPDGHESGWCLAGDNIQVMDQDAPEKWARIIHPDRHMPTSWDGPGDLG

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice