NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300024973

3300024973: Wastewater bioreactor microbial communities from Cape Town, South Africa - Thiocy_expt_300_biof (SPAdes)



Overview

Basic Information
IMG/M Taxon OID3300024973 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0118793 | Gp0095077 | Ga0209959
Sample NameWastewater bioreactor microbial communities from Cape Town, South Africa - Thiocy_expt_300_biof (SPAdes)
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?Y
Use PolicyOpen

Dataset Contents
Total Genome Size137607266
Sequencing Scaffolds16
Novel Protein Genes20
Associated Families11

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Thiobacillaceae → Thiobacillus1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Thiobacillaceae → Thiobacillus → Thiobacillus thioparus2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea7
All Organisms → Viruses → Predicted Viral1
Not Available2
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Spiralia → Gnathifera → Rotifera → Eurotatoria → Bdelloidea → Adinetida → Adinetidae → Adineta1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Spiralia → Gnathifera → Rotifera → Eurotatoria → Bdelloidea → Adinetida → Adinetidae → Adineta → Adineta ricciae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameWastewater Bioreactor Microbial Communities From Cape Town, South Africa
TypeEngineered
TaxonomyEngineered → Bioremediation → Hydrocarbon → Unclassified → Unclassified → Wastewater Bioreactor → Wastewater Bioreactor Microbial Communities From Cape Town, South Africa

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Water (non-saline)

Location Information
LocationSouth Africa: Cape Town
CoordinatesLat. (o)-33.936637Long. (o)18.478905Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F025775Metagenome200Y
F033506Metagenome / Metatranscriptome177Y
F033870Metagenome / Metatranscriptome176Y
F055860Metagenome / Metatranscriptome138Y
F075082Metagenome / Metatranscriptome119Y
F081545Metagenome114Y
F087441Metagenome110N
F092364Metagenome107Y
F094111Metagenome106Y
F095751Metagenome105Y
F103563Metagenome / Metatranscriptome101Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0209959_100011All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Thiobacillaceae → Thiobacillus659158Open in IMG/M
Ga0209959_100036All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Thiobacillaceae → Thiobacillus → Thiobacillus thioparus329946Open in IMG/M
Ga0209959_100058All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Thiobacillaceae → Thiobacillus → Thiobacillus thioparus259641Open in IMG/M
Ga0209959_100451All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Rhodanobacteraceae30574Open in IMG/M
Ga0209959_102037All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea6094Open in IMG/M
Ga0209959_103499All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea3643Open in IMG/M
Ga0209959_106890All Organisms → Viruses → Predicted Viral2036Open in IMG/M
Ga0209959_107964All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea1806Open in IMG/M
Ga0209959_118729All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea919Open in IMG/M
Ga0209959_119714All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea872Open in IMG/M
Ga0209959_125471Not Available652Open in IMG/M
Ga0209959_132738All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea561Open in IMG/M
Ga0209959_134282All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Spiralia → Gnathifera → Rotifera → Eurotatoria → Bdelloidea → Adinetida → Adinetidae → Adineta549Open in IMG/M
Ga0209959_137226All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Spiralia → Gnathifera → Rotifera → Eurotatoria → Bdelloidea → Adinetida → Adinetidae → Adineta → Adineta ricciae528Open in IMG/M
Ga0209959_138482All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Trombidiformes → Prostigmata → Eupodina → Bdelloidea520Open in IMG/M
Ga0209959_139227Not Available515Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0209959_100011Ga0209959_10001192F025775MKQLLTFQSFPMVVVTGLIIWAWISILSVLIASFF
Ga0209959_100036Ga0209959_100036115F025775MKQLLTFQGFPMVVVTALIVWAWISIFSVLIASFF
Ga0209959_100058Ga0209959_100058106F025775MKQLLTFQGFPMVVVTGLIVWAWISIFSVLIGSFF
Ga0209959_100451Ga0209959_10045112F033870MKVSMEAPQTPQHDLVAMTAALRGVLLQRVEHGIQSSGLTNYFFYLVAKRLTFAGPLTIPGLAEQLRLNPRDIEDSLHVMLERGMVTREDDAWRITDTGRRALTAANNSGEQVLDDIRNAIGAAACDHLVTGMRDALAALRKAA
Ga0209959_102037Ga0209959_1020372F094111LTLSYNQPRFCLNTIWNSIATSFADQSFLGTDPHGIFVNSNNSIYIPNRQTGQIYIWQNENHLNPTKTIIYVDNGQYNGRVDKWIVENETWISVMNATSPCFGLFIDIYENLYCSMFYNHRVDKKWSNSIISIESTKWNNCCRIWISKCNNQT
Ga0209959_103499Ga0209959_1034992F087441MEPLAVYGKSLIETNTNLSRSLSISILDEQGNEIPFETTSSNPIEFFIPRDPNLRIPPMILTNKTFHSLNLTTDLPISVHFEIKANFSYRFVYKFDRQTNFLNSIEVNNQNDFNFMLDNQQTVGHRLLIFGLEEENSNSSEYEYRVYSSGCYYLNKQNQWTSDGLRVGPKTNLSQTHCYSQFI
Ga0209959_106890Ga0209959_1068901F087441MEPLAVYGRSLIESNTNLSRSLSISILDESGNEIPFETTSSSNPTEFFIPRDPNLQIPPMILTNKTFHSLNLTTDPPISIHFEIKANFSYRFVYKFDKQSILTNSIQVNQSYFTFMIDNQQTVGHRLLVFGLQQENQNNSEEYEYRVYSSGCYYLNKENEWKSDGLRVSPKTNLSQTHCYSTHLT
Ga0209959_107142Ga0209959_1071421F092364MVMFVIGFINGICSFITFRNKELRKVGCVLYLLASSITSLLTITMFTVKFWFVVFIQLHSIISSSILQTDCAFIGPVLKLCLYLDGWLNACVAIERTVNVSKGITFDKKMSKRAARWIIPILFLIIMGTIIHEPISHELFEYPTQKYKPINNNLTMNKSMRNEIYSEYEIEHNVLCVIRYSRSVQTYNTITLFVHLVVPFLLNLCSALFIIFGTARQRSTTHNNQTLREHILKQINEHKQLLISPIILLILALPRLIISLVSGCVDPSNNPWLYLCGYFTSFIPPMLIFAVFVLPSELYWKTFKNSITKWRRHGCW
Ga0209959_107964Ga0209959_1079644F087441SLIESNTNLSRSLSISILGQNGNEIPFETNSDSIEFFIPRDPNLRIPPMILTNKTFHSLNLTTDLPISIHFEIKANFSYRFVYKFDRQSSFTNSIEVNQSYFTFMLDNQQTVGHRLLIFGLEQENQNISEEYEYRVYSSGCYYLNKENQWKSDGLRVGPKTNLSQTHCYSIHLT
Ga0209959_110871Ga0209959_1108711F075082MASSNSNKRRPRIQRIIQNFHLVWLDGNINENDDDCRHSIQKLKQVVNTVNTFVDVDECIDFINKIPEETAFMITSGAFGQTVVPAIHNKKQVNAIYIFCGNKDRHEKWAKEWPKVKGVFTDIKPICEALKQAAQDCDQNAVSISFAPTGDSTANKNTLECSFMYTQILKDILLTIHFDHDH
Ga0209959_113967Ga0209959_1139671F095751MYQAYPLKQYFDLFNRHARRANITTVYINSEDEQVFNEFDQINRKTQNYYKLISIQTTKNVVYRTLTNMSREQRGKIVLEFLTDLYIEANADLHAGTLTSNWCRLVDEIRLVLGKKIPYYTPENKYLMDW
Ga0209959_118729Ga0209959_1187291F087441MEPLAVYGRSLIETNTNLSRSLSISILDEKGNEIPFETNSDSIEFLIPRDPNLRIPPMILTNETFHSLNLTTDLPISIHFEIKANFSYRFVYKFDKQSIFTNSIEVNQSYFSFMIDNQQTVGHRTLIFGFEGENQEYEYRVYSSGCYYLNKENEWKSDGLRVGRKTNLSQTHCYLT
Ga0209959_119714Ga0209959_1197142F103563MVAIDRVGHVYPSVHLQEMLNLAQNSAPLATPTEYTSTHVNDPVASVMNGEIGQKISPSQFGLANNPSLQFSTPGGGFVQVFGPLKTPGNDKFFHDPNPVVIQKKSKPITYVQKINLAYYKPPAPPAPGPVIIKEVGR
Ga0209959_119921Ga0209959_1199211F075082MASSNVSRHKQSNQRIIQNFHLVWLDKNINEKRDDFRNTITKLRQVVNTVNTFVDADKCIDFINNIHEETAFMIISGAFGQTTVPIIHSKKQISDIYIFCGNKDWHTKWAKEWSKIKGVFTHIQPICEALKQAAHACDHNAVSISFATTRDTATSVSNKNA
Ga0209959_125471Ga0209959_1254711F033506LITVSVAAILAIFSGFARSRTDATNNLARLAGIVAVVLLFATGVLVAVALYTFVGAQYGTGYSFTAMAIAQFLSFLAALLVAHWMVKFNDK
Ga0209959_132738Ga0209959_1327381F055860KRFDFCSGILVDMCWMASLSVFVILTIHSVYLYDPIKGSGPAKVDAVQPLDRSHVLASISIIDHDVYINYHKGVHLDQYRVSTTSQWTLEKRFSKSDCCEAKDIGIRDVRCDAQSICLSIMQQGDLKWRLDIMSRDMKRIRRGTPMDAGENQHKFFSMLISLHDQRWLFVNWYTNKLWLVDQEGKPG
Ga0209959_134282Ga0209959_1342822F087441MEPLAVYGRSLIETNTNLSRSLSISILDEKGNEIPFETFSNPIEFFIPRDPNLRIPPMILTNETFHSLNLTTDLPISIHFEIKANFPYRFVYKFDKQSSFTNSIEVNQSYFTFMLDNQQTVGHRTLIFGFEGENQEYEYRVYSSGCYYLNKENEW
Ga0209959_137226Ga0209959_1372261F094111YQQQYLTSATFSFNQPRFRINATWNLIATTFANQSFVGTFPHGIFVNSNNSIYIANRQTGEIHIWLNENHLNPTKTISGNLSNPFSLFVTTNGDIYVDNGNNRRVDKWIRENERWISVMNASSYCDGLFIDIYENLYCSMYHNHRVDKKWSNDTTTIVAGTGVQGSQSDLLNYPC
Ga0209959_138482Ga0209959_1384821F081545MDHQQLRSIIIKLETRLSDDDRERLHFYLGNDVPRKIRDDASLRGTLRLMDSLFDQDKINEQDFTFLIRAFEQIQCIDAVNLLKG
Ga0209959_139227Ga0209959_1392271F094111QRNLIVSVRKIVQKSFHFDFVVLTLSYNQPRFCFNTIWNSNATTFANLSFVGPLPYGIFVNSNNSIYILNQQTGQILIWLNENDLNPTKTVSGNLSGPLSLFVTTNDDIYVDNGKNGRVDKWIRENETWISVMNVTSKCYGLFIDIYENLYCSMQDNHRVDKKWSNGTTTI

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.