NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027555

3300027555: Marine sediment microbial community from Union City, CA, USA - Pond 2C Sediment 2 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027555
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0053056 | Gp0053725 | Ga0207752
Sample Name: Marine sediment microbial community from Union City, CA, USA - Pond 2C Sediment 2 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 300195301
Sequencing Scaffolds: 19
Novel Protein Genes: 19
Associated Families: 18

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Archaea | 7
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → Dehalococcoidales → Dehalococcoidaceae → Dehalococcoides → Dehalococcoides mccartyi | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 1
Not Available | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Environmental Microbial Communities From Fremont, Ca And La Paraguera, Puerto Rico
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Unclassified → Unclassified → Enviromental → Environmental Microbial Communities From Fremont, Ca And La Paraguera, Puerto Rico

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome, pond, sediment
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: Eden Landing Ponds, San Francisco, CA, USA
Coordinates: Lat. (°) 37.569017 | Long. (°) -122.102433 | Alt. (m) N/A | Depth (m) 0.11


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F005815 | Metagenome / Metatranscriptome | 389 | Y
F009854 | Metagenome / Metatranscriptome | 312 | Y
F024682 | Metagenome / Metatranscriptome | 205 | Y
F025653 | Metagenome / Metatranscriptome | 200 | Y
F028081 | Metagenome / Metatranscriptome | 192 | Y
F029294 | Metagenome | 189 | Y
F038920 | Metagenome / Metatranscriptome | 165 | Y
F048391 | Metagenome / Metatranscriptome | 148 | Y
F058721 | Metagenome / Metatranscriptome | 134 | Y
F060598 | Metagenome / Metatranscriptome | 132 | Y
F068265 | Metagenome | 125 | Y
F076229 | Metagenome | 118 | N
F078220 | Metagenome | 116 | Y
F091298 | Metagenome / Metatranscriptome | 107 | Y
F094064 | Metagenome / Metatranscriptome | 106 | N
F096272 | Metagenome | 105 | Y
F096533 | Metagenome | 104 | Y
F098703 | Metagenome | 103 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207752_1000309 | All Organisms → cellular organisms → Bacteria | 23304
Ga0207752_1000492 | All Organisms → cellular organisms → Archaea | 18508
Ga0207752_1001094 | All Organisms → cellular organisms → Archaea | 12598
Ga0207752_1001097 | All Organisms → cellular organisms → Archaea | 12581
Ga0207752_1001196 | All Organisms → cellular organisms → Bacteria | 12066
Ga0207752_1001456 | All Organisms → cellular organisms → Archaea | 10775
Ga0207752_1001553 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → Dehalococcoidales → Dehalococcoidaceae → Dehalococcoides → Dehalococcoides mccartyi | 10333
Ga0207752_1002145 | All Organisms → cellular organisms → Archaea | 8351
Ga0207752_1002397 | All Organisms → cellular organisms → Archaea | 7699
Ga0207752_1002908 | All Organisms → cellular organisms → Archaea → Euryarchaeota | 6740
Ga0207752_1008654 | All Organisms → cellular organisms → Archaea | 2947
Ga0207752_1008915 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2880
Ga0207752_1009650 | All Organisms → cellular organisms → Bacteria | 2702
Ga0207752_1013255 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 2142
Ga0207752_1026369 | Not Available | 1349
Ga0207752_1031317 | Not Available | 1213
Ga0207752_1035218 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1129
Ga0207752_1054188 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 872
Ga0207752_1097148 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 624

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207752_1000309 | Ga0207752_100030925 | F068265 | MTKKNQSAAVPVEGAMFFPEDFTRLHRRHVIYIKQGVIIDLDKL
Ga0207752_1000492 | Ga0207752_100049216 | F058721 | MLKTTLPCADPQCADKMRLVFQNERFLGYRCLLKPKSHNFRYDIERKRWEKIIIKTKPIIGYKKSPYDVALDEDVAIETI
Ga0207752_1001094 | Ga0207752_10010944 | F038920 | LLTEILTDQQLIDLYTTPGYLVAVDYPKKEIKLHMVDCMLADPISSVGVKPSKARENKTGEFWFSESREEANSKAEEIAKNKEYTYKICPICNR
Ga0207752_1001097 | Ga0207752_100109711 | F028081 | MDKEKALQKLQKTRNENEQAYLKAKAFLEGFRARGQLSKKDNEFLFLLEFVIKGFKNHGNDIITAFENQVRFTEAFNNLQAKVNDLEHDIRQLRKTLDKMYQDR
Ga0207752_1001196 | Ga0207752_10011968 | F091298 | MSSAVTLRVSSSAQVLKHLSIGEPIQDTRVTDLLDLDPLVVSRWLCGEDLRGVYQPIVVRRCALDGIDLEGRTFYEIVELTGCCIAAAHLRNAYFYSSLLVEDCVFDGDFDGRGMQSDGRVVFHNTIFTGWADFSAISLRGRADLVDVSFPGGTNLLRNLGDGSRTLLGRQVNLSGCRFRAADIPDGLDAARWGVLPLVDHDLGSLERQWRKLVGSKKSHLVGVIGRLFFPSHQPGTVDNDDACSALVLSSGCNVK
Ga0207752_1001456 | Ga0207752_100145614 | F078220 | LDFASGDDYIYSMFEAVVDVVHGKKREKLQFAKCPQCGCLKSFRPKSKGIQSTAMRCSNCGSIICFGVRGMGSNFRDLAEIFCENCEDDCRKCPIYKLANEQNRVKRSKLGKRSLNR
Ga0207752_1001553 | Ga0207752_10015534 | F076229 | MRREESAYVAFGQYWLQARHQETSRLWLTNVMALVFALLLAMIAWQGLVYWYMAAFGLSLSLFGFFVNHAMRVLGVRYGRVAAALMDAELGMGDYRRFIEGGDKTGFQGAWENLWSLHMAFVLFYCFGIAGWSALLTMTQNTPLWFTVFVFALMIVVSLVFYFAILWPRERQAETESLASVKRQERQKKKSSGPSA
Ga0207752_1002145 | Ga0207752_100214514 | F025653 | MSDPIKFFETKLKDMSLAELQDYKKRLEENIQQMIAKTAPNEKIAPLIIYRGILEHEIKARTTQR
Ga0207752_1002397 | Ga0207752_10023976 | F024682 | LSAEDELISKLKEEIGKTVPPMFAEIAKDMMEKNKEVIINLLKDNKNLVKEVIES
Ga0207752_1002908 | Ga0207752_10029085 | F098703 | MDYMICKLVTCMVKKAIIVINLLPEASKASNSQIEEKIRKEAKIPLCNNIEQVSVEDNEASYMKLKKHGVSDNVARNLVDLYTE
Ga0207752_1008654 | Ga0207752_10086544 | F096272 | MEDLKKLHCKPKVSIAEINDFVLQHPKWAASIIRNAIGFEMYREFCLCCENFDKCYEKLGAIKRENSLGCICNEFLNQEHSEANRRRLRAYFKEVAVILNL
Ga0207752_1008915 | Ga0207752_10089151 | F091298 | MSSVVTLRVSSSAQVLQHLSIGEPIQDTRVTDLLDLDPLVVSRWLCGEDMRGVYQPIVVRRCALDGIDLEGRTFYEIVELVGCHIAAAHFRNAYFYSSLLVEDCVFDGDFDGRGMQSDGRVVFHNTIFTGWADFSALSMRGRADLVDVSFPGGTNLLRTLVSGSRDQLGHEVNLSGCRFRAADIPDELEAARGGILPLVDRDSGGMERQWRKLVRGKKGDPVGIVSRLLFSSHQPGTVDNDDVCTALVLSPGCNVE
Ga0207752_1009650 | Ga0207752_10096502 | F048391 | MEFKGDPMDDRISVEPINTLDYLNELLGKGYVIKGPRKDSSRDLISFKAFLKKGKEFAPEGWLLHMGYEFIEPNTFTKGHKIAYKIIDEIPDERFNSNYTLVKENREIPLYLKVAVLKAE
Ga0207752_1013255 | Ga0207752_10132552 | F009854 | MSASDTRAELFDALRELSEIVPEMRAGQLIAAVGEVCADLHGRGLWEASDAEVLEAVWRFRRNFETATAKASDHDA
Ga0207752_1026369 | Ga0207752_10263691 | F060598 | FAVIFGVRRAFHLDDVRDIGFALVTGRPRPLTHGTFQHLLHRIPGEKARSFYAATARLAVQSLGEGIRRISLDGHNLPRYTRIVEVVKGKIGNTGRILKAEELVLAYDLDAHLWLAVRIYQGTKKLSKGIVEIVQEILEHRGDLSGLLRIFSDKGAYGGHIFQALAAEERVRFYIPAVRYPSNVEQWEQLQDTDFDPEPFIFDKHADLPADQRPTYRLADTEMEINVWEGQKIVDTVTLRAVVLHDPQGEKPAERWPVAFLTDDQHIDARALLNEYGDHWGQEFAHRIGKHDLCLDILPPGYVLKTRRDEQGQLVREVEDDDTAFFLSAWLRCLVFNLMTCFAQAMGGEYTKMWAGTLLRKFIRRPATLYLVDKELHVVFDPFPDQDQLQPLLDELNAKRTALPWLNDLVVQFRIAEEEPLHPLTELEKRNRLFGDG
Ga0207752_1031317 | Ga0207752_10313171 | F029294 | MTQETGETRDAIIEDDQKKLEAILAANKHRRDPYWVVIFAKPSKNSVDGKPTLVKHVKAYGVKPTPQVGMITGEVNNSTGEIKWEVNMPQRPFDFDALQQFGAKPCNEAVIETT
Ga0207752_1035218 | Ga0207752_10352182 | F094064 | MAKVMKDYSGPVCPMIHLRDLGKETVIKLAVQFARAYSQIDGHWYDAVAKRYGEQVARDLDFEVWMRNIPPTAKRTLEALGHKEMDMAAILKVTQFHPAAGGGEYLWDYEVELKSPNVGIVKVLKCRPYSYYMEKGDLDFVRVMCREWDIPMFGITGTAIIPNVKCRPLVIPPYDAPYETKPEPGVVCAWEYSIEPE
Ga0207752_1054188 | Ga0207752_10541882 | F096533 | MSETLNIMLEDLNSETQQDVLNFYGYETAAEGNLDVMPLFVLETDDRKEQ
Ga0207752_1097148 | Ga0207752_10971482 | F005815 | MTMKVTPLRTYWSPDEAATAIDLLDILREALWQTYGDQITKMHREIHEEHVSDDNQCKLAFDDDLPF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice