NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026856

3300026856: Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_40_50 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026856
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0127627 | Ga0209852
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_40_50 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published? N
Use Policy: Open

Dataset Contents
Total Genome Size: 42821247
Sequencing Scaffolds: 22
Novel Protein Genes: 24
Associated Families: 24

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1
Not Available | 4
All Organisms → cellular organisms → Archaea | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Chelatococcaceae → Chelatococcus → Chelatococcus asaccharovorans | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → cellular organisms → Bacteria → Nitrospirae | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → unclassified Candidatus Nitrosotenuis → Candidatus Nitrosotenuis sp. | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, Usa
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome | river bed | sediment
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372 | Long. (°) -119.272 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000402 | Metagenome / Metatranscriptome | 1180 | Y
F000545 | Metagenome / Metatranscriptome | 1038 | Y
F002241 | Metagenome / Metatranscriptome | 579 | Y
F003507 | Metagenome / Metatranscriptome | 482 | Y
F004115 | Metagenome / Metatranscriptome | 452 | Y
F005974 | Metagenome / Metatranscriptome | 384 | Y
F007188 | Metagenome / Metatranscriptome | 356 | Y
F007370 | Metagenome / Metatranscriptome | 352 | Y
F009436 | Metagenome | 318 | Y
F012785 | Metagenome / Metatranscriptome | 277 | Y
F017449 | Metagenome / Metatranscriptome | 240 | Y
F018483 | Metagenome / Metatranscriptome | 235 | N
F021644 | Metagenome / Metatranscriptome | 218 | Y
F022276 | Metagenome / Metatranscriptome | 215 | Y
F025377 | Metagenome | 202 | Y
F030538 | Metagenome / Metatranscriptome | 185 | Y
F046963 | Metagenome | 150 | N
F050637 | Metagenome | 145 | Y
F056187 | Metagenome / Metatranscriptome | 138 | N
F077084 | Metagenome | 117 | Y
F082894 | Metagenome / Metatranscriptome | 113 | Y
F084286 | Metagenome / Metatranscriptome | 112 | Y
F085295 | Metagenome / Metatranscriptome | 111 | Y
F103989 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0209852_1000003 | All Organisms → cellular organisms → Bacteria | 11278
Ga0209852_1000047 | All Organisms → cellular organisms → Bacteria | 2799
Ga0209852_1000105 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales | 2289
Ga0209852_1004485 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 767
Ga0209852_1005860 | Not Available | 701
Ga0209852_1006147 | All Organisms → cellular organisms → Bacteria | 690
Ga0209852_1006930 | All Organisms → cellular organisms → Archaea | 661
Ga0209852_1007552 | All Organisms → cellular organisms → Bacteria | 641
Ga0209852_1007702 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Chelatococcaceae → Chelatococcus → Chelatococcus asaccharovorans | 637
Ga0209852_1007745 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 636
Ga0209852_1008846 | All Organisms → cellular organisms → Bacteria | 608
Ga0209852_1009220 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 599
Ga0209852_1010216 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosotenuis → unclassified Candidatus Nitrosotenuis → Candidatus Nitrosotenuis sp. | 578
Ga0209852_1010240 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 577
Ga0209852_1010445 | Not Available | 573
Ga0209852_1010821 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 566
Ga0209852_1011992 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 545
Ga0209852_1012170 | Not Available | 542
Ga0209852_1012271 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 541
Ga0209852_1014094 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 514
Ga0209852_1014432 | Not Available | 510
Ga0209852_1014619 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 507

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0209852_1000003 | Ga0209852_10000037 | F084286 | MAFNCGVSFGACMVRITKVDENGNVIAGNNSYVTDKPISISVNPNIETGNAFSVRNGCGCSISRRKFPDTFNWWELSLQTATLEPEMIAFMLGADTIDDGADVVGVAFPSALACEDANPAVAFEFWSEHVVGSGLDATYPYFHWVFPSSVWQIGDNTFEEGPAQPTLNAFTQTNGNWGDGPYGDGPPDSQDISEGGFWATADALPTAECAAQAVTATS
Ga0209852_1000047 | Ga0209852_10000471 | F007370 | MPHPEPEDSRVVSVRLPTTLVQRLDRVLDWHMTHRRRPTTRNAALREALGDWLDQHEQLAGLLDPESLRQQFRAAYDSLRPSPDGVPIPRLRRLLPWPRERFNTVLEALRAAQAIDLEPLSAQVGDTQATHDSYQVYGQCYGRLRWRA
Ga0209852_1000105 | Ga0209852_10001052 | F022276 | MSGVLALAVAIGLVFGAHKLCNLIRAPRWFTKFLFIGAFLSITYFLAEDILPESKKPWGIWVALLGAATGIIPVGFIVFKHLDDGNDRTSHDSTQSSLHK
Ga0209852_1004485 | Ga0209852_10044851 | F003507 | MYACGLREVQADRAQAHFVLPETLAQFRSRLDQELMDTLVAIQAAAAIEAGFVSPAHLLVDTFPAEQGSQRVTDATTLYKAQKKTCSSSSGSPDRPLSSRPH
Ga0209852_1005860 | Ga0209852_10058602 | F005974 | VMTRFALDDFLRGNALERAQVYEILNRIGAMSVEQIQREEDLIPNES
Ga0209852_1006147 | Ga0209852_10061472 | F025377 | MRAYWKWVIKSVPSLGLGLLMLLSTVTPRQARSNVEAWLIYFGIENIPPWLADKSTDTWVFWIAFVGFCVWAIHLYLRPNLRNGKLSVMIRAGEPWVQVDPGIDQSQAAHTSGKLYTYRIALINSADSTLRNVEVKLMSLEKKPQNFHAMGSHLKLRHDQVGTTNFNVHPTK
Ga0209852_1006930 | Ga0209852_10069302 | F046963 | MALTIELKETLHQLIDNLMINPQQMRNMAYDLKPLTKDDVEVAFGIFVGYVTGGFAELFFESQKRSMTASELIQVRKILFERALEMKKAIAYVRAQESKS
Ga0209852_1007552 | Ga0209852_10075522 | F007188 | HVLLPVLLAALRTLGDAPTRSLTALAQRLGVSEATADAMVTPLADEPAPVAAVPAVAPASPLLPMTGRNDGSSAPKTLLNRKRVIAGRKAIIR
Ga0209852_1007702 | Ga0209852_10077021 | F085295 | MNGLSASGLADLAGVTEAEVQRLVDLGILVARDSAGPFLETDVQKVRLATACERAGLPMAGIAAAIRGGRLSFAFLEAAPYRRWAVRSARTYRQVSEESGVPLDTLAGVLGAMGFARMEPDERIREDELEIVPLLRLARSTGILDPVWLARIGRAYAEGLRLAANVENEAYRA
Ga0209852_1007745 | Ga0209852_10077452 | F018483 | MMCQAVEAYIYSKKGVVVKINRIAIISDSRQMEMLAYAYAYANGDR
Ga0209852_1008846 | Ga0209852_10088462 | F077084 | RLGDDGSGVRAAVAALGRRGKTTRIPDAVRAQALAYSRRQRAAGHSWVRIAHAVGVSVGALQNWLRTPPPARTLVPVAVASEMPAGALVVVSPGGYRVEGLDLPTASALLRTLG
Ga0209852_1009220 | Ga0209852_10092201 | F009436 | MELAHISSAAELLTIEHPHFGRIHPPGLLKSTIGTFTGTSQKAAAIADYSSGNRQTVAFQIDPRNFMVTIASSYVLVTLKEGASRHPVIHKTWRLVGQRKGEEVTEFFGTVHTYIDSDGLKWEHRFERSPHEDELIGFGSSGEEILIITLESFYPTTAPDLRGVII
Ga0209852_1010216 | Ga0209852_10102161 | F103989 | MNMGKSAIKESNNHQEFDGEINATDIFMRDLRRSYPEFAKALEEFMKKELQRLDDELK
Ga0209852_1010240 | Ga0209852_10102401 | F056187 | MSNYLDDYVSVQDRLKEFINAYPDYRIKTHILAESLVANCDVYIIKTELYRTEADLHPWTTGLSS
Ga0209852_1010240 | Ga0209852_10102402 | F004115 | SQEWWKVQSAKIRISISLDMPLSWVDYDFRVQKIGTTLSLTRNHNSNQYCDYCKYRWGQNKNGWDLRATTPAVWKVQSETPLRKAQVRFYCQPCADDAQNWPDGTFYSLKEQLEDAINDFAGREKLDVQLPR
Ga0209852_1010445 | Ga0209852_10104451 | F017449 | VGGRRTRRPGEISIPPGSGFGKGRSCTCWPRPNDARPCTPQTCDWRCRACRVHGRPFYSHGDLQYLEHGTGPLVAELDRLWWVRGTGKRRSARAGVWPWR
Ga0209852_1010821 | Ga0209852_10108211 | F082894 | MQALSNAEILERALAKARANNPEWKPILPNRPEDLIEMGHENVVLFERGFAQAFWGMAPYTVVPASNGGSRSQDNDIPAWRYHLQGLAVAPDLFAYLAKFV
Ga0209852_1011992 | Ga0209852_10119921 | F000402 | GHNDCTSGVCPGASENGETVSLYVDYNHPLLQLKRALPWEALFEVLTRHWRRAGKNTDGRPGLAWDVTLYVPLVVLMLVKNLNARDMESYLAENVVARVFIGRQDAPRPQIRDHSNIARAYAALGKEGVDEINALMLHVAKDFGFADVSILSADTTAQELPIGYPNEPGILRGLAQRCGR
Ga0209852_1012170 | Ga0209852_10121701 | F050637 | MQLNSNGQIQRSAAQWGEIIARYRQSGMGSRQFCEEEGLTLRTFEKWYGRIRRSETSKGKFVEVQAPLGTGGPWAVEVEFPTGVRLRVRG
Ga0209852_1012271 | Ga0209852_10122711 | F030538 | RSWHYSNVVMKDRIAELTASRQRLCRQLPDPSPIFPGSLLSRMIHCNKPGCRYCEKGKGKGHGPIWILSVSLGNRKVRQIPVPVEFKQEVEKGLRSFAHMRERLKQIAHINVELLKERKRRR
Ga0209852_1014094 | Ga0209852_10140942 | F021644 | MSTTRLRLVVAGETLFPPRALFSWRTWGTSRFPTPLHAHSA
Ga0209852_1014432 | Ga0209852_10144321 | F012785 | MVIHYPPEQLRYNSDQPTAITDVLVATSGSSARTGDIPLGTPTCSVRTRHGRWALYPHGDEVVVVWPGGALVGPAVEVAGALDRLGGDRHDHLDRAAAAAIRRFLTDP
Ga0209852_1014619 | Ga0209852_10146191 | F000545 | WALGLSEMKQDAHPFKCSTCLAVTPHIELYRYETSDIPEAPEEVWLIECQRCFLQRIIYPSDRVASKEDDITRCDKCGNWKMKSGKCRICRLAAGFEQISVKYWTGNATMERPYNDEQTPLY
Ga0209852_1014619 | Ga0209852_10146192 | F002241 | MSRPHSIRYIRQLMEWGFDKEFIARDCGINVSSLDVRLNRAKKREQDGNQGTE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice