NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006003

3300006003: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_2-Sept-14



Overview

Basic Information
IMG/M Taxon OID: 3300006003
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0115673 | Ga0073912
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_2-Sept-14
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 98989268
Sequencing Scaffolds: 23
Novel Protein Genes: 29
Associated Families: 29

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Predicted Viral | 2
Not Available | 13
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → unclassified Kyanoviridae → Synechococcus phage S-SRM01 | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Caudoviricetes sp. | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, Usa
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome | microcosm | sand
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372 | Long. (°) -119.272 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000919 | Metagenome / Metatranscriptome | 834 | Y
F000973 | Metagenome / Metatranscriptome | 817 | Y
F001968 | Metagenome / Metatranscriptome | 610 | Y
F003495 | Metagenome / Metatranscriptome | 483 | Y
F011078 | Metagenome / Metatranscriptome | 295 | N
F014573 | Metagenome / Metatranscriptome | 262 | N
F018713 | Metagenome | 233 | N
F019965 | Metagenome | 226 | N
F021301 | Metagenome / Metatranscriptome | 219 | N
F022403 | Metagenome / Metatranscriptome | 214 | Y
F022817 | Metagenome / Metatranscriptome | 212 | Y
F024100 | Metagenome / Metatranscriptome | 207 | Y
F025502 | Metagenome / Metatranscriptome | 201 | N
F026270 | Metagenome | 198 | N
F036108 | Metagenome | 170 | Y
F039047 | Metagenome / Metatranscriptome | 164 | N
F041196 | Metagenome | 160 | Y
F043897 | Metagenome / Metatranscriptome | 155 | N
F050335 | Metagenome / Metatranscriptome | 145 | N
F053242 | Metagenome | 141 | N
F058796 | Metagenome / Metatranscriptome | 134 | N
F062696 | Metagenome | 130 | N
F063698 | Metagenome / Metatranscriptome | 129 | N
F066500 | Metagenome / Metatranscriptome | 126 | N
F068747 | Metagenome | 124 | Y
F072345 | Metagenome / Metatranscriptome | 121 | N
F086620 | Metagenome | 110 | N
F100723 | Metagenome / Metatranscriptome | 102 | N
F104469 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0073912_1004337 | All Organisms → Viruses → Predicted Viral | 1326
Ga0073912_1005032 | All Organisms → Viruses → Predicted Viral | 1215
Ga0073912_1005718 | Not Available | 1125
Ga0073912_1006144 | Not Available | 1074
Ga0073912_1006779 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1015
Ga0073912_1006879 | Not Available | 1008
Ga0073912_1007197 | Not Available | 981
Ga0073912_1009020 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 861
Ga0073912_1011073 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 766
Ga0073912_1011401 | Not Available | 754
Ga0073912_1011986 | Not Available | 735
Ga0073912_1013330 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 693
Ga0073912_1014467 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → unclassified Kyanoviridae → Synechococcus phage S-SRM01 | 663
Ga0073912_1014746 | Not Available | 657
Ga0073912_1015596 | Not Available | 638
Ga0073912_1016109 | Not Available | 628
Ga0073912_1018071 | Not Available | 593
Ga0073912_1019327 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 572
Ga0073912_1020547 | Not Available | 555
Ga0073912_1021046 | Not Available | 549
Ga0073912_1021903 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Caudoviricetes sp. | 538
Ga0073912_1024437 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 510
Ga0073912_1025478 | Not Available | 500

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0073912_1004337 | Ga0073912_10043375 | F022817 | KASTHYLKNGKVHTGPVHKMNGQVHTGATHTASSKVLTHTKPKAKKK*
Ga0073912_1005032 | Ga0073912_10050321 | F086620 | MLTKKQEQGLRILESIVQSQYPFVVSLSKSDKYRLDEYATTMGIILEIDPTTLSKFVGLPFAKKFKETPSIWDYYMRDRDLSFILHLFEDEHQHIMGWQFNKGMEEFITKVYSQLPTNMRVNIYSDSPDYDIPDWARRTLHSPRTLTIDRFFLAPDSKPPKFED*
Ga0073912_1005718 | Ga0073912_10057182 | F011078 | VNKAEIIEELSRADWLTKATRNIAKNNELARELYQFYFLTILQKPDEQIEKIYNDGYIQFWTIRLLYLCINGNRHPFGESRIYDQYDVYDLHLSEEPDLLLEREEDERIEQKRINKINQVTESAYFYERELFKLWCSGMSARAIHRQTDISVREILRVVKLMKERCTTK*
Ga0073912_1006144 | Ga0073912_10061441 | F100723 | NLRGAIFVPTSRRQRKNTFKKKSTLTVPCGTISKMTNKKWTPRSLRWTDSERQAKKIQSQNVRQIRWLFWRESQALREEIQKEISEKLIAKREAIE*
Ga0073912_1006779 | Ga0073912_10067792 | F003495 | MKIKQDVTSKEVKVIPKDIAALDKRSVLAKVNSQGSGR*
Ga0073912_1006879 | Ga0073912_10068792 | F041196 | MNLQENIQRIKSMMGLLVEEQQDKSIVLLDGTSSAGKSHTLNHLNAVPYYEANNPNQWVIIATDDFSGTGELGADGEERRLKLDHPNIRQWAKENADAGIVSGNYRKDGKEVPENPYEDEYIQNTDPRLWYVAQEIKTGPFKKIAIDDIGKEILQYLPGVKLKYILLHAPLYILLDNVKWRNDRAKKDPNFKYDGRDVKMVLGQYSKKYEATQSKPDINEGDPTTVLTKGGITDLLQKNGMSDEHIDEFLNSINLTEDGD
Ga0073912_1007197 | Ga0073912_10071972 | F001968 | MFDELWSEIADAPGEIFDLPELRDLEEGKFDVNEYLNSN
Ga0073912_1009020 | Ga0073912_10090201 | F068747 | EDIKQSLYEWFVSHPRKLTEWEGFSKKSAQNLLYRSLRNQALDYCQYWKAKSLGYETSDLFFYDADIVEALLPAVLRGDITEAPVLNLGMPGKPSAPAEGGNMMAMMAEIKAAYLKLSTEDRHILYHKYAGSLSYGDIAIELALPSDDAARMRHNRAIKKLITRLGGFRSYLDKDETSEVGNDEPNQNEDAEQGQETD*
Ga0073912_1011073 | Ga0073912_10110732 | F019965 | MNDLEVMMQTVQIYIYQKKGVKVRIYLRDIRDINMLKQAYDYIQKNEHNKNTNN*
Ga0073912_1011401 | Ga0073912_10114011 | F050335 | VQLTIIKCKDELHLKNECIWNATTYPLFFIVALSITQQPDSYHPAFNDTNFVITESSGGIYTSSNFKFIANVKVAATSVAKLKAPIYFGSVNKGVFNIGRIMESYVSNNWSFTDTSPSGCVDSFSDYEVEFGYEYSPSATGTITEYLDLTSATGTVWNAALNPFDLVTYAQAQYLATSASAKFLTNVRTRYIHRTQKDWLYALKGDATSVVITY
Ga0073912_1011986 | Ga0073912_10119861 | F026270 | LLFEGGDEGYDGAVDQSPIVWLEIVDKIVKGDRTKWDFILQMPLIEFLNAMAFYKAKTKERQKRLEDAAGKGFNPYIVACLNEML*
Ga0073912_1011986 | Ga0073912_10119862 | F021301 | RQMNILAIALNLSMDEVESMTLDKLTSEFEKLSFLNDLPKAPIQFMFKLRGRYFKLAKTPNEMCGHHFIELQQVFNGDVIESLNKIVALLSVEVDFFGRNKKVVDAQAHYEDKCGLMMGLPVPLPYTYALFFLEVYPELLKNILCSLKEEMKDMTEQLTKVQ*
Ga0073912_1013330 | Ga0073912_10133303 | F014573 | MHIQDEQLRKELKKILAFKKRNSIVKEIQSNGSKFHFFQLTNFLQGKDVSLSTLKK
Ga0073912_1013822 | Ga0073912_10138221 | F072345 | MISYQSIVDKITTFYDNHLQVKKVGSDFKEQMVNFATKDEKYPLVYVVPTGVTPYENVTVFNIELYCFDIIQMDRANITTILSDTQQILQDLYLEFTFSDDYDFDIDGQP
Ga0073912_1014467 | Ga0073912_10144673 | F000919 | VEFRWSKLNNKYELVKWQECEGKEYCYVIAFFDKDKECYNMRTIGDRFFEDKDAWVVGKYALEFLNAIFQIEQDEEELK*
Ga0073912_1014746 | Ga0073912_10147463 | F063698 | ILAGGDGFKYHGTGTVTSVGYAALVVQEDTVFTSFSVDGTNVLSARGLSAITLQQGAYLPSGGASKITGFIISSGSVIGY*
Ga0073912_1015596 | Ga0073912_10155961 | F043897 | MTNEFAKISAESNMSDMITLSMEEITEANAWFDMVSAQFDEVEAAWDRLVEYVDNI*
Ga0073912_1016109 | Ga0073912_10161092 | F062696 | MNYGTFRIYCLCLVGVCFSACSPQSTLSRLLKNHPYLYENFRHDSIRIENVLVQDSVFFFTKEKDTITFNNATIYREFDTLRLVQSCPPCTTYVSKTILQPTQKLLKETRYKRSLREKLEDSIFPLIIGLLLGLIITRRG*
Ga0073912_1017110 | Ga0073912_10171101 | F025502 | EAILAIVQPVIDEQINALIAMIADLRNHMEEVMSEGEEVVEVEATKLSHHDKFSMVSKFLNNN*
Ga0073912_1018071 | Ga0073912_10180712 | F018713 | MTPDPNLWAEIPQEVKDAAILLGNYFKKQGLDSWTLYDVSSRQNFNGAYNQGLDTAISLVSEGSDMETIICGLENSKKQFSNEGGDGIDYTKSF*
Ga0073912_1018732 | Ga0073912_10187321 | F104469 | GSVAAVDSFSENKFVDTTKAPAGDSDYACFTTSQKCITSLMYLLDDMECPDYAFQSIMDWARNCFEAGFDFNPKSKTRLGNLKWMYDSLHNAKQMLPNVVSIQLPDPLPDTKSMDVICYDFVPQLLSILQNKEMMLANNLVLDPNNPLAMYKPQNSRLGEALSGSVYQDMYQRLVSNPTKQLLCPLICYTDGT
Ga0073912_1019327 | Ga0073912_10193272 | F000973 | AWELKGIRNILGSMWHSRYQDGETDVLNPEAFADEYISTEECGRRLSVSDQTLRNWMAMGRKNPEKGWVEGIHYVNASPDPNRKAIIRVPWNHLIRSFAKNRDLDAQDYRKKSSPMYVSTGFDRLE*
Ga0073912_1020218 | Ga0073912_10202182 | F058796 | DATHEVEGGLLVTTVGGMVTEIVEPEIEVEVEAEEFATVSAFNEVVAKMETAIAELTAKVATLTASNNTHKEAMSKAIDLIEKVADLPSEEPTKTPVSNKKNDQFEALKRLKNSLNK*
Ga0073912_1020547 | Ga0073912_10205471 | F022403 | MPLRHGQKFYCQLLLDRHRYLLVDEMAKQQGKRTTALLREMVYSALEKALPLSEYRAAEAADNAAWADSVKRRVQGRQRSRQDGSDTAQDS*
Ga0073912_1021046 | Ga0073912_10210462 | F066500 | QERFTKTWADNFNKLSEKETWLKDQSTPEYKRTVELLQRIPILTTLPNGLAHAVELMKLQDTAGRFQSVEAENKSLKEQLNKLQQKTAIGKSVPAGQLKAEEKDFSKLSQKEQRDALMRATREFDRESNQ*
Ga0073912_1021128 | Ga0073912_10211282 | F039047 | MKMKIEVDINDLVALKVALGNSARRIDELMNDKPDWRNLYEFDKNSVEK
Ga0073912_1021903 | Ga0073912_10219032 | F024100 | MADNSADIAKRIILGCVAEGMTIEAACASAGKSIKTYE
Ga0073912_1024437 | Ga0073912_10244371 | F053242 | GEWKKPNENMATGQQLLLKAFAQVPKFTVLVIIGNTDNEQTEVGDVFQVVLGKCVKIGEGLDFLKDFYILWYEFANSKG*
Ga0073912_1025478 | Ga0073912_10254783 | F036108 | LAITAINHYVDFLSSEIDFYEKEELLEDTDYQEHKSQLPEVYALLNWIKLEYFKHEN*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"