NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006006

3300006006: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_12-Aug-14



Overview

Basic Information
IMG/M Taxon OID: 3300006006
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0115671 | Ga0073916
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T4_12-Aug-14
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 131149379
Sequencing Scaffolds: 31
Novel Protein Genes: 36
Associated Families: 35

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Predicted Viral | 3
Not Available | 16
All Organisms → Viruses | 2
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 2
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Nitrospirae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CRM01 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter → Campylobacter ureolyticus | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, USA
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, USA

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome, microcosm, sand
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372 | Long. (°) -119.272 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000166 | Metagenome / Metatranscriptome | 1810 | Y
F001165 | Metagenome / Metatranscriptome | 760 | Y
F001190 | Metagenome / Metatranscriptome | 753 | Y
F002909 | Metagenome | 521 | Y
F003111 | Metagenome / Metatranscriptome | 506 | Y
F003299 | Metagenome / Metatranscriptome | 495 | N
F003537 | Metagenome / Metatranscriptome | 480 | Y
F003690 | Metagenome / Metatranscriptome | 473 | Y
F003784 | Metagenome | 468 | Y
F004723 | Metagenome / Metatranscriptome | 426 | Y
F004953 | Metagenome / Metatranscriptome | 417 | Y
F006111 | Metagenome / Metatranscriptome | 381 | N
F007203 | Metagenome / Metatranscriptome | 356 | Y
F007919 | Metagenome / Metatranscriptome | 342 | Y
F008161 | Metagenome / Metatranscriptome | 338 | Y
F008490 | Metagenome / Metatranscriptome | 332 | Y
F009204 | Metagenome | 321 | Y
F011078 | Metagenome / Metatranscriptome | 295 | N
F011485 | Metagenome / Metatranscriptome | 290 | N
F015867 | Metagenome / Metatranscriptome | 251 | N
F025096 | Metagenome / Metatranscriptome | 203 | Y
F026270 | Metagenome | 198 | N
F033033 | Metagenome | 178 | N
F038081 | Metagenome | 166 | N
F040531 | Metagenome / Metatranscriptome | 161 | Y
F051707 | Metagenome | 143 | N
F052464 | Metagenome | 142 | N
F052965 | Metagenome / Metatranscriptome | 142 | Y
F057379 | Metagenome | 136 | N
F064517 | Metagenome / Metatranscriptome | 128 | N
F075465 | Metagenome / Metatranscriptome | 119 | Y
F082169 | Metagenome | 113 | N
F083917 | Metagenome | 112 | Y
F089902 | Metagenome / Metatranscriptome | 108 | Y
F099283 | Metagenome | 103 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0073916_1002611 | All Organisms → Viruses → Predicted Viral | 1593
Ga0073916_1003145 | All Organisms → Viruses → Predicted Viral | 1458
Ga0073916_1003854 | Not Available | 1332
Ga0073916_1004787 | All Organisms → Viruses → Predicted Viral | 1201
Ga0073916_1007105 | Not Available | 997
Ga0073916_1008515 | All Organisms → Viruses | 918
Ga0073916_1009229 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 885
Ga0073916_1009872 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 857
Ga0073916_1011035 | Not Available | 813
Ga0073916_1011385 | Not Available | 802
Ga0073916_1011668 | Not Available | 793
Ga0073916_1011790 | All Organisms → cellular organisms → Bacteria | 789
Ga0073916_1012214 | All Organisms → Viruses | 777
Ga0073916_1013022 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 754
Ga0073916_1013255 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 748
Ga0073916_1014157 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 727
Ga0073916_1014656 | Not Available | 715
Ga0073916_1015439 | Not Available | 699
Ga0073916_1016501 | Not Available | 678
Ga0073916_1016852 | Not Available | 673
Ga0073916_1017477 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CRM01 | 662
Ga0073916_1018906 | Not Available | 638
Ga0073916_1018942 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Moraxellales → Moraxellaceae → Acinetobacter → Acinetobacter baylyi | 638
Ga0073916_1019062 | Not Available | 636
Ga0073916_1019425 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 632
Ga0073916_1021663 | Not Available | 602
Ga0073916_1022354 | Not Available | 594
Ga0073916_1022869 | Not Available | 588
Ga0073916_1024675 | Not Available | 568
Ga0073916_1026150 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter → Campylobacter ureolyticus | 554
Ga0073916_1027661 | Not Available | 541

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0073916_1002611 | Ga0073916_10026114 | F002909 | GSLLQQIIVEWWNKKVIPPIWANLDANGTNASSKLRQSFIPGTITKSPTSINTILLAEDYWEFVEYGRKPTRGGHIEGTPYLWQSLKTWISQKGIKPAEDQTYDSLAKAIAKKIHRSGTKAQPFLEKAFTESIQMELVNELNARFGDLIFSEDIKI*
Ga0073916_1003145 | Ga0073916_10031452 | F007203 | MYVQMQGVNLAPKVKELEKRIEMLENVVNELKLDKPRMGRPPKDKHGTERLEVNTTGRD*
Ga0073916_1003854 | Ga0073916_10038545 | F011078 | EELSKAEWLTKATKNIAKGNELARELYQFYFLTILQKPDEQIEKIYNDGYIQFWTIRLLYLCINGNRHPFGESRIYDQYDVYDLHLSEEPDLLLEREEDEQMEQKRFNKINQVTESAYFYERELFKLWCSGMSARAIHRQTDISVREILRVVKLMKERCTTK*
Ga0073916_1004787 | Ga0073916_10047871 | F026270 | DDRAVDQSPIVWLEIVDKIVKGDRTKWDFILDMPLIEFLNAMAFYKAKTKERQKRLEDAAGKGFNPYIVACLNEML*
Ga0073916_1004787 | Ga0073916_10047872 | F099283 | VALSITQQPDSYHPAFNDTNFVITESSGGIYTSSNFKFIANVKVAATSVAKLKAPIYFGSVNKGVFNIGRIMESYVSNNWSFTDTSPSGCVDSFSDYEVEFGYEYSPSATGTITEYLDLTSATGTVWNAALNPFDLVTYAQAQYLATSSSAKFLTNVRTRYIHRTQKDWLYALKGDATSVVITYSDASTQTFTLPSSKVVRIPVGSQLTIPGGATYFDVVLKLGGTAKSE
Ga0073916_1007105 | Ga0073916_10071052 | F004953 | MNKSMMLVQATWQESQTFRMIPTSTDCPYVECIFDPTSNVFVVISNIKKTTLHMLPKLDEYGQPLTGSKGMKQERHKLEVFQEFYVEDKKATEELIKAFAVNAESFDYKKFMKEAKKA*
Ga0073916_1008515 | Ga0073916_10085154 | F089902 | MMTERAQKLMDAIYQERNTWADTEQKLVAAIIRKTMEHVKSMTAQKLNNLTVLDRGDMMTLSKEVENL*
Ga0073916_1009229 | Ga0073916_10092292 | F011485 | KLFVASVLVSVAVFAADTEPTVNASINAGYNNHYIVNGLAKTSGSAFAGFDIGKTYFGVDGYVGGVILPDSNSIDESHWKLGVGKALKISEKFSLRGDLQVLRHQSSILGGRNSTEIAPKIALVNPYLTPYIRGSHDFNLGQSGYIVGAERPTDVFGWFTLTPTVEYGKFTDYDVVAVKIGVSRTFFNHLQPYAEVGYYDNNFQSSKYKFASQEFSGDVVAVAGVRWNF*
Ga0073916_1009872 | Ga0073916_10098721 | F052464 | MDTQVQPKEIDDWLKQSVALVAGSIAGLGTNQRKVRYFAFEGQNDEVTK*
Ga0073916_1011035 | Ga0073916_10110352 | F057379 | MGKYIKGDVNLSMLYLKEIPEILNGIDIDGSLILSGNGLTTLNNFPKLVTENIDLMRNPLISLKGIKQNFITTLKVGGKFPDFEGCPPNVTRLTCVGNIRLTSLKHMPTNISEIKIMAGSLKSLEHLTQNQRCTYNLMDCRIKSLVGLPNRCRALNVAFNDFEDFTGAPEIVDGTFNCGHNPIVSLKGFPKQADIVVIHNFM
Ga0073916_1011385 | Ga0073916_10113853 | F008490 | MMNTSENSMMNDMGMEEDVMPYPAPDKQYPNASKYSSYDSIQTGAMGKAAK*
Ga0073916_1011668 | Ga0073916_10116682 | F009204 | MKTTIKYYTQNIYGVRREKFIDKNQERVFFQLTGRRTLDSVSRELIRDLSGSSIEFEQSLPPE*C*
Ga0073916_1011790 | Ga0073916_10117901 | F038081 | MILNKQEIKNILSMLLSEDKENAIIAFSCLNNYANKKHLGELLVLYQFGRTSAEVWEKECKKVFKLIKNVLKLDSTDYRLPSSTVFSALINNKCSEQSIELYLELFTNQLSNNLYNMGYPTDKLEINVKLKDND*
Ga0073916_1012214 | Ga0073916_10122143 | F001165 | NDAPKPRIPDSVDAQRLEAMQLVARMKESADKFGVGFVGGFIAPNGEKFMMSNMDEADTQALLPEDLK*
Ga0073916_1013022 | Ga0073916_10130223 | F033033 | PEDQKEFEIANQSADMYAVICHLEERLRSYRKHGNDFENVNEALDAIHTILYDELNARRINIHD*
Ga0073916_1013146 | Ga0073916_10131462 | F000166 | MPNIPTPEHAELFAQSVRKWQQVLSLGDWRIEKGIKPAKAAMASVEFTPNARLAVYRLGDFGAEKITPESIDMTCLHELLHVFLHDLMTVAQDPKSSQDEIEMQEHRVINLLEKLLSKDSNG*
Ga0073916_1013255 | Ga0073916_10132552 | F008161 | MESSQKDALIGTITWWGFHALLMWMGAWHFLSRETADGSILASDLFAGGLIGATVLSLLRLSVGCHLLRFIALIGILLVVFHVWQGKAAWGSLTDVAIWAYTFYSFGPSSWRYSLYSPLGVFGKVKA*
Ga0073916_1014157 | Ga0073916_10141571 | F033033 | MKAILEFNLPEDQKEFEIANQSADMHAVICHLEERLRSYRKHGNDFENVNEALDAIHTVLYDELNARRINIHD*
Ga0073916_1014656 | Ga0073916_10146561 | F003690 | MTISGFELVVSNEYGIEFDSFLGAIYLPWHTIILTALAVVAYKVYKRKRAK*
Ga0073916_1015439 | Ga0073916_10154392 | F004723 | MKKGDKFIHTDIVGRKYEVTYTGTRREVKGCEFEFFVDDKGDSCFFTDTEVKKMKKI*
Ga0073916_1015439 | Ga0073916_10154393 | F015867 | MKYKVNYYDYDSDSYLEKDLYWMASYRGSFDFVPVDDGPKENWKGFHIDGSGVNVFYNDKGMNITVDGFQLMKSGGYGKTTTFIQSAI*
Ga0073916_1016501 | Ga0073916_10165012 | F001190 | MTSTPTGQSTSDDYFDINKFEELLARLESSKGRQQRQKSLEGRRDIFAGGLASMMGNF*
Ga0073916_1016852 | Ga0073916_10168521 | F003537 | LLKIEIQFLLNKLSLLWPIDSKLGVWVAYIKRQLEIATQMSVIKVNVTVAKNRNSVSA*
Ga0073916_1017477 | Ga0073916_10174773 | F007919 | MNKVKFTHITRTICPKTRIHYLDGIDEYGNHWMAQMEHTTEKWLCFSEIWYLDAQQPVQL
Ga0073916_1018906 | Ga0073916_10189061 | F083917 | MIKVRIKKLPQAKTGYQVQGALVNDVSAMGGADYNAYIGKPKLIASKYITAVPRDEANLEAEGGETVYGDIN
Ga0073916_1018942 | Ga0073916_10189422 | F003784 | MYDDDRERVGHNVTDFAAYGLSHLAAGQADTVDTHTPTTNPLVDHVAVSSYRGQGVTTEDLTSFIESFASLRAHRVKGVGHEQYSHAKGQKFESFTTSDTIRELIEELADASNYIDFLAIKLLNIQ
Ga0073916_1019062 | Ga0073916_10190621 | F003111 | KNRQTLSLPEFWREAQVEALNYAKARGLGEVPLSYVIVKRRNASIDQAWVIQDLAQWLKEKQ*
Ga0073916_1019062 | Ga0073916_10190624 | F040531 | MPAQDWSRVRKAGRYKGAIDANTIPIGPIVQHFGG
Ga0073916_1019425 | Ga0073916_10194252 | F003299 | MCQLKTMTMKIYKVVFKTFDYWNGPVKLVTRIIEAYDADHVKQLIQKNDDLIMLIEEI*
Ga0073916_1021663 | Ga0073916_10216632 | F051707 | MNLQENIQRIRSMMNLSESDKNKIWLLRRLETQDIKEYLNEQIDSLTDQISPCGYSTSEDYKDVIFEVASSNFINHFSDELYVKHEMSEIEEIIWYLLTKKYGNYLINKFDGRICDDEY*
Ga0073916_1022354 | Ga0073916_10223541 | F064517 | MARTTTENIYYGRSQQITADGCFAISFFRPSTSNPVNVAGIPLEAGQTLSIKQNVGDEDWSTYEIVFGTGTATN
Ga0073916_1022869 | Ga0073916_10228691 | F082169 | MKFNLKTILAWGGAFVAYRLYKLYELGENVIYKPVGVSFTRGKSINDFVVRVKMELLNPTKTTLQMRGIDGKLKIKDQVIGTFASAPFTIKAGISYFFLDFKVLPNTTGVQIITAIIAKKVPVFIVEMNKRLPYFSITEVFAINP
Ga0073916_1024675 | Ga0073916_10246752 | F025096 | QWVVQAWVKTEAATLYAKSINLEGIIEHVTWDKDGAPSPGVQHFSGDVRTIPFAWMNAKERLLITEPVSIHFYTLLKEESSAPTPMDHYLGVVTTESIGKGSVITWRVYFDTTGWSPMASIMSSQLKRLLEKGIQSWIDEYGGALIPVEVKE*
Ga0073916_1026150 | Ga0073916_10261501 | F006111 | MISYQSIVDKITTFYDNHLQVKKVGSDFKEQMVNFATADEKYPLVYVVPTGVTPYENVSIFNLELYCFDIIQMDRANITTILSDTQQILQDLYLEFTFSDDYDFDIDGQPTFIPLNNDLLDYAAGWQMNLSVVIPSWTNCQIPEQNA*
Ga0073916_1027661 | Ga0073916_10276612 | F052965 | MDKPGLIEVAFHDWWLGSYKRPPTAQATMTHVAFAKHMITMLELTQDVQTP*
Ga0073916_1027902 | Ga0073916_10279021 | F075465 | ESEKTQYSNDWRSYRERTTQLTKHRGQAFSLILGQCTQLLQDKMKQDTDWNVVSTSYDPLSLYRLIERTILAQTDDQYPFATIYDQELAFYSFRQESLSNPQWYERFNTKVDVGEAIGVTRQHRVLLDYVAMELHTQAFATLGAAEQEAVRTDAEERYLSYAFLRQSGIQHGNLKTDLQ
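
The rows above can be exported to FASTA for use with downstream sequence tools. The following is a minimal sketch, not part of NMPFamsDB itself: the function name `to_fasta` and the header layout are hypothetical, and it assumes each row has already been split into its four fields (scaffold ID, protein ID, family, sequence). It also assumes that a trailing `*` marks the end of the translated protein and can be dropped; any `*` inside a sequence is left untouched.

```python
from typing import Iterable, Tuple

# One table row: (scaffold_id, protein_id, family, sequence).
Record = Tuple[str, str, str, str]


def to_fasta(records: Iterable[Record], width: int = 60) -> str:
    """Render rows as FASTA text (hypothetical helper, not an NMPFamsDB API)."""
    entries = []
    for scaffold_id, protein_id, family, sequence in records:
        # Assumption: trailing '*' characters are terminators; strip only those.
        seq = sequence.rstrip("*")
        # Wrap the sequence into fixed-width lines.
        lines = [seq[i:i + width] for i in range(0, len(seq), width)]
        header = f">{protein_id} scaffold={scaffold_id} family={family}"
        entries.append("\n".join([header, *lines]))
    # One trailing newline per entry; empty input yields an empty string.
    return "".join(entry + "\n" for entry in entries)


if __name__ == "__main__":
    rows = [
        ("Ga0073916_1019062", "Ga0073916_10190624", "F040531",
         "MPAQDWSRVRKAGRYKGAIDANTIPIGPIVQHFGG"),
        ("Ga0073916_1009872", "Ga0073916_10098721", "F052464",
         "MDTQVQPKEIDDWLKQSVALVAGSIAGLGTNQRKVRYFAFEGQNDEVTK*"),
    ]
    print(to_fasta(rows), end="")
```

Keeping the scaffold and family in the header line preserves the link back to the Associated Scaffolds and Associated Families tables.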

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice