NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006007

3300006007: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T3_23-Sept-14



Overview

Basic Information
IMG/M Taxon OID: 3300006007
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0115666 | Ga0073917
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T3_23-Sept-14
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 130273757
Sequencing Scaffolds: 35
Novel Protein Genes: 40
Associated Families: 39

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Predicted Viral | 2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 4
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 2
Not Available | 18
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 2
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Nitrospirae | 2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 1
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → Prevotella disiens | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, Usa
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome, microcosm, sand
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372 | Long. (°) -119.272 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000166 | Metagenome / Metatranscriptome | 1810 | Y
F000331 | Metagenome / Metatranscriptome | 1285 | Y
F001097 | Metagenome / Metatranscriptome | 780 | Y
F001808 | Metagenome / Metatranscriptome | 631 | Y
F002487 | Metagenome / Metatranscriptome | 554 | Y
F003299 | Metagenome / Metatranscriptome | 495 | N
F003806 | Metagenome / Metatranscriptome | 467 | Y
F007169 | Metagenome / Metatranscriptome | 356 | N
F008361 | Metagenome / Metatranscriptome | 334 | Y
F008688 | Metagenome / Metatranscriptome | 329 | N
F009134 | Metagenome | 322 | Y
F009204 | Metagenome | 321 | Y
F009682 | Metagenome / Metatranscriptome | 314 | Y
F010688 | Metagenome / Metatranscriptome | 300 | Y
F010915 | Metagenome / Metatranscriptome | 297 | Y
F011136 | Metagenome / Metatranscriptome | 294 | Y
F016803 | Metagenome / Metatranscriptome | 244 | Y
F020140 | Metagenome / Metatranscriptome | 225 | Y
F021301 | Metagenome / Metatranscriptome | 219 | N
F021761 | Metagenome / Metatranscriptome | 217 | Y
F023108 | Metagenome | 211 | Y
F026423 | Metagenome / Metatranscriptome | 198 | Y
F030691 | Metagenome / Metatranscriptome | 184 | Y
F031025 | Metagenome / Metatranscriptome | 183 | N
F032259 | Metagenome / Metatranscriptome | 180 | Y
F033034 | Metagenome / Metatranscriptome | 178 | Y
F033776 | Metagenome | 176 | Y
F041765 | Metagenome / Metatranscriptome | 159 | Y
F043233 | Metagenome / Metatranscriptome | 156 | N
F044447 | Metagenome / Metatranscriptome | 154 | N
F050158 | Metagenome / Metatranscriptome | 145 | Y
F054024 | Metagenome | 140 | N
F055558 | Metagenome | 138 | N
F055721 | Metagenome / Metatranscriptome | 138 | Y
F056186 | Metagenome | 138 | N
F057371 | Metagenome / Metatranscriptome | 136 | Y
F072094 | Metagenome / Metatranscriptome | 121 | N
F080037 | Metagenome / Metatranscriptome | 115 | Y
F097186 | Metagenome / Metatranscriptome | 104 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0073917_1001287 | All Organisms → Viruses → Predicted Viral | 2901
Ga0073917_1003659 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 1605
Ga0073917_1007549 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1100
Ga0073917_1007753 | Not Available | 1084
Ga0073917_1008344 | All Organisms → Viruses → Predicted Viral | 1042
Ga0073917_1008914 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1006
Ga0073917_1009577 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 968
Ga0073917_1010616 | Not Available | 916
Ga0073917_1011431 | Not Available | 883
Ga0073917_1012601 | Not Available | 839
Ga0073917_1012904 | All Organisms → cellular organisms → Bacteria | 829
Ga0073917_1013425 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 812
Ga0073917_1013591 | Not Available | 807
Ga0073917_1015133 | Not Available | 762
Ga0073917_1016059 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 738
Ga0073917_1016638 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 725
Ga0073917_1017387 | Not Available | 708
Ga0073917_1017604 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → unclassified Myoviridae → Synechococcus phage S-CBM2 | 704
Ga0073917_1019299 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 670
Ga0073917_1020109 | Not Available | 657
Ga0073917_1021286 | Not Available | 638
Ga0073917_1021705 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Methylophilaceae → unclassified Methylophilaceae → Methylophilaceae bacterium | 632
Ga0073917_1022321 | Not Available | 623
Ga0073917_1022549 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 620
Ga0073917_1025072 | Not Available | 586
Ga0073917_1025368 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 583
Ga0073917_1025998 | Not Available | 576
Ga0073917_1026987 | Not Available | 565
Ga0073917_1027229 | Not Available | 563
Ga0073917_1028413 | Not Available | 551
Ga0073917_1029961 | Not Available | 536
Ga0073917_1030960 | Not Available | 528
Ga0073917_1031661 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → Prevotella disiens | 522
Ga0073917_1031729 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage | 521
Ga0073917_1033666 | Not Available | 506

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0073917_1001287 | Ga0073917_10012872 | F021301 | MNWNNITIHQLQEIHSCRDMSDLERQMNILAIALNLSMDEVESMTLDKLTSEFEKLSFLNDLPKAPIQFMFKLRGRYFKLAKTPNEMCGHHFIELQQVFNGDVIESLNKIVALLSVEVDFFGRNKKVVDAQAHYEDKCGLMMGLPVPLPYTYALFFLEVYPELLKNILCSLKEEMKDMTEQLTNPQ*
Ga0073917_1001287 | Ga0073917_10012874 | F031025 | VALSITQQPDSYHPAFNDTNFVITESSGGIYTSSNFKFIANVKVAATSVAKLKAPIYFGSVNKGVFNIGRIMESYVSNNWSFTDTSPSGCVDSFSDYEVEFGYEYSPSATGTITEYLDLTSATGTVWNAALNPFDLVTYAQAQYLATSSSAKFLTNVRTRYIHRTQKDWLYALKGDATSVVITYSDASTQTFTLPSSKVVRIPVGSQLTIPGAATYFDVVLKLGGTAKSETYRINIKDECSKYETTDIFFMNRLGGFDSFRFNMVRRDTFEVARKQFQSNPYTLGATYGYATSVRTRSNYHTTASQKVKLTSNWIDDTESVWLKDLIESPVVYMYDGTLYAVNIDNANYEQKKGVQDKLFNLELDITLSFADKSQRL*
Ga0073917_1003659 | Ga0073917_10036591 | F072094 | MKLLILTDGINGVVYHRLFTPHLRMQIDGQADVSVCQSIEEWLTLDYTQFDVIIFSRWLGAKHYDVLKKIADSGTPYVVDIDDYWILPKYNPAYW
Ga0073917_1007549 | Ga0073917_10075492 | F009682 | LTKIERDNNIQTMIKDLYMKNSNEKILVQSDDLKYDGKNIIIPSYYASIVYDYLDNVNIEDMNLNDADMHDYLAFCSFFEDIIDHKADKGGN*
Ga0073917_1007753 | Ga0073917_10077532 | F097186 | MSDLIVGEQDPRGVDWVSITLGACLGATITFIVGGQNGEAVGKSIERATVQCIDELGVRNSDSTSACWDMQAEALNRMELQRQAIDRLAERCVLSVPEADRLRVTRVKDVKPFKVVIPAPSSSAGSADWDAEDFGPPGDPDGKP*
Ga0073917_1008344 | Ga0073917_10083442 | F007169 | MSYQLQFDFETLEQKEKRLKDWHDQQVKLNKMFEGKANDYYIYNKHVDQFIDFLPYRLGWGLKGNYNELRWWIKCQYQKFRYGVSDDEVYSLETNIAKYMVPRLQYFKKKGKMGIPMKFLPSNYDNLQDEDREKAEKIGEKEINRILDEMIFAFDYIIDPDKYVTFPKSCSWDIKDKNYFNREKNLEAKQAWDEYTKTCEQLDARKKQGLQFFVDHMDMLWI*
Ga0073917_1008914 | Ga0073917_10089141 | F020140 | FDSPYSLHFMTHKFAVIVDKKFYNQIQNNSDFYLRVLVLDVIRALVPNEESADKYSSDIFYKCAKSISKEVKMLEDEKTVIFYLELSSGYLDDFFEKPLDDFE*
Ga0073917_1009577 | Ga0073917_10095772 | F001808 | SCSAQYHLNKAIKKGYTCEETGDTIRITTLDSIPVIIHDSIVWEKFITTKDTIIKYNTVYVPKTRLEKRIEYKLKVKTIYKDRIVQKAQAKATRPKTRGNLNLLFVGVGIGLLLSYLFKFARDKYLF*
Ga0073917_1010616 | Ga0073917_10106162 | F021761 | MISNLWVNRITALVVLAAIYAAGYAGGRDATVQAHHNHPACHTNLKP*
Ga0073917_1011431 | Ga0073917_10114312 | F008361 | LVAVELSEATKELEKATSAIENAETSKARLDASVELKKATARLEGINYLS*
Ga0073917_1012601 | Ga0073917_10126013 | F010915 | MIVIVQVGLDLYTLSYVKYEECDIHQCIKYDLSSEDVSQFLGNN*
Ga0073917_1012601 | Ga0073917_10126014 | F033034 | MKVAKLTKKAQDIVNQIMSADAVDIDHGCIGERIAYGNMHLSGEGLECNVNVDGDIGEMIISNESLNAAVISEGTIEIDSEHIKDNPYGDMTITLYKLSKINAL*
Ga0073917_1012904 | Ga0073917_10129043 | F001097 | MNNFEWPTNDSSRIKPLQGLRSERVDTQVQPKEIDDWLKQSVALVAGSVRGLGTNQRKVRYFAFEGQNDEATK*
Ga0073917_1013425 | Ga0073917_10134252 | F026423 | LVTVGCTAPMKQPTTVGPYCNISWDKTNNSKVAWYQLTVIDQSKQAKIVRFIPADTTTVSCRDVGANHDGIWEVTVQSCYDKSTCGLPTEAARMQITTK*
Ga0073917_1013591 | Ga0073917_10135911 | F041765 | AEFFNKEVDKIIYKVTTTKSTKQRIKYIKQMIALKNRLSLEVKMLEDLDNF*
Ga0073917_1015133 | Ga0073917_10151331 | F030691 | TQTSGSATSGEIAFDVFVKNISSTSDAARLSLAQQLKDAGLWTGKISSKFNIKYYTALAKLEEKYQGQITVDQIVGATVSAKRFDVLADLVEGGDGEDGPKTTKQTYVTSASQTAKLLNAVAVDLLERDLTKAEQAKYLKMINAEQRKQPSVQTSGKGFTTTLGGVDEEQFIKEKLQSTSEAKNVRATDAYTVLMKEFGGLR*
Ga0073917_1016059 | Ga0073917_10160592 | F002487 | MITRQDAIKDLSHGDYCCYCTEPKTSGSCCGENHFVPFEDLYEEDKEAMIEEYLSEGNSNGT*
Ga0073917_1016638 | Ga0073917_10166382 | F032259 | MLAEPVRARVYAFDKDKKLVGPSKVVLPAGWYVLPKN*
Ga0073917_1017387 | Ga0073917_10173871 | F044447 | MKLQDLTIDQFQRIGAIEFSSVLGDYDKRAGVVAIVEGVDISIVREMPAKSVLKRYKVIISEWNALP
Ga0073917_1017604 | Ga0073917_10176042 | F055721 | MNREEYYKYIEENDTYPEHSHTWIVRTYTGDKLFYRNFGTFETKEEAKEFIENYKVKYTTKGFITRYSIQGLCEVL*
Ga0073917_1019299 | Ga0073917_10192992 | F000331 | ERVHQVIATMLRTAEIDMANSVAPSDIDTFLTNASWAIRSTYHTVLKASPGAAIFGRDMLFDIPYIADWSKIGDYRQRQTDLNTARENKSRADYDYKVGDKVLIRKDGILRKSESRYDSEPWTITSVHTNGTIRVERGTKSERINIRRVTPYFEN*
Ga0073917_1020109 | Ga0073917_10201091 | F009134 | VCIEKKTESLESLSILVRNRLAADAASTLERIDSYSLDGIKDESVRETILGSVAKRSALVFGWSEQGEQASVSINLLGSMPDRSIEVSVTNEAETK*
Ga0073917_1021286 | Ga0073917_10212862 | F080037 | HDCMTQAMAYAWAIRDNAQDDGVPIPVELVASFQDDYNNIIAALNEAHNLAS*
Ga0073917_1021705 | Ga0073917_10217051 | F023108 | MMRYLAIILLLSSCSAQYHLNKAIKKGYKCEQTGDTIRITTLDSIPVIINDTIVWEKIINTKDTIIKYNTVYVPKTRLDKRIEY
Ga0073917_1022321 | Ga0073917_10223211 | F008688 | FITVTVEPPKTKVYLIMSYRGNDLSIEKVYLKKENAQKYCDMYKDSHNYSVEERELTE*
Ga0073917_1022549 | Ga0073917_10225491 | F056186 | MTKEQAAALREKWEEGENPPCRHLHLELEHNNDDYLTDNYHCTACGELVAANTRDPFQVI
Ga0073917_1025072 | Ga0073917_10250722 | F043233 | AGIKDAVGIAKIRLKAESASDLEKKVAKYESELAQLRKATTPASGQPSAPARQKQFHELSSNEQEKELLRMAAEADRMGV*
Ga0073917_1025368 | Ga0073917_10253681 | F003299 | PVKLVTRIIEAYDADHVKQLIQKNDDLILLIEEV*
Ga0073917_1025998 | Ga0073917_10259981 | F009204 | NNYPRRPYNTKTRKSIQERIKMKTTIKYYTQNIYGVRREKFIDKKQESVFFQLTGRRTLDSVSRELIRDLSGSSIEFEQSLPPE*
Ga0073917_1026987 | Ga0073917_10269872 | F054024 | MSDTFSNHFYIEGPYEDLLNVTKDLDFTDGSIDYDGWEIEGGSAVLHFDGYYCPLDELEKASAKYPSLKIIFRFTQELLIAGLLIYEDGKIKLQSYYNWDTGTSSVTTAQE*
Ga0073917_1027229 | Ga0073917_10272291 | F080037 | MAYAHAIRDNAQDDGVPIPMELVVSFQDDYNNILTALNEAHNLAS*
Ga0073917_1027229 | Ga0073917_10272292 | F057371 | MQEIKVRFEPADLTDLDHQAAAAGTSRSAFIRNKALSLPVARLNTVEYHALVADAVSAMRGDLPRLQVEYLVAYVITRLDQHSRQAVAGHQPAT*
Ga0073917_1028413 | Ga0073917_10284132 | F016803 | MISLISRVRAAWAFGRHQCWVDALPWNRDDATTLNNFFKSETGKKFKDALLNTVLMQNASAITDKNHLQYSSGFAMGQASLVKVIEMMADRESITGQEDDPDSVTNT*
Ga0073917_1029721 | Ga0073917_10297211 | F050158 | MAQETVSIAWCDNGMVDGKFMQGVTDVMLKSGINFTTTLRSQ
Ga0073917_1029961 | Ga0073917_10299611 | F033776 | ARSKGLNFTVNIRLSREEIEAARRLGDGNISMGVRWCIRYANGREMKPIKLSTMLRSAAVLAAQLEAA*
Ga0073917_1030960 | Ga0073917_10309601 | F055558 | DGFNLQSAGKLMTYNFALIVMDRVFESESNTIEVLSDTAQIMSDIFALVETNTESDGDFELSINGNASPFYDSKTDILAGYAINFQVLTPYLSNSCVVPI*
Ga0073917_1031661 | Ga0073917_10316611 | F011136 | SRLAHDKQKGSDLLHEVLARLMDRPQQDIEDIVCRGKVEAYVNRALWLSWHSNRSDYAIKYRKYYELHVEKQVDDSKQDETWIGAFIDGEYLYNAIGRLNEFDAILLRLYSKPDFDYKELSAETGIPYSYLRTSIHRALKRIREYVKLQRSLSHTARETEYLQKM*
Ga0073917_1031729 | Ga0073917_10317291 | F003806 | MSNEIEIPLKLSGVQSLKAELRSLKAAIAEASDPEQMAALAAQAGKVADRIKDANDAVNVFASGSKFEQIKNSFGGIQDSLMSLDFEEASDKAKVFAKNL
Ga0073917_1033666 | Ga0073917_10336661 | F010688 | TLAPLPILLGTWCIVRRPLPRCNMRPKTATIMVIAVGPKGHRREIGGAPSHSACGCDEADNNAPMIAIPVEALSTDTEDGQQASPEVGDEVVLQEVRGVLKKLENGEAYVEIKSVNGMPAEYEKAGKESMEPMDEEGMRNMVSEYDSEMES*
Ga0073917_1033968 | Ga0073917_10339682 | F000166 | MPNIPTPEQSQLFAQSVRKWQQVLSLGDWRIEKGSKAAKAAMASVEFNASARLATYRLGDFGAERITPESLDQTALHELLHVFLHDLMTVAQDPKSSQDEIEMQEHRVINLLEKLLSKDSNGRT*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice