NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027091

3300027091: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T2_30-Apr-14 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027091
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114663 | Gp0115665 | Ga0209873
Sample Name: Groundwater microbial communities from the Columbia River, Washington, USA, for microbe roles in carbon and contaminant biogeochemistry - GW-RW metaG T2_30-Apr-14 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 138784077
Sequencing Scaffolds: 23
Novel Protein Genes: 32
Associated Families: 31

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Eukaryota → Opisthokonta | 2
Not Available | 9
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica | 2
All Organisms → cellular organisms → Eukaryota | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Sarcoptiformes → Oribatida → Brachypylina → Oppioidea → Oppiidae → Medioppia → Medioppia subpectinata | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Chitinophagia → Chitinophagales → Chitinophagaceae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Synechococcus phage S-LBS1 | 1
All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Postciliodesmatophora → Heterotrichea → Heterotrichida → Stentoridae → Stentor → Stentor coeruleus | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From The Columbia River, Washington, USA
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Sand → Unclassified → Sand → Groundwater Microbial Communities From The Columbia River, Washington, USA

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater river biome, microcosm, sand
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: USA: Columbia River, Washington
Coordinates: Lat. (°) 46.372 | Long. (°) -119.272 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000212 | Metagenome / Metatranscriptome | 1580 | Y
F000331 | Metagenome / Metatranscriptome | 1285 | Y
F001799 | Metagenome / Metatranscriptome | 632 | Y
F005176 | Metagenome / Metatranscriptome | 409 | Y
F005672 | Metagenome | 393 | Y
F007402 | Metagenome / Metatranscriptome | 352 | Y
F015696 | Metagenome / Metatranscriptome | 252 | Y
F020000 | Metagenome / Metatranscriptome | 226 | Y
F033776 | Metagenome | 176 | Y
F034156 | Metagenome / Metatranscriptome | 175 | Y
F034611 | Metagenome / Metatranscriptome | 174 | Y
F038918 | Metagenome | 165 | N
F044498 | Metagenome | 154 | N
F046195 | Metagenome | 151 | Y
F050934 | Metagenome | 144 | N
F051728 | Metagenome / Metatranscriptome | 143 | Y
F053809 | Metagenome / Metatranscriptome | 140 | N
F061552 | Metagenome / Metatranscriptome | 131 | N
F062748 | Metagenome / Metatranscriptome | 130 | N
F065810 | Metagenome / Metatranscriptome | 127 | Y
F067428 | Metagenome / Metatranscriptome | 125 | N
F067432 | Metagenome / Metatranscriptome | 125 | Y
F082695 | Metagenome / Metatranscriptome | 113 | N
F085203 | Metagenome / Metatranscriptome | 111 | Y
F088294 | Metagenome | 109 | N
F091422 | Metagenome / Metatranscriptome | 107 | Y
F092936 | Metagenome / Metatranscriptome | 107 | N
F096670 | Metagenome | 104 | Y
F100053 | Metagenome / Metatranscriptome | 103 | N
F100499 | Metagenome | 102 | Y
F104469 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0209873_1000444 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 6376
Ga0209873_1000778 | Not Available | 4580
Ga0209873_1000825 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica | 4436
Ga0209873_1001310 | All Organisms → cellular organisms → Eukaryota | 3327
Ga0209873_1001327 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Chelicerata → Arachnida → Acari → Acariformes → Sarcoptiformes → Oribatida → Brachypylina → Oppioidea → Oppiidae → Medioppia → Medioppia subpectinata | 3304
Ga0209873_1001432 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 3155
Ga0209873_1005927 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira oceanica | 1411
Ga0209873_1009370 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 1137
Ga0209873_1016334 | Not Available | 871
Ga0209873_1016338 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 871
Ga0209873_1017298 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Chitinophagia → Chitinophagales → Chitinophagaceae | 849
Ga0209873_1017322 | Not Available | 848
Ga0209873_1019079 | Not Available | 809
Ga0209873_1024036 | Not Available | 725
Ga0209873_1025113 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Synechococcus phage S-LBS1 | 709
Ga0209873_1027807 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta | 676
Ga0209873_1028215 | Not Available | 671
Ga0209873_1030919 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 642
Ga0209873_1036092 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Postciliodesmatophora → Heterotrichea → Heterotrichida → Stentoridae → Stentor → Stentor coeruleus | 596
Ga0209873_1036468 | Not Available | 593
Ga0209873_1037451 | Not Available | 585
Ga0209873_1039184 | All Organisms → cellular organisms → Eukaryota → Sar → Stramenopiles → Ochrophyta → Bacillariophyta → Coscinodiscophyceae → Thalassiosirophycidae → Thalassiosirales → Thalassiosiraceae → Thalassiosira → Thalassiosira pseudonana | 572
Ga0209873_1041385 | Not Available | 557

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0209873_1000444 | Ga0209873_10004442 | F082695 | LFFVLRSFESETGDETDVVKWAHKLMNKSVANRTITKQEAMCELGQLPMVICSEGIETVSITGSTRCSNETSTILSQYKKRPDTQEHLSLHEFFHTKKNKRLPATRSQREIVPHYVGGSGQPVYPLTKQSISYARSEILKHMPWSQKNPLPNDCDWVTLFQDFLNDARCPHSVTIGYERAKLRHELRKKGIEEACQPDTEHSNPTDDLDDDEIGDVIAISESMGYTTDEMDNLEKSGFFIGKDYEWWKRIHTVSCSFMIEMKSSPNEIWIFQNF
Ga0209873_1000778 | Ga0209873_10007781 | F053809 | MMVTMLRQEIYTHYLFHKSNQYNSIMRPYTYTPKSHIRPYIRNTLLLINDQLPLLHLHVNYLSQLQLINRRRLHDFDVLSGKRHTRGAGAIRVAEIGAAYAEDRIRAHLGALIVDNNDLLAEWFRLKTYTDGHTPLICSNRPALQMNTMPFDTDGVCTIQGDGIPHPADMRSTIQIIERILLPHAQSAARCCTITDYRNHNAAYMCVREFVRETKRSLDDVRQRLYFLQYDEIDLWRSVQNFEYNIEDSSTIIRELFANTQPTQPFPQVVTPTTNDVAHDARKSPNYLTR
Ga0209873_1000778 | Ga0209873_10007784 | F092936 | LVIIAAIIIQSHIPLAPGALFTVENPAIVQKWMANGTLPLIFTAQSIKSVLHESELYEIPPVPQSKSQHGFILSGVDITLLNMQVTNINCGGAMCDGLNMYQNSVTADRCPCYNVLDREGKVCLVLSLKVSDPKNNLQFCVHNHTSKSLTQLFMKRIPKGAVAATITGNQKHMGNLSAKVMDMLALGNEYNGFNISGWIKRGTIADSGVVQPPTGSKWDKPQQVDSGGLTYHLTKIAYATPPSDRILGEYQFNAGSLV
Ga0209873_1000825 | Ga0209873_10008251 | F061552 | KGFGNAGCSAEHDIRLITTRHLNLSSSSSSDTATCRYIATVKALTKMKDQEDMKRGVHITGTITMEETHYHSKCIHLETTLVNEKIMGFDCPILDISDSKMGRVGLAILVDMETTMVGNRIKKLVRKNMKCWTQPSIYAWPIVNSGLKIIDGVGLLILNDLAKSVYNFITTVQSEGNGTGSANRS
Ga0209873_1001310 | Ga0209873_10013102 | F082695 | LFVLLSYESQTGDESDVARWAHKLMNRSVANRTITKPEAMCELGQLPMVICSESIETVSITGQTRCSIDTTTSTILSQYKNRPNTQERLSLHEFYHAKRNNHSMATAANHREFVPHYVGGRGQPVYPVTDTKQSISYARSEILKHMPWSQKNPMPNECDWVAIFKEFLQDPSCPAGVKLGFERAKLRYELRRKGIQEVFQPDTEHSNATDDLDDDEIGDVIALTESLGYTEDELDKMEENGFCIGRDYDWGRRVYTVSNICII
Ga0209873_1001327 | Ga0209873_10013272 | F044498 | MPRQFNKRKSSTPNAINCTNIECHCVPPIGFQNMQTDLNKFSYVANNERVLLLSRYSTNCSLTELLELQSPMAEIQSETVSILSFLNGAIQRPLSYEQHIMNQKLTCAKKKFEKSCLLAYREFCDMELEHNLLVMQCCNDEMVLCSKAYDHHCYPCCKPSNEKHSTSDFNDKKTCETNIVFDGLQDEGIMAPDYYPDGLQYIFPTPSIESCLDSVNIECVRLLLSLRHCHKHI
Ga0209873_1001327 | Ga0209873_10013273 | F038918 | MDEMDDYYRNKISLCPKYIEWTRLSDNSEMFYCKTTYRKPDDETKLLAHIKRTIVTNDRAAQKRKKPKSESLTYDELLEKVRNNDVYKEFMKLEPGKQKFYDCKNYEKGNADDEIKLMKRISNRMNCNKRRTSGVDVEIGKRGTVVADSVGVDVEIQTKDAAVALLSLNNPRGGDATSPLNNVVIKEGGGAEAEEEDGGNGNNLGWDKEETTLTDGAIDPGGDVGAD
Ga0209873_1001432 | Ga0209873_10014323 | F046195 | MKVSRRWIEWAHGPGAEDGQPLATSLCDLASAPGAGSIEVADPALIAELVDVAGLYVNPCRGDALHDMGPWWMGQPRRIISEGRASLRAADSAAKSPAVDGQPE
Ga0209873_1005927 | Ga0209873_10059271 | F050934 | MPPEHTLIIEGDITSSRSNTTRQRIDRHLRHRIITTCGDANVMMGSKHIDPALCIYIGAYLICIDNKHLTDKVPRGNGTLCRVLGMKLNENAQSYKCKNYYGKKVWTVNAADVEWVECEHVNKTSFLTQLESQIKELKCQLDLTPNDHKIERKKIKSKLDDLNNKLAKEMTGRKFKLEPEQFSPEVTVKHYQTSSKKVAFRCKMKQIPANSNDATTGHKLQGMSKDAIIVSSWPTGGLAAMFKNWEYVVLSRVRTLSGLYLVKPIDMDKSFQPSPQLASYMDKIRKFEKDMLEKRKQAISKTFS
Ga0209873_1008295 | Ga0209873_10082951 | F088294 | RARHKDLYDEVKGGIMYCPECKQTVSTIDIVNKALQRWKDCLIPGDRAQYNRPDTIIPLSKERLDMAAYTFAYHMNGGCAIENDPFWGDKNVRETLLRYRFEEHSFSHSASCFKKDCECRFLFPFMSTDYTYIHEDKGDKDQNKTSWYFLDGSINTVYPFMVLPKRPMGCQFINAHNKTISDVFNFNTNVQIGDASQVFYSTLYTSKSTQDEDSEKQIRIGRAVIKRIKRVMNDNQEGEPSFGEGLSRVLSGLNAATTRNVISATMAHLIPCNDGSRFVYSHTFSDLLVGQMEATLEGHDISVRIRSNKFENKIITWPDSLADDYIHRPIEHELDQICFYEMTRCYKKGFNAFRNIQGGVKKYKFKETHPGHKFSHLIQLRFPSIPRIALPKGKLCLLDEL
Ga0209873_1009370 | Ga0209873_10093701 | F000331 | HFEHLCDSYGIKRKPTTVRNPQANAILERVHQVIGQMLRTAEIDMAKSVVPDDVDVFIDNAAWAIRSTYHTVLKASPGAAIFGRDMLFDIPFLADWNKIGDYRQSQTDRSAERENSKRIDYDYKVGDKVLIRKDGILRKAESIWKKEPWTITTVHTNGTIRIQCGTKSERINIRRVTPFSEELLI
Ga0209873_1009370 | Ga0209873_10093702 | F007402 | MVPPHHAGKFLPKTKGFFPHDFTLIASVVGESVIGHGIVGKDIPIMARLAEARRWSKR
Ga0209873_1016334 | Ga0209873_10163344 | F020000 | AVAITAVDRFTNALEYTGATMSAVDDWMQVDCHSSSFTFAANVTSSANFTLALEASFNGNGNWFTVDTSKTINSAGQYVYFYDGKPCTKIRMRIASISSGTVTLAPHIVVAYHG
Ga0209873_1016338 | Ga0209873_10163382 | F100499 | NGKQAGKVKGFRVTAETEEKMTNEQRIQLIKGAIDHVDINCSQFSIKNSEIVTKSMVKQWAFKFDKRMIRHASDNEINSLPYGF
Ga0209873_1016748 | Ga0209873_10167481 | F000212 | MKLITKIIVMTFFLMNSTMSLGLRTETQCHPQCSWKCDDPHCPAICDPVCEPPKCHTSCAEPKNAICDVKCEKPECEIKCPDKGCEMFDCPKCVTVCKQPHCVTHCQAPKPECEAVCEEPRCDWKCHKPNCPKPKCELVCENPNCVPKVECCPCAMGAPRVAQPFPFFKETASNPNCCGCNKPLV
Ga0209873_1017298 | Ga0209873_10172982 | F034611 | MLQSKTVLSLTIDLLANNAFNHLKDEEISALHHLILKLQEPLTTIQQNLLLTFWNHAYTGDLPSALLHRCNTVLQQLGRSPMEEVIMEIEMY
Ga0209873_1017322 | Ga0209873_10173221 | F015696 | MKAANAMTKGAKGASPSKSDMPSPARAAAGTTLKKFSPSSPQKKKDDRNKPQIFNLMHPSGTCYGWAFYNFYHAKEYLKGLSNRLGMTTVIGGIEFRPFCNLTTKWAKEAEFNNCIWVIRIDLELDGDENCFPLTAHVAYGNKIARGVTAQNIWEKGEVEVRTVTLSHAEALDLDGHFTVVHDQAADDAFDAAIKEADAELEDLI
Ga0209873_1017781 | Ga0209873_10177811 | F104469 | SESPVESDYSCFTTSQQCVTSLMYLLDDMECPDYGFKSIMDWARQCFEAGFDFNPKCKTRLGNLKWMYDAIHNAEHMLPHLEPIELPEPLPNLKTMNVICYDFVPQLLSILQNKEMMSANNLVLDPNNPLAMYKPHDSRLGEALSGSVYRDMYHHLVSNPSKQLLCPLICYTDGTQVDSLSRFGVEPFLFTPAVLSHAARCKADAW
Ga0209873_1019079 | Ga0209873_10190792 | F033776 | MANANNLTKDRAARSKGLNFTVNIRLSREEIEAARRLGDGNISMGVRWCIRYANGREMKPIKLSTMLRSAAVLAAQLEAA
Ga0209873_1020277 | Ga0209873_10202771 | F100053 | AKVAQKIWGSSVPRLKGSTVRESGHRKPQSLVKVPRELLKLQQKVSIAIDIFFVNGHIFFMTYSRKICFTTVTHLVNRKVNGVWAAMHQIYQMYMLRGFHIVEIAGDGEFVWIADQVASLPTNPTLDLAAANEHVGLIERNIRFLKEKVRSLRHSLPFERIPALMLIRMVLHAVPFMNSFPRKGGLKHYPPSAIMTGAQLHMNHLRLKFGSYCQVAEDVTPRNSLAARTRAAISMGPSGNLSGGHRFLALDTGKIIVRNRW
Ga0209873_1020943 | Ga0209873_10209431 | F085203 | KNALKKWRKYVQDVKDKKLFDGSRSLKLNIVLERIQRRTIKEVAERLKGFIFAAPAIKAVIKKMDSLVKRKPKQAFDKWRKYVQAVNNNELLDGVRSQKLLVVLTRVPVRRMKDATERILGDGSKVKGAIKKIYATMQRMPKTALEKWRKYMQGLRDKSFFDNLRSAKLLNCLSRIPLRKTRDAAQRIIGGGSKIKGALNTLVNGLKNIPKNALRKIRQVIQDIKDKKLYDNARSFKLQNVLEKVQRRTLKEAHERV
Ga0209873_1024036 | Ga0209873_10240362 | F067428 | NSHTKTKLEAYDIDWFDLSATTNECLKHTNIKNLPVTQEGQLTFLLNKFETSDLKTAFADRIGKVGTPARTAFYPLPVRKDKVIRSFLELIYDGRSIIISESKDAFR
Ga0209873_1025113 | Ga0209873_10251133 | F034156 | VDPATLAAVLGLGGAGVSALWKIAGGLGRFEAKTTTILGAMQVMLEDHEDRLRAIERKR
Ga0209873_1027807 | Ga0209873_10278071 | F065810 | LQYPKHSEITSKIFQIEYHEHLSLTAKFILNSFHNKYIYYAIDDILYSLRSNPSERDNLLAVLYSPILSLQNDFLVNFFDIWIRDISLNEISKSNKFLKQEAENLEKVNLITIKLFYKTRVPIKKQESLW
Ga0209873_1028215 | Ga0209873_10282151 | F062748 | TIVQGRYPTLKYFKVGALKSLPGAKSQYVGHGGRLHSDYPQSVEELEPRFRPVSIIVGLNSFNFMWLNDRTSRESEIRQMTVYPGEMIMFTNHCLHAGGENSTNEEQTRLFAYLASDESHFPSGEVTTWDWQREDNNPLISKPSRKANSQFRCKLASYLGGGT
Ga0209873_1030919 | Ga0209873_10309191 | F091422 | EQMMASVRLTHAASDQDFEDVHALATKELGPGLASLAEIKRVDALTGASIWVIRRNSEVTGFLAPLALTAAGVAALVDNTFDAAHIDQKWVARVGEPLAGFYCWCYAGKDQVSRGALVLGLRTLIDRHFPDLPFFGRDSTDAGARIMRHLGFFPFDSTPHLFWRCASVMEMAA
Ga0209873_1036092 | Ga0209873_10360921 | F051728 | QRMPKNALEKWRKYLQGLKDKSFFDNLRSAKLLNCLSRIPTRRTRDAAQRILGGGNKIKGCLQNLVNGLKNIPRNALKRLRQVVQDIKDKKLFDNARSAKLQISLERLQRRTLKEAHERVRGLMFASPAVKAVIKRMDGLLKRKPKQAFDKWRKYVQAVNNKEILDGVKSQKLKALLDKAVRRTLRDATERILGDGSK
Ga0209873_1036468 | Ga0209873_10364682 | F001799 | MDKHLLAQWAKNLLNDDFFKDVIDNLKKEQISVIINTSAEESDRREDAYRHIKTIELITGHLEGLASETVIREKKWKIL
Ga0209873_1037451 | Ga0209873_10374512 | F005176 | VTRGNEDGSGAPLQPPPEQEDEKISLFKRISPFDASMELKFELDTRLTVSCVVAVASAYGLLTVSIALDSIPLAFGNAVILLLILFVLFGWRELLGEFKKRAGN
Ga0209873_1039184 | Ga0209873_10391841 | F096670 | MLWGQRITVYTDHKNLTRDGLGLTSDRVARWRILLEEYAPEIIYIKGIHNTVADAISRLDYDPKLNTTNEYNHATHVMSTKVESNQKWMMFSKFWSCYNETQDQDEINPIKMNQVFANRSEDEEIYPLTVKEIVEAQKADPILTH
Ga0209873_1040475 | Ga0209873_10404751 | F067432 | MFDCPKCVTVCKQPHCVTHCQAPKPECEAVCEEPKCDWKCHKPNCPKPKCELVCENPNCTPKVECCPCATTGAGATPAFPFFKETETNNRCCGCGKTGSGAPTEQGRIVSTNINKGWTNRRKQYELICLY
Ga0209873_1041385 | Ga0209873_10413851 | F005672 | MTYPTHRLDHVTATAADWGLCQPKEHGEDERKRPLTAVATDLNGRLKQCCKSADSPSGAALEMRRHLALFARYGLPNPKATQLVRDLTMKAFTAHKR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice