NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300003678

3300003678: Groundwater microbial communities from S. Glens Falls, New York, USA - water-only treatment rep 1 (Metagenome Metatranscriptome, Counting Only)



Overview

Basic Information
IMG/M Taxon OID: 3300003678
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0069861 | Gp0055781 | Ga0003067
Sample Name: Groundwater microbial communities from S. Glens Falls, New York, USA - water-only treatment rep 1 (Metagenome Metatranscriptome, Counting Only)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 15992773
Sequencing Scaffolds: 27
Novel Protein Genes: 29
Associated Families: 24

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 18
All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae | 4
All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae → unclassified Fiersviridae → Leviviridae sp. | 1
All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Timlovirales → Steitzviridae → Tehnicivirus → Tehnicivirus pelovicinum | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Atkinsviridae → Neratovirus → Neratovirus caenihabitans | 1
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Groundwater Microbial Communities From Coal-Tar-Waste-Contaminated Well In S. Glens Falls, New York, USA
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater → Groundwater Microbial Communities From Coal-Tar-Waste-Contaminated Well In S. Glens Falls, New York, USA

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater biome, anthropogenic contamination feature, contaminated water
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: S. Glens Falls, New York, USA
Coordinates: Lat. (°) 43.099444, Long. (°) -73.604444, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001233 | Metagenome / Metatranscriptome | 741 | Y
F005048 | Metagenome / Metatranscriptome | 413 | Y
F005443 | Metagenome / Metatranscriptome | 400 | Y
F007739 | Metagenome / Metatranscriptome | 345 | Y
F008500 | Metagenome / Metatranscriptome | 332 | Y
F010305 | Metagenome / Metatranscriptome | 305 | Y
F013430 | Metagenome / Metatranscriptome | 271 | Y
F014210 | Metagenome / Metatranscriptome | 265 | Y
F014988 | Metagenome / Metatranscriptome | 258 | Y
F016476 | Metagenome / Metatranscriptome | 247 | Y
F017604 | Metagenome / Metatranscriptome | 239 | Y
F022577 | Metagenome / Metatranscriptome | 213 | Y
F028190 | Metagenome / Metatranscriptome | 192 | Y
F029638 | Metagenome / Metatranscriptome | 187 | Y
F030638 | Metagenome / Metatranscriptome | 184 | Y
F060937 | Metagenome / Metatranscriptome | 132 | Y
F060956 | Metagenome / Metatranscriptome | 132 | Y
F061584 | Metagenome / Metatranscriptome | 131 | Y
F081396 | Metagenome / Metatranscriptome | 114 | Y
F082737 | Metagenome / Metatranscriptome | 113 | Y
F091449 | Metagenome / Metatranscriptome | 107 | N
F094695 | Metagenome / Metatranscriptome | 105 | Y
F100491 | Metatranscriptome | 102 | Y
F104572 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0003067J53075_100045 | Not Available | 816
Ga0003067J53075_100122 | Not Available | 697
Ga0003067J53075_100147 | Not Available | 661
Ga0003067J53075_100497 | Not Available | 586
Ga0003067J53075_101086 | Not Available | 576
Ga0003067J53075_101093 | Not Available | 639
Ga0003067J53075_101136 | Not Available | 766
Ga0003067J53075_101241 | Not Available | 737
Ga0003067J53075_102351 | Not Available | 533
Ga0003067J53075_102736 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae | 3323
Ga0003067J53075_103256 | Not Available | 917
Ga0003067J53075_103341 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae → unclassified Fiersviridae → Leviviridae sp. | 4393
Ga0003067J53075_103822 | Not Available | 746
Ga0003067J53075_104138 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Timlovirales → Steitzviridae → Tehnicivirus → Tehnicivirus pelovicinum | 4734
Ga0003067J53075_104509 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae | 4238
Ga0003067J53075_104816 | Not Available | 553
Ga0003067J53075_106132 | All Organisms → cellular organisms → Bacteria | 1514
Ga0003067J53075_106289 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae | 2756
Ga0003067J53075_109345 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Atkinsviridae → Neratovirus → Neratovirus caenihabitans | 1746
Ga0003067J53075_110098 | Not Available | 671
Ga0003067J53075_110197 | Not Available | 504
Ga0003067J53075_110432 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetota bacterium | 718
Ga0003067J53075_110850 | Not Available | 673
Ga0003067J53075_112783 | All Organisms → Viruses → Riboviria → Orthornavirae → Lenarviricota → Leviviricetes → Norzivirales → Fiersviridae | 1783
Ga0003067J53075_113501 | Not Available | 503
Ga0003067J53075_115215 | Not Available | 557
Ga0003067J53075_121442 | Not Available | 744

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0003067J53075_100045 | Ga0003067J53075_1000451 | F060937 | NRGAGNGREIEERTAGIIPGNWGKVGPGWLVQPLGRRIAGFAQRRQNSPAPLAGVATRHNAIEELGGEGR*
Ga0003067J53075_100122 | Ga0003067J53075_1001222 | F104572 | MRPIALLAAGNGVGEPAAHRKVERPMSVKGKESVANSHSRYYPA
Ga0003067J53075_100147 | Ga0003067J53075_1001471 | F094695 | TWLVMPSCSSLQRCELLDGGISVCSVRDPCFRETWDDVQWFSDESIESSWVQLFPVRRLPFPVASFRFSLSVVVRRFSRLRHSTLRLTCLLEVQPPINWPRMRTRSRLSCDFFPYSARQKQASVSPGASNLRHCPSSGFLTLSTSCSACNHGRFISPGLRSWGSTKSAYSLPSFDKGRVRRAWVSSPKLSPFP*
Ga0003067J53075_100497 | Ga0003067J53075_1004971 | F082737 | VPSDPRYQFAPQPACQV*ACHIQHAGRISRLAPRPRYFSDGDSMLGRTPASLPVRSQNLEAAFHSPTTTSLLPDDRGGVRVPALPLQLCDTISVSPVRSGLHPRPVSRTAWDFYNQNPLPSGPAPLLPAALNRRSPSGQFAPPDQSTRSRWPTGNLPSETPDFPSLPTGGRIRYQHQRIIVPAPLRPCRLTVP*T
Ga0003067J53075_101086 | Ga0003067J53075_1010861 | F005048 | LKFEHTTPIAEDCLQDFVVETFASAKADDGLIALVVEPRHVERQGLVRTLRELGRRAIGVATALDAVQLLGREGEHVDTVFIEAESSSLPSLELVEFLAHNHPHVRRVLVGEPSEIAASWVAQATGEVHALLETPCDQEAVHRVLHRLQFTPHDGALS*
Ga0003067J53075_101093 | Ga0003067J53075_1010931 | F091449 | LLATFRLQGLITLLAVYSLRSRAGFVSHRPRSWDLPFGAFSSWKVTGRFRLGRTHILFTRRYTLAPRCRGRLDKPQFLGFDPSESSWQSDVCLIRRLLVAPLGLTLPGYLTKALTRISPSLLSRASQQGLRLTAGASEFRSAFAPPHPPA*
Ga0003067J53075_101136 | Ga0003067J53075_1011362 | F010305 | MKQAEKPHSKSSCCEALDGPATRPKTPLAVENGVGKLAANLAPNVSMGKRDWRTPNP
Ga0003067J53075_101241 | Ga0003067J53075_1012412 | F017604 | MSKGCDETVGDTSGDNPDPETPLRRRGQVARKGGWERGRAHEETRAPTRRETGDGSAEC*
Ga0003067J53075_101610 | Ga0003067J53075_1016101 | F100491 | MVPSDLPVACSPTGMHGQNVASGTGET*ALCSPPLVLANRGSVRRCASALNSGRSPFLDTAFRSTAPRTGLATDPRNCVNVPGLHLRNDPQIRSCSFGTALPPPCGLLMALRGTINIRYPLPDPISGLIIRLRVSAPLRELSFPRDQSPQPGSGL*SLPLQVARSSFTPRSAATVLFITQ
Ga0003067J53075_102351 | Ga0003067J53075_1023511 | F001233 | LRWAAAINRLRLAVSRIIPGDWGKAESGWLAQPLLSRIARFGGGGRIHQFLWRRSRAVSKREKGTAR*
Ga0003067J53075_102736 | Ga0003067J53075_1027362 | F028190 | MPTMASIVVKKFDGTTDITYDSLSASGGDSSPAVWRQDTGAAAGLPIGFRSLFKLWTTWNGPKTARQTKFNFVAPYAVQDSTTTLYSAKDRVVVDGIMTVPQGIPSANINEAIYQACNLLAATLIKQAAASGYAPT*
Ga0003067J53075_103256 | Ga0003067J53075_1032562 | F061584 | AARLAVSPSKVRVIDSASRGLVTEPRWTLMKIGRSTECCERRSCRRIRLPAPTMTSLFPGSLCGLTPPPSPVPLRVRVHPLVNFASPSEHEPFRSCPARCRAWRLPWGFVPHRGISTRSPLPSELPMSRPTDPSSAFLTPSTVCSSAHLAGLFHPTTTSGIHLPGVISRCLARPPRRRPVPSCRSSARSCRRVAPTTPDRTDPPTGS*
Ga0003067J53075_103341 | Ga0003067J53075_1033412 | F005443 | MLANTLVTNEVKNSTGAEVEFSRASTEGRTTVFKQSAETPNAPHRLKISHQESGAGTKLVRRSVVRVDKTVTGADSNPVVVTAYTVLVHPVGNLASQAEPKHVLAELMSFCASLGASTTILYDGTGNGAVCLLEGDL*
Ga0003067J53075_103822 | Ga0003067J53075_1038221 | F016476 | RHGDRVLSWAFAPFSTSGIRGPLSREPSQPAKFRLQGLATLLTVCSLESRAGFVSHRRRSWDSPFGGFLSLQASAAFRPGRTHVPLAQRYFRRRSVRPARGASVSGFTPVGTALRPCGVLAQQPPAPPLGFAPLGLTCEDLAPDFSRTPLACLTNPGDYSPGQPTPQSVDRPSPCLARLVPKYRPAEATLMGFLHLPVPDHLSSPVPGLWNSPFVASCITADSPTIFGHRKNPAEAVQDRPWVPSIAT
Ga0003067J53075_104138 | Ga0003067J53075_1041382 | F005443 | MLSNTLNTNEIKDSAGAEVEFSRMSQSGRATEFAKINESPVAPHRIKISHLESGAGIKKRRRSMVRVDKTVPSTLDATVMVTDSAYIIKDTPVGALSTNAEPANTLAELLSFCATTGAGTTVLFDGSGNGAKALLEGSL*
Ga0003067J53075_104509 | Ga0003067J53075_1045093 | F030638 | MAAAAALTLKNNAAANVTYDVYSVNPDSVEWTEAGATSILGTSRFVLSRVIPADKTAGVYRTRGKLTRPVINGTSGLLDGTLTATFEILRPAKLTVAEVDEIVARFKEAVAQAIVKTAAESGAIPS*
Ga0003067J53075_104816 | Ga0003067J53075_1048161 | F005048 | LKFEHTSPKAEDCLQDFVVETFARAKADDGLIALVVEPRHVERQGLVRTLRELGHRAIGVATALDAVQLLGREGEHVDTVFIEAESSSLPSLELVEFLAHNHPHVRRVLVGEPSEIAASWVAQATGEVHALLETPCDQEAVHRVLHRLQFTPHDGAFS*
Ga0003067J53075_106132 | Ga0003067J53075_1061321 | F022577 | MRWLKGAVVLAALGVGIAQVASARVPAFVRQTGLVCNQCHVTWTPTPDLTFTAVKFRMNGYRTPWVAEKIEAGQEGALGGRRLLLGVTGYLSYHMRSNLFQQSKGASDPTLAEPTAGPVTSNPFSSLAWDYTGPIAENVGIWTEWYSTNFNPVTTGAGSVGNQFGAVRNDEFDVRMGWNPGEGGNIVSVFINNQGQTSPFFGAFGSGTPAGGQGQFIHTGVAAWVKDRVVVQLAVAPGVDNLDYKRMTYGVVLGFLPMNTDGMWLMPTFSMLAGNDMAPTAGGTPGVAALSKGGAGYTSQSMGDATRTLFDVRFGFLDHGHWSFNSATGYSWNKETYNDGAGSTLVGIGSTVRVWYDRTYGINAGLNKRLTYDFTDASGVVHPIPSDLGYNVLLVYRQAMNFAWEFGFSNNQSLRLDQNWRNGWSWNLQWHFLY*
Ga0003067J53075_106289 | Ga0003067J53075_1062892 | F028190 | MPAMASIVVKKFDGTTDITYDSLSASGGDSSPAVWRQDTGAAAGLPIGFRSLFKLWTTWNGPKTARQTKFNFVAPYAVQDSTTTLYSAKDRVVVDGIMTVPQGIPSANINEAIYQACNLLAAALVKQAASAGYAPT*
Ga0003067J53075_109345 | Ga0003067J53075_1093452 | F014988 | MAFAPPALITGAAVTGLTTPTYIIAVDTPPAINAKQYAVTALGGTQTGVDVNTVSKPFSISFFRPVQLRSLPAANPVTGVIKNIPMNQYKLITRKGAQPSANTPAMTARITTTIEVPAGSDTYEPEEIRALISSHFGVGFSTASGIADTVLTG
Ga0003067J53075_110098 | Ga0003067J53075_1100981 | F016476 | LSSNLTFPLECYPATPTRPPQRPSPLMGFRSLQHLRNPRSTHRGQSQPTTFRLQGLATLLTAYSLESRAGFVSHRRRSWDSPFGGFLSREVSPAFRLRRTHLPLAQRFFRRRSVRPARRASVSGFTPPGIALRSHGVLGRQPPAPPLGFAPLGSDRENLDPDFSRPPLACLASPRDYSRNKPTPQSLYRPSPCPAQPTPKRRPAEATLMGFLHLPVPDHSSPP
Ga0003067J53075_110197 | Ga0003067J53075_1101971 | F008500 | VSCSNCHAQGFIPVVDEVREIAIANAREIGLDRDEVEQLEGIYVSPQEFARTVQEDSQGFYQRALQLGDLPIQGGDPVSSVFLRFDQDLRIEDVAGDLGVTPDDLDDNLDLLDPVLAVLERGTVDRDDFTAVYVNSLCIMSTPLENQPDPAVCDAAEAALDL*
Ga0003067J53075_110432 | Ga0003067J53075_1104321 | F014210 | MRWLRGAVVLAALGVGVAQVASARVPAYVRQTGLVCNQCHVTWTPTPDLTFTAVKFRMNGYRTPWVAEKIEAGQEGALGGRRLLLGVTGYLSYHMRSNLFQQSKGASDPTLPEPTAGPVTSNPFSSLAWDYTGPIAENVGIWTEWYSTNFNPVTTGAGSVGNQFGAVRNDEFDVRMGWNPGEGGNIVSVFINNQGQTSPFFGAFGSGAPAGGQGQFI
Ga0003067J53075_110850 | Ga0003067J53075_1108501 | F029638 | VDITVTAPATTAPGSFAGSLIGQVIPATATVGTRIQCVPSNVNGFNGSTITIRKIDQFGQPLTAGFSIQQGPFWVEVARVSLGSTLASNPCATDGTQGSFNITGAGTSCANIGVITPAPFLSGLPAGQYRVVEVSGPNSYCTLVQVYNGNQAQNQSNTSAYNSGLLTQPVTLNLPDANIYDLQLTFVNSCVTPGGPSTATSQIAVVIGGSTPGLVNTSNIEIS
Ga0003067J53075_112783 | Ga0003067J53075_1127833 | F028190 | MANMTVKKFDGTTDIVFDALSPSAGDNVPAVWRQDTGNAAGLPVGLRSMVRMTTKDNGPKTARQVKLTFDFPYAVQDSTTTLYSAKDRVHGEVMFTVPVAIPATWINEAIAQ
Ga0003067J53075_113501 | Ga0003067J53075_1135011 | F007739 | RAAKSKIDVVRFRIIPGDWGKVRSGWLTGLLLNRAARPDSGGRIHQFL*
Ga0003067J53075_115215 | Ga0003067J53075_1152151 | F081396 | ISGESDQAPDRRTLKIIIESDCDELGERSWAAPAVTPLVGFGERHPEEVNPRRGSDPVDG
Ga0003067J53075_118046 | Ga0003067J53075_1180462 | F013430 | MRRCIQALVVVVVLLALGSLALAQQTPQPVVRMGNWIEVG
Ga0003067J53075_121442 | Ga0003067J53075_1214422 | F060956 | MDNDLTYNSVVFAHSFDVEGRHVRISKARAINTPDTLTIQATDYVDSKTKVPGKRYNVRIDREDLDPESRKIISSAYLVIAVPETVADAQLDVLIATFKAAVANADLIADVLDGQL*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice