NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300025309

3300025309: Soil microbial communities from Rifle, Colorado, USA - Groundwater C2



Overview

Basic Information
IMG/M Taxon OID: 3300025309
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0053054 | Gp0054775 | Ga0209212
Sample Name: Soil microbial communities from Rifle, Colorado, USA - Groundwater C2
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 1062417705
Sequencing Scaffolds: 21
Novel Protein Genes: 23
Associated Families: 20

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 7
Not Available | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 2
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Tenericutes → unclassified Mycoplasmatota → Tenericutes bacterium HGW-Tenericutes-7 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfuromonadales | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Rifle, Colorado, Usa
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil → Soil Microbial Communities From Rifle, Colorado, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): freshwater biome | planetary subsurface zone | groundwater
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: Rifle, Colorado, United States
Coordinates: Lat. (°) 39.55 | Long. (°) -107.97 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F003800 | Metagenome / Metatranscriptome | 468 | Y
F007838 | Metagenome / Metatranscriptome | 344 | Y
F009579 | Metagenome / Metatranscriptome | 316 | Y
F009862 | Metagenome / Metatranscriptome | 312 | Y
F015640 | Metagenome / Metatranscriptome | 253 | Y
F017253 | Metagenome | 242 | Y
F018779 | Metagenome / Metatranscriptome | 233 | Y
F021450 | Metagenome | 219 | N
F026515 | Metagenome / Metatranscriptome | 197 | Y
F027235 | Metagenome / Metatranscriptome | 195 | Y
F029285 | Metagenome / Metatranscriptome | 189 | Y
F036104 | Metagenome / Metatranscriptome | 170 | Y
F056711 | Metagenome / Metatranscriptome | 137 | Y
F068265 | Metagenome | 125 | Y
F074912 | Metagenome / Metatranscriptome | 119 | N
F084410 | Metagenome | 112 | Y
F096266 | Metagenome | 105 | Y
F098345 | Metagenome | 103 | N
F099501 | Metagenome | 103 | Y
F103864 | Metagenome / Metatranscriptome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0209212_1003319 | All Organisms → cellular organisms → Bacteria | 21417
Ga0209212_1005297 | All Organisms → cellular organisms → Bacteria | 15454
Ga0209212_1009262 | Not Available | 10301
Ga0209212_1018127 | All Organisms → cellular organisms → Bacteria | 6332
Ga0209212_1023813 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 5208
Ga0209212_1035224 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 3904
Ga0209212_1043064 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3363
Ga0209212_1044889 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 3261
Ga0209212_1050864 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 2975
Ga0209212_1080030 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes | 2129
Ga0209212_1152371 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1304
Ga0209212_1169314 | All Organisms → cellular organisms → Bacteria | 1201
Ga0209212_1182740 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1131
Ga0209212_1207853 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Tenericutes → unclassified Mycoplasmatota → Tenericutes bacterium HGW-Tenericutes-7 | 1022
Ga0209212_1210464 | All Organisms → cellular organisms → Bacteria | 1012
Ga0209212_1226142 | All Organisms → cellular organisms → Bacteria | 956
Ga0209212_1229069 | Not Available | 946
Ga0209212_1260620 | All Organisms → cellular organisms → Bacteria | 853
Ga0209212_1351998 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 672
Ga0209212_1384810 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 622
Ga0209212_1389417 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfuromonadales | 615

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0209212_1003319 | Ga0209212_100331922 | F056711 | MRKAFTALAAAAFALSIVAGGCAKDKSPSQLQAGKKGIAGTEKKAAPKPRAKLFTGTIEDMDEAAGTLTLKGPKGEMSFQARKKVKEQLGELEIGDKVIVKHDGKTALSIVKPRTSKTALAWYRNEETETVREEFH
Ga0209212_1005297 | Ga0209212_100529714 | F003800 | MGDERRDDADQRRADVYSSARIGAAAALTLVLVVLLVLDVAVPGYDISPGILLPLLGAICALLGLEASAVWRSVR
Ga0209212_1005297 | Ga0209212_100529715 | F009579 | MSGIELVAAFLAGVGMGGALDRFVLPLLVDAWIDRLRRHGR
Ga0209212_1005595 | Ga0209212_10055955 | F074912 | MVPFNQMRDLHTLALAALRSGDVLAVTKEKYVVQFIREGLQHQLPLINMQLKKKGMRAISLKSLKK
Ga0209212_1009262 | Ga0209212_10092622 | F029285 | MKNQANYSEYECPYSHIEKECGHELRGPEGYENTYGVWCACGFRAPVFALDPIELGLKKK
Ga0209212_1018127 | Ga0209212_10181273 | F009862 | MPKPKAEKPLVVRCPNCKKTNEPNFNETTHRYACPICSAPVDVQVIIEKKKRSGNKGRNW
Ga0209212_1023813 | Ga0209212_10238136 | F027235 | MDSYVVRIYRRAGAKSRILIGTVEAAGTEKRLGFSNIEELWEILGRRKGRDLCAPPSPRRRLRKEVMSATAASGIEEPAEGVRQMKPDFRTLRGGIR
Ga0209212_1035224 | Ga0209212_10352245 | F003800 | MGDEHRGRAPERSRADVYSSARIGAAAALTLVLVVLLVLDVAVEGYDISAGILLPLLGAILALLGLEASAVWRGVR
Ga0209212_1043064 | Ga0209212_10430644 | F056711 | MRKAFTMLAAAAFALAILAGGCAKDKSPPRLQPAKKGIAGAEKKAAPKPRAKLFTGTIEDLDEAAGTLTLKGPKGEMSFQAREKVKEQLGELELGDKVIVKHDGKTALSIVKPRTSKTALALYRNEEPGKSHPNFSNQ
Ga0209212_1044889 | Ga0209212_10448891 | F084410 | LTQNLSLIYQIRDFGSRQILLPRQLYLSPGFAKIAPWWLARRDYFSAKVPGLVSAGCATNVYS
Ga0209212_1050864 | Ga0209212_10508643 | F099501 | MVDRFMLKFLGELFYLIIALCTIYGLWNLFREYRAINSTVIRTHIIAYELQKMVKSRARQ
Ga0209212_1080030 | Ga0209212_10800304 | F068265 | MANKKHSAAMPIKEGIFFPEDFTKLHKDYVIYIKEGVIIDLKKL
Ga0209212_1152371 | Ga0209212_11523711 | F021450 | MKKTISLLALLLLFSFEIQAQVTNVLNNFYTVWVKPAIPIIGGLVLIVGALANIGKVLGDSRDYRGFITSIVLYLAVYFCLVGIVAFIMA
Ga0209212_1169314 | Ga0209212_11693141 | F026515 | LLDGAYDEEIDAVAREFDPDRRAKLTRAMGQKLYDGYHGVMLGMKSITWALSKRVGGWQTLAYVPMETNYEYVS
Ga0209212_1182740 | Ga0209212_11827401 | F103864 | LAGARASGASEDEVREAISFAIRANAAKTHADILKVYTDGGR
Ga0209212_1207853 | Ga0209212_12078532 | F007838 | MRSLFTFFLLLLTFEVFSQTVVVRSPTWKIEDEAKDALEKADRLRQLEKLTEQVKTQKESLQVIRDATEKLRKINRKVANYHNLELAIVQVSEAYTRVLSSLKRISDNNCFKPSEYHMISESMMGLLSQTSYSITTLTVVLTDNFSEMTDGERLLNMNQAIKELRENLGVINSAIIEVEILDNQRMQLRTLNYINSIFK
Ga0209212_1210464 | Ga0209212_12104641 | F036104 | MQDGIPLSIGIIIVAALLGATFIAGVVVFSVLSAAF
Ga0209212_1226142 | Ga0209212_12261421 | F017253 | GDGRTQIGYNVGSYTQGRTAGDFLSSRKAARAERCTPKTTAAELAGS
Ga0209212_1229069 | Ga0209212_12290691 | F098345 | MQRPKNIKNLTHSASILNSLYLGKEIVYTGTGHSLRPYRGEIINIAYYSDLESVPYAEIKLKIWNNRTVTKIMKLSD
Ga0209212_1260620 | Ga0209212_12606202 | F027235 | MDSYVVRIYRRGGRKSRILIGTAEVAGAERKMAFSNIEELWGILRHRKGRDPWAPPSPRRRLRKEVMSATASSGLEESAEGVRQIKSDL
Ga0209212_1351998 | Ga0209212_13519981 | F018779 | MEVVTMTIKEPEKKLPPIGALAQHSLSGVKDLMKEILTKPHVDEDKMSYWQSIFSQSKANGCFYRGEDLPHMKWQWPRGTWFQEQLSHHPKALEFAAHHAKN
Ga0209212_1384810 | Ga0209212_13848101 | F015640 | RVIRYEIFYAISLAGLLHYMVTFNSSQRRMSLRVQLEIMKKPLIEHLKSEGIPIWEDLGRMPGEQRPKEKFLASDIVLATQAFITHNAHVTKAFETERFLDENQPYLDNIGDISDIMRMLKRISGEVHAKIAQAYEANLNQRYLMTTGDPFLLGFVAACGYVRNRSSMEILDKALDRLLGEFERAETDPLRFDDYQRALDMIHASR
Ga0209212_1389417 | Ga0209212_13894172 | F096266 | LFLTFVLLAILVAIAFSTLKKNRKQKLEKPKHRMLEDD
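
For downstream tools it can help to have these rows as FASTA records. The sketch below is illustrative only and is not part of NMPFamsDB: the names `ProteinRow`, `gene_index` and `to_fasta` are hypothetical, and the header layout (`family=`, `scaffold=`, `gene=`) is an arbitrary choice. It also assumes that a protein ID is the scaffold ID followed by a gene number, which matches every row listed here but is not stated on the page. Three rows from the Sequences table are used as sample input.

```python
from typing import List, NamedTuple


class ProteinRow(NamedTuple):
    """One row of the Sequences table (hypothetical container)."""
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


# Three rows copied from the Sequences table.
ROWS = [
    ProteinRow("Ga0209212_1005297", "Ga0209212_100529714", "F003800",
               "MGDERRDDADQRRADVYSSARIGAAAALTLVLVVLLVLDVAVPGYDISPGILLPLLGAICALLGLEASAVWRSVR"),
    ProteinRow("Ga0209212_1005297", "Ga0209212_100529715", "F009579",
               "MSGIELVAAFLAGVGMGGALDRFVLPLLVDAWIDRLRRHGR"),
    ProteinRow("Ga0209212_1210464", "Ga0209212_12104641", "F036104",
               "MQDGIPLSIGIIIVAALLGATFIAGVVVFSVLSAAF"),
]


def gene_index(row: ProteinRow) -> int:
    """Return the gene number embedded in the protein ID.

    Assumption (not stated on the page): protein ID = scaffold ID + gene number.
    """
    if not row.protein_id.startswith(row.scaffold_id):
        raise ValueError(f"{row.protein_id} does not extend {row.scaffold_id}")
    return int(row.protein_id[len(row.scaffold_id):])


def to_fasta(rows: List[ProteinRow], width: int = 60) -> str:
    """Format rows as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for row in rows:
        lines.append(
            f">{row.protein_id} family={row.family} "
            f"scaffold={row.scaffold_id} gene={gene_index(row)}"
        )
        # Wrap the sequence into fixed-width lines.
        for start in range(0, len(row.sequence), width):
            lines.append(row.sequence[start:start + width])
    return "\n".join(lines) + "\n"


FASTA_TEXT = to_fasta(ROWS)
```

Writing `FASTA_TEXT` to a file gives one record per protein, with the family accession kept in the header so that records can later be grouped by family.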




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice