NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026740

3300026740: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1w-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026740
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072080 | Ga0207439
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1w-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 20743468
Sequencing Scaffolds: 26
Novel Protein Genes: 28
Associated Families: 27

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea | 6
Not Available | 12
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000569 | Metagenome / Metatranscriptome | 1018 | Y
F001213 | Metagenome / Metatranscriptome | 746 | Y
F002616 | Metagenome / Metatranscriptome | 543 | Y
F004317 | Metagenome / Metatranscriptome | 443 | Y
F016622 | Metagenome / Metatranscriptome | 246 | Y
F017892 | Metagenome / Metatranscriptome | 238 | Y
F019548 | Metagenome / Metatranscriptome | 229 | N
F022731 | Metagenome / Metatranscriptome | 213 | Y
F023901 | Metagenome / Metatranscriptome | 208 | N
F026602 | Metagenome / Metatranscriptome | 197 | N
F026949 | Metagenome | 196 | Y
F038480 | Metagenome | 166 | Y
F040716 | Metagenome | 161 | Y
F043582 | Metagenome / Metatranscriptome | 156 | Y
F047460 | Metagenome / Metatranscriptome | 149 | Y
F057496 | Metagenome / Metatranscriptome | 136 | N
F057773 | Metagenome / Metatranscriptome | 136 | Y
F058528 | Metagenome / Metatranscriptome | 135 | N
F068880 | Metagenome / Metatranscriptome | 124 | Y
F070698 | Metagenome / Metatranscriptome | 123 | N
F071393 | Metagenome | 122 | N
F078970 | Metagenome / Metatranscriptome | 116 | Y
F083156 | Metagenome | 113 | Y
F088429 | Metagenome | 109 | N
F089166 | Metagenome / Metatranscriptome | 109 | Y
F089980 | Metagenome | 108 | N
F094081 | Metagenome / Metatranscriptome | 106 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207439_100092 | All Organisms → cellular organisms → Archaea | 1653
Ga0207439_100213 | Not Available | 1353
Ga0207439_100333 | All Organisms → cellular organisms → Archaea | 1199
Ga0207439_100542 | All Organisms → cellular organisms → Bacteria | 1038
Ga0207439_101042 | Not Available | 838
Ga0207439_101341 | All Organisms → cellular organisms → Bacteria | 775
Ga0207439_101903 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 686
Ga0207439_101944 | Not Available | 681
Ga0207439_102038 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 667
Ga0207439_102099 | All Organisms → cellular organisms → Archaea | 659
Ga0207439_102254 | Not Available | 641
Ga0207439_102262 | Not Available | 640
Ga0207439_102576 | Not Available | 612
Ga0207439_102765 | All Organisms → cellular organisms → Archaea | 594
Ga0207439_102832 | Not Available | 589
Ga0207439_103188 | All Organisms → cellular organisms → Archaea | 564
Ga0207439_103296 | Not Available | 558
Ga0207439_103344 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 555
Ga0207439_103418 | Not Available | 551
Ga0207439_103590 | Not Available | 542
Ga0207439_103934 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 525
Ga0207439_104158 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 514
Ga0207439_104254 | Not Available | 510
Ga0207439_104316 | All Organisms → cellular organisms → Archaea | 508
Ga0207439_104362 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 506
Ga0207439_104445 | Not Available | 502

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207439_100092 | Ga0207439_1000922 | F023901 | MKDNCSQNEIKTSEQSQSCEGGVPLDPKKFGGYEFAINTKGWDAKREPTHNCHDQHKSGLIPTNLENDKAVRLRQTVKDESGKVHQIGEIDYMDGNGFHKVMDIFDSSPNPWMVDRNLYETKSYFWIRNNGSGYITVRDVSLEILS
Ga0207439_100213 | Ga0207439_1002132 | F019548 | MNPSKKEIYKILERFTSQKGGILIILHNSFSSDSKPPQEQTSVVRDDERKSAIFEINFGTVAGLSMLCECEKALADQVMSLDFLDSGIEGDVIWFGGMLDKSGSEFIGSTYDDGLKSAPVEQSELVHRVNQAIDKCLEYMLNSVELDKKVYVDASDRMSGYVKLTRIGEHIREKHPDLFRDNTKSE
Ga0207439_100333 | Ga0207439_1003331 | F026602 | MISLTLASMHVFAGRIYAIEQPSKMEGINEEECNKIFKCKIISEDVLKYPDIVNPFAKNEEIAKTLNANDAQIMTEHTCQKLMDVDIVKKKDQKIGEQTPKYLVCLP
Ga0207439_100542 | Ga0207439_1005421 | F083156 | MSDGGRERASLGVEVWNSSQKWSVQRSAVRSIAWLDVL
Ga0207439_101042 | Ga0207439_1010421 | F001213 | QQEEVNVIVAGSGVYRIEGEDIPVSVGSFLRFDPGTTRQPIAGPEGMTMIGVGARRGSYEPRGPF
Ga0207439_101341 | Ga0207439_1013411 | F070698 | QTNIFDRRHHTLMVILNISEIWNNGHNTTFRDSHLEDDNSEILSNGNAPYVIKGNGDCMIIVHADALGASKNFGDGH
Ga0207439_101511 | Ga0207439_1015112 | F040716 | MTHVRIAANAILVFSFMAVTGFPALAAPAKCNAELRKCNSHCNLVYESGRANRTCRNRCKDNLYVCKARPS
Ga0207439_101903 | Ga0207439_1019032 | F043582 | PQREKKMPLLFYFPLIIWMGVLEAMQDEMRVAATAKARR
Ga0207439_101944 | Ga0207439_1019442 | F047460 | MFLLHFVMFGCIYDHFVTALNSAQMGQSGAINAKVRATKSRL
Ga0207439_102038 | Ga0207439_1020381 | F078970 | ASRVWIDRVQSEISLWSDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRRLAEEAQQLTQKFAQSLGNGRPGMTT
Ga0207439_102099 | Ga0207439_1020991 | F068880 | KYKGVLESADRLTFGHSNYGYRYFLNLSSPSKLLDSSSGLIPKNEFLSKIPEGYDVPFKGMLILTQKHNELYAIIFLNPKEKFDSMLNQIQPTLDSIQLSG
Ga0207439_102254 | Ga0207439_1022541 | F038480 | NMEGACRVVADRGSSQMKLGRKSGKIVRRDTMKAILVSIVLYFAATAAHSMPISVLNANGLSATIPISDQCGDRCGSSRSYVKDRRSGVGGYSGGYVLVRDPLIQRRPFCPFGSYVACVVSGTYCVDLCH
Ga0207439_102262 | Ga0207439_1022621 | F057773 | LILEPAVASHETDDGLGKVDIEIVADDIPSDVGGGAAQQAAEKARKILFGPRIADHAFDLAGGNIEGRNQGLSAVAAVLELTPLDLARRHRQSRRDALERLDAGHLVDRDGAMGVIGGGRGFVDRADVCALGIEGGIGLRGQPVADAMRLEVSLFFKKRPTERCEMFGMSPRRMASSAISRWLQWLMGRSLSDGFSHVIATRAQICSGVNVA
Ga0207439_102576 | Ga0207439_1025762 | F058528 | NDIPDGLSVWPQDDVARAICNALMMATGALVTIREDQHLKDQGTYAIGIGHKLN
Ga0207439_102765 | Ga0207439_1027652 | F088429 | TPEPQPETNASLNTPEPQPETNASLNTPEPQPETNASEIVTPRSIDLNITVGKDPIARSENQMVTVVALDPTTGKVLDRVFIKLEIKDPVGILVKNYTGTVGNLTRTFKIGENAIGTFIISATASQAGVQSTKSLPFQVQ
Ga0207439_102832 | Ga0207439_1028321 | F002616 | MQDTPMPNSFPNWTSRIVIAGRIAEYRRPESRHESTMPGDYHLPFGRVEAQAKAARWLWQSYII
Ga0207439_103188 | Ga0207439_1031881 | F089980 | MDLESATELIVMNWLGIFSLAATIFIAIVTINYRNKQHQIKGLLDAFKILNTREHRTSRRKVYELYIEYEKNKDVGIFDNVPEVVDVRADFDVIGTLVKSRNIDEKLFLIEYGPLAYRCWKYLKNHIEAERKKRNFDPFMMNFESLAGKADNFWLKRGYDLSKTLLYQPEQ
Ga0207439_103296 | Ga0207439_1032961 | F089166 | ACELGYWAMVDALEAERNAAKKNNAGDHPAFSCEPEAEPSPAPRVQRTADEPAP
Ga0207439_103344 | Ga0207439_1033441 | F038480 | VLVSIGLCLAATAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACIMSGTYCIDLCH
Ga0207439_103418 | Ga0207439_1034181 | F004317 | MGKVFRALRPCLSPYRDERQMNLSYMVLLDMWNMFEQKFLSFIGG
Ga0207439_103590 | Ga0207439_1035901 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFLAIPYLVLPFAGALAAYSSRRMKGSVLERILSALFPAFAFVVLFAVRIVYGLFFEGQPYTLPHFLAGFSVTLVFIVVGGLLLVLGAWPFCRPHLREQ
Ga0207439_103909 | Ga0207439_1039091 | F094081 | MADDHRKTLKEEFTDRLEKAKGRLQQSFPEIQQSIKTSGAVEAARKIIDPAQSIFKQFADDIQLKDLIAKAEALVANANLTLTKAASKDAAPTEARPIGAPSNEPSAKKAGVKKTPSR
Ga0207439_103934 | Ga0207439_1039341 | F016622 | VLREVAEYPNCFGPLGPGAERIETDRYTLCLGPGSTWNTVQRQRFALEELDEVLEEVRAHLRDRNRTQTQWEVGSSAPAGLVDALLERGIGFDKDPYAVALVLTSEPPPPREGLVARQIETYAEYLDANAVQWEAFGTPPEQI
Ga0207439_104158 | Ga0207439_1041581 | F026949 | MKKLFALILSVAALAPFTAQSQRPPGSLAGFLSGQGLVGVKLERRYGNHLFVL
Ga0207439_104254 | Ga0207439_1042542 | F022731 | MGTEATIDRVKKELLRAFDNTRAELDRIEILAAGLAAFNAPIPGYEPMFRHLPQLNRNAHELAADEPRA
Ga0207439_104316 | Ga0207439_1043162 | F057496 | MAGVYRNHTKVAEDHSSAYCKDEYVFGGKGEPLRLSHLIAFNVIEDAEHISGILTVEFDYYSDDDSIKYRDMHYSNPRLIKHIEGNPTVMKNID
Ga0207439_104362 | Ga0207439_1043622 | F017892 | YTKQSTQGPTTVVEGFFEGSLEDAYDEYKKELEAAGFKILFDEIEEHDSEVSWEGEGRSGQVALREECGSDDKIYVHITNRPASE
Ga0207439_104445 | Ga0207439_1044451 | F071393 | QANELQDAHDEIDRLSQTVSALQEAVTQYQAGAAAAEDEIVLLESEKAALQAQLDGAFEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRRQRSAFNQQKSMLEIRLQEKEALAATQAATIKQLEGVRDELDKRFRVIEAL
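
Rows of the Sequences table above can be turned into FASTA records for downstream tools. The following is a minimal Python sketch, not part of NMPFamsDB: the function names `parse_row` and `to_fasta`, the regular expression, and the FASTA header layout are all assumptions made for illustration. The pattern assumes each row carries a scaffold ID, a protein ID, a family ID of the form `F` plus six digits, and an uppercase amino-acid sequence, with or without ` | ` separators between the fields.

```python
import re

# Hypothetical row pattern: scaffold ID, protein ID, family ID (F + 6 digits),
# then an uppercase amino-acid sequence. The " | " separators are optional,
# so rows copied with the fields run together are accepted as well.
ROW = re.compile(
    r"^(Ga\d+_\d+)\s*\|?\s*(Ga\d+_\d+)\s*\|?\s*(F\d{6})\s*\|?\s*([A-Z]+)$"
)


def parse_row(line):
    """Split one Sequences-table row into (scaffold, protein, family, sequence)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return match.groups()


def to_fasta(rows, width=60):
    """Format parsed rows as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for scaffold, protein, family, sequence in rows:
        # Header layout is an illustrative choice, not a database convention.
        lines.append(f">{protein} scaffold={scaffold} family={family}")
        lines.extend(
            sequence[i:i + width] for i in range(0, len(sequence), width)
        )
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    example = (
        "Ga0207439_100542 | Ga0207439_1005421 | F083156 | "
        "MSDGGRERASLGVEVWNSSQKWSVQRSAVRSIAWLDVL"
    )
    print(to_fasta([parse_row(example)]), end="")
```

Using the protein ID as the FASTA identifier keeps records unique even when two genes belong to the same family, as is the case for F038480 in this sample.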




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice