NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027445

3300027445: Soil microbial communities from Kellog Biological Station, Michigan, USA - Nitrogen cycling UWRJ-G08K3-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027445
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072117 | Ga0207554
Sample Name: Soil microbial communities from Kellog Biological Station, Michigan, USA - Nitrogen cycling UWRJ-G08K3-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 8938293
Sequencing Scaffolds: 25
Novel Protein Genes: 26
Associated Families: 25

Dataset Phylogeny
Taxonomy Group | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65 | 1
Not Available | 16
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: USA: Michigan
Coordinates: Lat. (°) 42.4 | Long. (°) -85.37 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002315 | Metagenome / Metatranscriptome | 572 | Y
F002508 | Metagenome / Metatranscriptome | 553 | Y
F003059 | Metagenome / Metatranscriptome | 510 | Y
F004397 | Metagenome / Metatranscriptome | 440 | Y
F017166 | Metagenome / Metatranscriptome | 242 | Y
F017216 | Metagenome / Metatranscriptome | 242 | Y
F017759 | Metagenome | 239 | N
F020078 | Metagenome / Metatranscriptome | 226 | Y
F021340 | Metagenome | 219 | Y
F025530 | Metagenome | 201 | Y
F028161 | Metagenome / Metatranscriptome | 192 | N
F028554 | Metagenome / Metatranscriptome | 191 | N
F034995 | Metagenome | 173 | N
F045439 | Metagenome / Metatranscriptome | 153 | Y
F048418 | Metagenome / Metatranscriptome | 148 | Y
F050525 | Metagenome / Metatranscriptome | 145 | N
F054151 | Metagenome / Metatranscriptome | 140 | N
F070441 | Metagenome / Metatranscriptome | 123 | Y
F077827 | Metagenome / Metatranscriptome | 117 | Y
F081321 | Metagenome / Metatranscriptome | 114 | N
F082926 | Metagenome / Metatranscriptome | 113 | N
F084203 | Metagenome / Metatranscriptome | 112 | N
F095448 | Metagenome / Metatranscriptome | 105 | N
F099776 | Metagenome | 103 | Y
F104619 | Metagenome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207554_100075 | All Organisms → cellular organisms → Bacteria | 2045
Ga0207554_100139 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1731
Ga0207554_100253 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1480
Ga0207554_100342 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. ARR65 | 1367
Ga0207554_100474 | Not Available | 1223
Ga0207554_100525 | Not Available | 1174
Ga0207554_100588 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1125
Ga0207554_100636 | Not Available | 1089
Ga0207554_100690 | Not Available | 1049
Ga0207554_101035 | Not Available | 902
Ga0207554_101036 | Not Available | 902
Ga0207554_101040 | Not Available | 900
Ga0207554_101157 | All Organisms → cellular organisms → Bacteria | 860
Ga0207554_101408 | Not Available | 785
Ga0207554_101560 | Not Available | 749
Ga0207554_101586 | All Organisms → cellular organisms → Bacteria | 742
Ga0207554_101639 | Not Available | 729
Ga0207554_101769 | Not Available | 702
Ga0207554_101776 | Not Available | 701
Ga0207554_101992 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 657
Ga0207554_102075 | Not Available | 643
Ga0207554_102120 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium erythrophlei | 636
Ga0207554_102217 | Not Available | 619
Ga0207554_102295 | Not Available | 609
Ga0207554_102445 | Not Available | 590

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207554_100075 | Ga0207554_1000751 | F004397 | AMSEFDLKVALIIFVTKFIDPFAAVPALVAGYFCRTWWQVVIAAAAVGIFVEMILVLFEPTPGVHQGRLLMAVLAAGVWSNLAFAFKTWRAKRA
Ga0207554_100139 | Ga0207554_1001393 | F048418 | MPIHLPPKARVKAVPARRQAAINSIDWLGLWALSVAFCAIVLAGIAYFMIGDNTGASCILVGAAAVIAIVVRFGTDRDEARDEN
Ga0207554_100253 | Ga0207554_1002531 | F003059 | MMRAGITAGMLAGAVIVTAAVPAAAQVRDAVYRGTLVCDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDG
Ga0207554_100342 | Ga0207554_1003421 | F025530 | MSDMQISTGTMMRISEAARDNIAAGIWFAVLAGSLFLTAHGQSILMTAGLMLELTAAYSTFVLCGKGARSLFVHAVPYAFALAGAVLLCLAPDFPNAVQASLVFLGVTALMHGSVVYSALKNSPESEDPAYASAT
Ga0207554_100474 | Ga0207554_1004741 | F070441 | MRISTLDISGTTTTVRRHEALLRFARYLLVYNLAIISWVVISKLSQMPSFEGLLEWLTGPTWKFAALVLGSVGASYLILAHRLWWAYAVIMLAQVVYFFSAGSAELAVMRASVLFAYGVITLPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYPWSRVAIDLGAPIVIAVICLVLANRLRHFGRQHFINRLA
Ga0207554_100525 | Ga0207554_1005251 | F017166 | RTRGSGVRFATLLALAVMVFTGWLYLGEKDAVRSTKADVVAAADRTAVALSQSADPERNTDADAEDVFKKHVQTPSALEDLAVKQSVESISAGRLRQSVKISARARTTLSEFFSMQGAEIDITATHDFDRKK
Ga0207554_100588 | Ga0207554_1005881 | F050525 | MRGFAGLLAYLAGVSAIFGIGIVGLMALQSPTERTPSSAPVAAASHTESLAKPVKRPVDDKKTAHRNQKHKKEHVTRKQPHEAPFTDAGRNAYGYAEEPRRIDPNRFLFFGR
Ga0207554_100636 | Ga0207554_1006362 | F002508 | MSNDENTQHYARWRNSEVSRNFHRLGLLVAAIILTAGLLLMAKDALGLRLWDLLPADIPILARGIAIGLTGIGLVSLAAYG
Ga0207554_100690 | Ga0207554_1006902 | F028554 | VLQDQVNFLKGQMRKAKQVRNRALSAGEGRRAIIVMAALTGLFFLAAFVTAGSLLSTDPRPMSSLSKVTPLPPTEGGEAASRVASIVMETDKKGRCEERQFDNRTGKMVSANYVNCDARLEPERDSTPSENINRERIRAILGAFKK
Ga0207554_101035 | Ga0207554_1010351 | F099776 | RHVRSHVFKTAQIVVAESAPTIECRARDLSGYGARLCLSTTYGLPQQFDVIIDGKRRSVRPVWMTYTEMGVMFAEASQKSADLVECERDIASLIELLKMAEEKWPSSESYEISETEMLCRDQALLDMWPEACRRIGFSKREFPIDVIKLWQKQMGWPN
Ga0207554_101036 | Ga0207554_1010361 | F002315 | MKRVAVLSMAMLTVAFAADKKTYRYTCKGGAFTVTAAVEASGRWSKAGPVVLQIGSEPLQTLTADPDAPDADSFSNKDYEFYALKAFITLTRKSHGVVVKTYNACRAE
Ga0207554_101040 | Ga0207554_1010403 | F082926 | PSHRREPPRRYFSRMQANVREASMPELSATEQPEQSNIVTWKVISIGTAIVVVVFLFLWL
Ga0207554_101157 | Ga0207554_1011572 | F021340 | IVAFRTDQGFFIKFSKAVCPLNKRHSKLQTFQAHRTIARANVARSQFWHVAIVTSFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK
Ga0207554_101408 | Ga0207554_1014082 | F081321 | VRALCEAREFRYVAVVPVRSVTSGLVMRAVMCTVLRAAALISLALVLVSPTHAACRGNCEPNVEVARAAMQQIFKQTFLSPYTLVSFERLDGRSGERYGGAFYEMRIRAVLHYDGVRLRCRRPSCPELHHYLLENDAASKKATVAGWLFLQQDGEGDWQPVPLTPGPQ
Ga0207554_101560 | Ga0207554_1015601 | F095448 | SPVMMASTLTHQPDPAIPNIVCPKCGLRMQVAAIEPAGDNDRTVTFGCDCGHRYDLSERAIVALARDSSDRW
Ga0207554_101586 | Ga0207554_1015862 | F028161 | LSLAAALAVVAIQAPHAQDNKNVREDDYVRKVPLEDFKVPIVPIIPPGSSLDLRPGRTPDSSDRIYNTTPFSRDQTTPSIGLSIKSPFDDRK
Ga0207554_101639 | Ga0207554_1016391 | F045439 | MSRSGFSRRTFLQGSVGLTVANFVPGTTPFAHAATM
Ga0207554_101769 | Ga0207554_1017692 | F020078 | MHSATRFCWQSGVIALATIIFAFLVPGIALCKVVSLTSIASAIAITTLVAGFGLYLAGQLIEKRTPQSERVDHYLQASILVTAAGLLWGHVVLQTGPWRDRSIEPGVAFAIVAGCGVAGA
Ga0207554_101776 | Ga0207554_1017761 | F034995 | ELLAVYGSVLLGIAVQVGRAKWSGQSLFPFGLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYQPPGSAGLGTPRS
Ga0207554_101992 | Ga0207554_1019921 | F054151 | PDMRKVDQYTLADHFRALAEGLSLRAVSERAPAQRAELQRLAECYAELAKQQSPADHFARGVGPR
Ga0207554_102075 | Ga0207554_1020752 | F017759 | LLFGVVVGLLNSECPQQNRAYESKYGADSQHIELQGKVHGSASLVDARRLARNQRRPKVPAIGPVLFAGEIAYRICAMDNRLIVRLKKSEEPKILTVQNSRGTEFFMANLRQIPSILRGNDSTGFFAKEI
Ga0207554_102120 | Ga0207554_1021201 | F084203 | MRARVRRAMWMLGALALAVPASAQESTDVAPLTPEDSALLANALVFDPAALVTAPKKPLRLPGYRNNEYDITRTQKVDGSTTVVVKQPLQTEWSNSVGADLAPSRPATYPLPLPTEHNN
Ga0207554_102217 | Ga0207554_1022171 | F002508 | MPMSNDDDTHVYTRWRDSQVSRNFHRLGLLVAAIILIAGLLLMAKDALGVRFWDLIPADIPILAGGIAIGLTGIGLVSLAAY
Ga0207554_102292 | Ga0207554_1022922 | F104619 | MHKTPKDIARSIRAFRNAHRLGLLRSVRISPGKRACEAAMAQDRMEYLGNAVPRLP
Ga0207554_102295 | Ga0207554_1022951 | F077827 | VTQGGIERSALFDKPRDLFIVVYLQPLPRRCAMDSAACLVDGRKPTDIFLETDSSGNATWLNFAGPGMKASRLLHVAIATSENEDLGSFRTRCGEVPGFPMTLFDVMAPSPNLYYGPLASTLNAKNKGTGSMGDFCELIGDKPDAAIMKLGTSVAGVLGGH
Ga0207554_102445 | Ga0207554_1024452 | F017216 | CGAIYEVIETKGPSRESRPAKCVLCDREMFAWEGDNVGQLHLIWRPDEDRE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice