NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026739

3300026739: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-SCHO21-E (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026739
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0091538 | Ga0207536
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-SCHO21-E (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 21484026
Sequencing Scaffolds: 23
Novel Protein Genes: 25
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
Not Available | 2
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 3
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 6
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000268 | Metagenome / Metatranscriptome | 1411 | Y
F000280 | Metagenome / Metatranscriptome | 1383 | Y
F000283 | Metagenome / Metatranscriptome | 1379 | Y
F001033 | Metagenome / Metatranscriptome | 799 | Y
F001079 | Metagenome / Metatranscriptome | 785 | Y
F001496 | Metagenome / Metatranscriptome | 683 | Y
F001823 | Metagenome / Metatranscriptome | 630 | Y
F003416 | Metagenome / Metatranscriptome | 488 | Y
F004992 | Metagenome / Metatranscriptome | 416 | Y
F007432 | Metagenome / Metatranscriptome | 351 | Y
F007622 | Metagenome / Metatranscriptome | 348 | N
F012082 | Metagenome / Metatranscriptome | 284 | Y
F014678 | Metagenome / Metatranscriptome | 261 | Y
F021191 | Metagenome / Metatranscriptome | 220 | N
F021336 | Metagenome / Metatranscriptome | 219 | Y
F022435 | Metagenome | 214 | Y
F026017 | Metagenome / Metatranscriptome | 199 | Y
F026840 | Metagenome / Metatranscriptome | 196 | Y
F027472 | Metagenome / Metatranscriptome | 194 | Y
F036400 | Metagenome / Metatranscriptome | 170 | Y
F051442 | Metagenome | 144 | Y
F072777 | Metagenome | 121 | Y
F078510 | Metagenome / Metatranscriptome | 116 | Y
F094959 | Metagenome / Metatranscriptome | 105 | Y
F101638 | Metagenome / Metatranscriptome | 102 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207536_100014 | All Organisms → cellular organisms → Bacteria | 1996
Ga0207536_100060 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1417
Ga0207536_100135 | Not Available | 1183
Ga0207536_100153 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1150
Ga0207536_100200 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1070
Ga0207536_100289 | All Organisms → cellular organisms → Bacteria | 973
Ga0207536_100376 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 902
Ga0207536_100631 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 790
Ga0207536_101253 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 659
Ga0207536_101306 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 652
Ga0207536_101663 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 610
Ga0207536_102020 | All Organisms → cellular organisms → Bacteria | 582
Ga0207536_102170 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 571
Ga0207536_102206 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 568
Ga0207536_102529 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 549
Ga0207536_102983 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 527
Ga0207536_103120 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 520
Ga0207536_103275 | Not Available | 512
Ga0207536_103364 | All Organisms → cellular organisms → Bacteria | 508
Ga0207536_103390 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 507
Ga0207536_103456 | All Organisms → cellular organisms → Bacteria | 505
Ga0207536_103498 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 503
Ga0207536_103500 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 503

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207536_100014 | Ga0207536_1000141 | F101638 | LPGIVSGGKERKALDVIPVKMRERDYDLFLFVADGSQVSAQISQSRACINDRDAVRIGERDLQAGGVAAELLETGIADWDGSTRT
Ga0207536_100060 | Ga0207536_1000601 | F001823 | MRAIDRDLIRAGHYGQRPGVPHSKAEHMAAPTKAANVKKVLANSEPSTHGPK
Ga0207536_100135 | Ga0207536_1001351 | F094959 | MKALFTAIFCIFFGANVIPAEPDKDALEKLAAHIGKTFWSPPEKRLVRATTYEKRKDGSGSRKNVSNATISLLRYATAKEAGTIAVPHWLPRGSMVKIQTQQGVYAYIASDNGGDVDNRKAAQSSGKTIEQRGATVLDFCAPKQLWPDFIIVEIYYYAGKVPFDKLSLEDQKTLFAYAMEYVTKRE
Ga0207536_100153 | Ga0207536_1001531 | F027472 | MQNKYEAPELTLIGEAEEVVMGIGSFGDDLPLQTVPDFEFEQD
Ga0207536_100200 | Ga0207536_1002001 | F014678 | MLGSFRIERRGACSKIRFDHVGDDGARLGEIEGCNSRIHLVETLAAAQKL
Ga0207536_100289 | Ga0207536_1002892 | F000268 | MRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFYRDSTTHPKTDIETFTVSMRTADDRWKAMWSILSGRGLTQPIQYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGSSGVTFGFDKDGTMIFPDSVDR
Ga0207536_100376 | Ga0207536_1003762 | F072777 | LTQIWIVWLLFGLATVAVFETYWRLPPAELWKVTNSGFVGGAGRAFVFVSFSAALVSLAVLPIVADRLEDRRADIAAIVAFVLCATVAIPGVQTPSHLDPKWSNTFAVLGVLVAVGLTAWATG
Ga0207536_100631 | Ga0207536_1006311 | F001496 | LFIGLSLFGIFGAWFVDRKATDVALKGFGLIEVGIGVVDAGVGRVDDLIARSRTEVRQASETITAAGAQAQANSPVLNALNERLETSLAPRIAQMQQVLAPVRDAMGTVGNAVSLLNSLPMMADRAPRLAALDEAFNRLEELSADTTQLRGTLRTLVVEQKSDIAPGTVAALKGLTQRIDTRLGEVHANVQAVRADVAALKDRLDKRQSRLLFVFNLLALLSTLMLAWILYTQVVVIRHHWARVRPPRPERRSATMS
Ga0207536_100793 | Ga0207536_1007931 | F021336 | NLSDAIFFSTNVPGGADPGVDTVGIGYDGSFLRAFLLANGDPNLQFSIGIDVNDTGTAQTLEAFALLNLTQHTVLAQYSLLQPGGTLIPSQNNGTGFPDYTLSGFDINLGTDIQAGDQLIFYARISGANDGPDSFFLVPQQVPGPIVGAGIPGLIAACGAMLAFARNRRRKALGVA
Ga0207536_101253 | Ga0207536_1012531 | F007432 | MPIAGRTARALSVVTALGLAATTSAQAHLVEDFGVRKGAPAHISLSTGFNGFVGAGIKKIQVDGVQMDAFCIDPFTMALRSSPGYKFVPLTKAPEAPFTLSASEATEISDLWAMFYNPGMKENKAAGLQLAIWEIVGGDDFSIIGKDYGANLMLAALRSYSGPGAGLIALTGPGQDYVVLTPPGQGDESTPTPPPH
Ga0207536_101306 | Ga0207536_1013061 | F004992 | TPAAIDPDMFAAVFSENWDNARYIKSERISFMNAYSVICAGVLALLQSVQASDLIRIALLFFMTLFSLVGLLTSLRLKSELEECLAKIEAMTVQAKVIQFVALGQLEGKPSRYPRFRWIFPIFYTMTTAGFIALIVYRLVTGEAMK
Ga0207536_101663 | Ga0207536_1016631 | F001033 | MCGHSLIARLTIAVFVFQMLGVTSVVHAERPDSTAGTSNAGTRKLFIGPSSTSVALRGKASLIVSPLTHRDGNYVGNYQLKVRPYFFKSEKGSLLLAASDDAVRKLQTGTAINFTGQAVTHKDGRTHIVLGRATPSSGDRGSVTFSIVTDDARIVFNTSYHFPAPRP
Ga0207536_102020 | Ga0207536_1020202 | F078510 | FDLHKFKLGPHLIQLHRKVLRLQGNLKDLPQIADGLALAEGKNRDFLLGIIRRREKWETLQVIPMKMSERNDQLVLAMSNRAHVPAEIAKAGSGVNNGDTLCIRGRDLKARGVATELLETSFTDWG
Ga0207536_102170 | Ga0207536_1021701 | F036400 | PGMSPDPRAEELRRKLAESRSILAERDEFEGGEVTVDLAEPAPEDPETRRRAVHETARDTVQRMRAR
Ga0207536_102206 | Ga0207536_1022061 | F003416 | MDSLTFASSQKQESDSAAERRWENEGGNPGQLQQLPCDYRKEDATTGPAQGALRHSATKIDGT
Ga0207536_102529 | Ga0207536_1025291 | F021191 | MPDDSLSKRLALINTLLRPIQKTIRELLPDFIERNLKNKTVRAAIVTAADAALVRQFPIARIFPPEMRQRLIRSQLDLVLDELVLKDSLPTDDLGTSFRSFIVTAKSAVATPADVIEAAMLKLLGSDWTATLLPIDAFTFELTTENPQLSVPRGNWRTDCNKNQGL
Ga0207536_102983 | Ga0207536_1029831 | F001079 | PHRLGAAVLMEASLKEASALPYVAWSSSSSALIPGEQWPLVFGSMQALKGHVQEYPGCQKLEAFVAPSGSDYRVHCYTTWDTPEQLEAFLERGYTFERMLEDVAGLAAEPTLVMEKVF
Ga0207536_102983 | Ga0207536_1029832 | F000280 | MEEPQQPQRPARPLVGYRDVGEDVRHSRSAMTRAWVILAVLMALYLGWTLVVYFLEP
Ga0207536_103120 | Ga0207536_1031201 | F007622 | WQGARDQPLAKWNGNSIVAIWIAARWGMKDLAIYEIEADEIKRIQPVWRRVWLLFDHDFRERFLSKYPDEKGSGVIFVSKGEGPDSKPELEFKGRKMLLNLFADNKPNLSTTPHWTASLHAVWNLDTVDFDKVDFQPGPIELRPEE
Ga0207536_103275 | Ga0207536_1032751 | F026017 | MKTSLAIVLATIALTFGAMLSFTSGVAAHDYGSLSGKSPGRCGVAACVIDLAPLPSPPGPGPYRGNR
Ga0207536_103364 | Ga0207536_1033641 | F022435 | ILGTHDQFFTVPAINSTYNRIASAGTSERFLKRIMMKPNGKHGVVDENSYPELLELIQNIDSWFKYCFKNGSRPPGTPTVSIEVQPTTMVFHVNAPAGGSAINQVQLYYASQIDTRPSTVHDFGSISLSRNGAEYVGSIPIGTLPPAGPPVTPNNIIYLASVNDGANFT
Ga0207536_103390 | Ga0207536_1033901 | F051442 | FRREVFDAVGGYNEATTSGEDQDLFARMTTRGRVVTLPDILYSYRYHANNATLFNGARAIGERHSQNGHALAAFYMLGAMRLWAGDPPMLLQPILEDKSLKWTPRTLMILVSAIWGHLSPPSLRAVLRSSIRARDLLASLRIKDGEPYEWRPE
Ga0207536_103456 | Ga0207536_1034561 | F000283 | MNSTLVTRNVFLCALAIFIASCGTATFTKTGSDATIESLRNFHLAFIDEFAVPGKKFNASAFNAKVNQGNAMFQQAIANDKFTARRPVFVDLKAQFDADAAHIKSKASHGKITPALASEMKKDVNKVYDHALGR
Ga0207536_103498 | Ga0207536_1034981 | F026840 | GVWSVNGTVTAKKPIKLQGLLSGEDFDLTMEPGVNPNTPMREIVIKNKAWICSDGETWHAGRPDDRLIYNWAHVPIMAGGGISQMPFEKVGTEQRDGQTWLHVRMKVPEKNVKPKELPQYWLVLDSQGQAQYIGHTEMPMFSQARNEVMYCSFDYAPAKEKIAPPPL
Ga0207536_103500 | Ga0207536_1035001 | F012082 | MSEHLAQPLLLVALILLPVRYVQAESPKDPIYVKISHGWNAAYAHGNEYAEFRVIGNGAKLQDPYHILLQKGVGMMVSFVDKKDLQNDTDLLSAHAQWEIDYWHQRASKIESNTREDLTGTRKDVKVTEIKVYNDKGARMSSYLI
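
The rows of the Sequences table can be converted to FASTA records for use in downstream tools. The following is a minimal Python sketch, not part of NMPFamsDB: the helper names `parse_row` and `to_fasta` are hypothetical, and the ID patterns (scaffold ID, protein ID formed from the scaffold ID plus a gene index, family as `F` plus six digits) are assumptions inferred only from the rows listed on this page.

```python
import re

# Assumed patterns, inferred from the rows on this page (not an official spec):
#   scaffold ID: "Ga" + digits + "_" + digits
#   protein ID : same shape, the scaffold ID followed by a gene index
#   family     : "F" + six digits
#   sequence   : uppercase amino-acid letters
ROW = re.compile(r"^(Ga\d+_\d+)(Ga\d+_\d+)(F\d{6})([A-Z]+)$")


def parse_row(line):
    """Split one Sequences row into (scaffold_id, protein_id, family, sequence)."""
    # Drop optional ' | ' column separators so fused and delimited rows both parse.
    compact = re.sub(r"\s*\|\s*", "", line.strip())
    match = ROW.match(compact)
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return match.groups()


def to_fasta(lines, width=60):
    """Render rows as FASTA text, wrapping each sequence at `width` residues."""
    records = []
    for line in lines:
        scaffold, protein, family, seq = parse_row(line)
        wrapped = [seq[i:i + width] for i in range(0, len(seq), width)]
        header = f">{protein} family={family} scaffold={scaffold}"
        records.append("\n".join([header] + wrapped))
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    rows = [
        "Ga0207536_100200Ga0207536_1002001F014678"
        "MLGSFRIERRGACSKIRFDHVGDDGARLGEIEGCNSRIHLVETLAAAQKL",
    ]
    print(to_fasta(rows), end="")
```

The header layout (`family=` and `scaffold=` tags) is an arbitrary choice for this sketch; adjust it to whatever the consuming tool expects.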




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"