Basic Information | |
---|---|
IMG/M Taxon OID | 3300026806 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0072042 | Ga0207546 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G06A4a-10 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 26717425 |
Sequencing Scaffolds | 19 |
Novel Protein Genes | 21 |
Associated Families | 20 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1 |
Not Available | 12 |
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1 |
All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae → Gemmata | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | Wisconsin, United States | |||||||
Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | 0 | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F001604 | Metagenome / Metatranscriptome | 664 | Y |
F013811 | Metagenome | 268 | Y |
F017145 | Metagenome / Metatranscriptome | 242 | Y |
F017759 | Metagenome | 239 | N |
F019338 | Metagenome / Metatranscriptome | 230 | Y |
F022495 | Metagenome / Metatranscriptome | 214 | Y |
F025757 | Metagenome | 200 | N |
F032172 | Metagenome / Metatranscriptome | 180 | Y |
F046476 | Metagenome / Metatranscriptome | 151 | Y |
F049092 | Metagenome | 147 | N |
F053280 | Metagenome / Metatranscriptome | 141 | Y |
F059295 | Metagenome | 134 | Y |
F063910 | Metagenome / Metatranscriptome | 129 | N |
F074000 | Metagenome | 120 | Y |
F082749 | Metagenome / Metatranscriptome | 113 | Y |
F090518 | Metagenome / Metatranscriptome | 108 | N |
F090598 | Metagenome | 108 | Y |
F094081 | Metagenome / Metatranscriptome | 106 | Y |
F094114 | Metagenome | 106 | N |
F095113 | Metagenome | 105 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0207546_100018 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1987 | Open in IMG/M |
Ga0207546_100410 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1068 | Open in IMG/M |
Ga0207546_100418 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1064 | Open in IMG/M |
Ga0207546_100663 | Not Available | 928 | Open in IMG/M |
Ga0207546_100736 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 900 | Open in IMG/M |
Ga0207546_100958 | Not Available | 839 | Open in IMG/M |
Ga0207546_101037 | Not Available | 824 | Open in IMG/M |
Ga0207546_101192 | Not Available | 790 | Open in IMG/M |
Ga0207546_101198 | Not Available | 789 | Open in IMG/M |
Ga0207546_102001 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 674 | Open in IMG/M |
Ga0207546_102643 | Not Available | 621 | Open in IMG/M |
Ga0207546_102819 | Not Available | 609 | Open in IMG/M |
Ga0207546_103052 | Not Available | 596 | Open in IMG/M |
Ga0207546_103767 | Not Available | 557 | Open in IMG/M |
Ga0207546_103773 | Not Available | 557 | Open in IMG/M |
Ga0207546_103880 | Not Available | 552 | Open in IMG/M |
Ga0207546_103900 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 551 | Open in IMG/M |
Ga0207546_104948 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Gemmatales → Gemmataceae → Gemmata | 512 | Open in IMG/M |
Ga0207546_105007 | Not Available | 510 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207546_100018 | Ga0207546_1000181 | F019338 | YLGISDTMADGNRTGAFLIVGAIVCLSTAQFVFAQEVDPRCKDIYDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEAAGGTQTLDGGRVAFPKYYRREGLKFRRSDAIEGYLGCMRAAGRK |
Ga0207546_100410 | Ga0207546_1004103 | F046476 | KGAPRLDVPSLFRPQFMAEVQKEHPEYFADLPPLK |
Ga0207546_100418 | Ga0207546_1004182 | F090518 | ELEQIDDGILNFNVSDEALESAGDNAVAANYTLGACTGLSVCPG |
Ga0207546_100663 | Ga0207546_1006631 | F095113 | EGQDGDIKRPPGRPGDTSMFQILIRAIVIILAGVVAIARASAQETPPKSRSYELEKVHLAKRDNAYVLVHTITSTTSRDRFWALFEVNNVDGSRKCEWLKVVEPKQRYRFECPVEATAGQKYPSRVRVFSDARLRLTDREVFYEPIFDITADRVSAAAAVSDSTGTVVPDGTLDAIELPLPAIFKPTWYRRVDKGFGMRAYENSGDLTVNADELLFTDGKKTVRIPYNQILSVRWEPLPNDIANHWVVVRFKNEEGKDDGIAFRDGGRIGLRGDTGPIYQTLRRAAKQ |
Ga0207546_100736 | Ga0207546_1007362 | F090598 | NPADPNWRQYEMAGISHLPEPILSLGLPNQNTADARPIFRAALENLTKWARDNDRESPPPSRYFSGSVDAIDAFVPMTDGDGHFAGGVRLPHVESTVHGRVAGAPLGRHTPLNPLGLDPFNPFVFISGTFTRFSDDELLERYQSRDQYARRVRRAADHLAASGYITDADRMALIAAAEHEPLPEELRCPR |
Ga0207546_100958 | Ga0207546_1009582 | F025757 | MKAYGKSETRKATRAPMSYSTSISLFWLEMAVLIGCVVLSIRMRSRAAMWVALAIVAHCAVWLAIHDEEILIRLVASALVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA |
Ga0207546_101037 | Ga0207546_1010372 | F001604 | MTNILDSARASDEEPRLIVRKASHAPIWSVWAVLEGTPSEEIFEGSSEEDASSWINSGGRSWLEERRRKRNA |
Ga0207546_101192 | Ga0207546_1011921 | F013811 | MHVTPFSATTILAFAIASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDMRNGLGIKVDVSGYSDTFPPGPAAYCQPDSSTAGIACGTGLTFQATGRALYVTAGPEWKIRRGKRFAPFAQALVGIVHTRSTFMMNGSDVQYANPFTGGVLLFTSAGFPPDRSIDYADAHADAGLALAIGGGFDIRLSKRLGLRAAMDWDPTFLVRPVLPDLTPDAQGQVALRPAS |
Ga0207546_101198 | Ga0207546_1011981 | F017759 | LLLFGVVVGLLNSECPQQYGAYESKYGAHGQHIELQGKVHGSASLVDALRLARNDPAPKAPVTRPAFPAGGIAYRTCAIDNRLIERLKKSEGPKILIVQNSCDTEFFMANLRQIPSILRGNDSAPSPRKKIYPSIKRMMTSVARRICRTAKNARTAATSHP |
Ga0207546_102001 | Ga0207546_1020012 | F022495 | MRKLTPTLTQSGLLAVLLQRQRYKISQLACRRAQRKHMRAALWSTLTWPWRAITAPMPVRHSRRALGRIP |
Ga0207546_102643 | Ga0207546_1026432 | F053280 | VNKRWATVEKQIMQQAPWAPWSNRVFPEFFSKKMGCIHIQRLYGVDLMRLCRK |
Ga0207546_102810 | Ga0207546_1028102 | F094081 | MRDDQGKILKKEFVERFDKAKGRLQQSVPEIQHLLKTSNVVEAARKIIDPAQSIFKQFADDIQLKDLIAKAEALVANANLTLTKAASKDAAPTE |
Ga0207546_102819 | Ga0207546_1028191 | F017145 | AFGAALLIYSNDWHPSGWSALRQEATATKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGVVNCYEAVSRPEKGQRGGMSSLRMNAIGKAFNRRDE |
Ga0207546_103052 | Ga0207546_1030522 | F094114 | VRRLFLAAAACVVVAAVPGAGAQTPAMLRIESLSTPVISPNGNGVRDSVIVKTNSSPGTLLGLRVYVWGGRLSGWKRIRTGLSSTSGELTWNGTSATGRALDDGIA |
Ga0207546_103256 | Ga0207546_1032561 | F082749 | HCIAHSAFAANATGEGRLAPAFARRGGMDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS |
Ga0207546_103767 | Ga0207546_1037671 | F063910 | MSERYNLELISRRRAFSFLGSAAVALSVAVPVTVLVATDAEARVGNPGSAVSVAGANRRDRRQDRRNKKSPNTPTTTGQGEKK |
Ga0207546_103773 | Ga0207546_1037731 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRLHAAEAASDRREATVASSIRQIEFLNTELTAAA |
Ga0207546_103880 | Ga0207546_1038802 | F019338 | TMPAGRRTETVLTGAVACLAFVSIVQPVFAQDFDPRCKDIFDKIACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK |
Ga0207546_103900 | Ga0207546_1039001 | F059295 | LAVERLYFVAAMGAVVYAAFFAALIAMPIYGGAAYDKNGYQPFNAPVPIFAKKWDANITAFSIQLLILVAGLLTVSGAFAG |
Ga0207546_104948 | Ga0207546_1049481 | F074000 | VAPVINYESTGVSTDVSMREGEAVIVGTLNIGPSGDALILVVSAKRTQK |
Ga0207546_105007 | Ga0207546_1050071 | F032172 | MTMMKLEAEAPVQIIRGRFGPSGGIIPELDKDRQVIPTGYFNNRLGFHALMRAVGVDERVVTLNELFANPKLNLEITRRIEAGQKSVSISGEDAAKTGEF |
⦗Top⦘ |