Basic Information | |
---|---|
IMG/M Taxon OID | 3300026818 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0091560 | Ga0207634 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A2-11 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 29167064 |
Sequencing Scaffolds | 23 |
Novel Protein Genes | 25 |
Associated Families | 24 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1 |
Not Available | 9 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Sinorhizobium/Ensifer group → Sinorhizobium → Sinorhizobium meliloti | 1 |
All Organisms → cellular organisms → Bacteria | 3 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2 |
All Organisms → cellular organisms → Archaea | 2 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | Wisconsin, United States | |||||||
Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F004397 | Metagenome / Metatranscriptome | 440 | Y |
F010272 | Metagenome / Metatranscriptome | 306 | Y |
F019688 | Metagenome | 228 | Y |
F020518 | Metagenome / Metatranscriptome | 223 | Y |
F023626 | Metagenome / Metatranscriptome | 209 | Y |
F025530 | Metagenome | 201 | Y |
F025757 | Metagenome | 200 | N |
F028554 | Metagenome / Metatranscriptome | 191 | N |
F032172 | Metagenome / Metatranscriptome | 180 | Y |
F032670 | Metagenome / Metatranscriptome | 179 | N |
F032686 | Metagenome | 179 | Y |
F034172 | Metagenome / Metatranscriptome | 175 | Y |
F038225 | Metagenome / Metatranscriptome | 166 | Y |
F047102 | Metagenome / Metatranscriptome | 150 | Y |
F049092 | Metagenome | 147 | N |
F049763 | Metagenome / Metatranscriptome | 146 | Y |
F051990 | Metagenome / Metatranscriptome | 143 | Y |
F064047 | Metagenome / Metatranscriptome | 129 | Y |
F082662 | Metagenome / Metatranscriptome | 113 | Y |
F083399 | Metagenome | 113 | N |
F089000 | Metagenome | 109 | N |
F089374 | Metagenome / Metatranscriptome | 109 | Y |
F103641 | Metagenome / Metatranscriptome | 101 | Y |
F106123 | Metagenome | 100 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0207634_100093 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1947 | Open in IMG/M |
Ga0207634_100349 | Not Available | 1393 | Open in IMG/M |
Ga0207634_100367 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1374 | Open in IMG/M |
Ga0207634_100530 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Sinorhizobium/Ensifer group → Sinorhizobium → Sinorhizobium meliloti | 1245 | Open in IMG/M |
Ga0207634_100815 | Not Available | 1070 | Open in IMG/M |
Ga0207634_100900 | All Organisms → cellular organisms → Bacteria | 1033 | Open in IMG/M |
Ga0207634_101453 | Not Available | 866 | Open in IMG/M |
Ga0207634_101834 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 789 | Open in IMG/M |
Ga0207634_102173 | Not Available | 737 | Open in IMG/M |
Ga0207634_102434 | All Organisms → cellular organisms → Bacteria | 707 | Open in IMG/M |
Ga0207634_102466 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 703 | Open in IMG/M |
Ga0207634_102727 | Not Available | 679 | Open in IMG/M |
Ga0207634_103227 | All Organisms → cellular organisms → Archaea | 635 | Open in IMG/M |
Ga0207634_103378 | Not Available | 625 | Open in IMG/M |
Ga0207634_103438 | All Organisms → cellular organisms → Archaea | 621 | Open in IMG/M |
Ga0207634_103480 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 617 | Open in IMG/M |
Ga0207634_103622 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 608 | Open in IMG/M |
Ga0207634_103779 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 597 | Open in IMG/M |
Ga0207634_104459 | Not Available | 560 | Open in IMG/M |
Ga0207634_104657 | Not Available | 551 | Open in IMG/M |
Ga0207634_104855 | All Organisms → cellular organisms → Bacteria | 542 | Open in IMG/M |
Ga0207634_105271 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 527 | Open in IMG/M |
Ga0207634_105726 | Not Available | 512 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207634_100093 | Ga0207634_1000931 | F028554 | VLQDQVNFLKGQMRKAKQVRNRALSAGEGRRAIIVMAALTGLFFLAAFVTAGSLLSTDPRPMSSLSKVTPLPPTEGGEAASRVASIVVETDKKGRCEERRFDNRTGKMVSANYVNCDARLEPERDTTPSENINRERIRAILGAFKK |
Ga0207634_100349 | Ga0207634_1003491 | F019688 | LEWLTGPTWKFAALVLGSVGASYLILAHRLWWAYAVIMLAQVVYFFSAGSAELAVMRASVLFAYGVITLPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYPWSRVAIDLGAPIVIAVICLVLANRLRHFGRQHFINRLAELKVLPAASTLFLRAFRDDQVRIRRASRNLFSSVFDLGRVPATLDELMLERLDGRGDLIAIGNPQDRKGAARQSPWGAQRLYVDDAHWQETVTMLARDADRIVLCIDASDGVRWEIAHVLQSGHAGKTLFFLNPSTDVQTRKRQLQEDFGVSAADLASIDVNRVLALRTTSTDQLILMVCDKPERDAYLVAARLAFEGRAGNLTMSAKGR |
Ga0207634_100367 | Ga0207634_1003673 | F032686 | TLVCDKLPFSAGKGREAIEVTIAGGTVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK |
Ga0207634_100530 | Ga0207634_1005301 | F089000 | MGEPTPATPASKYFAATVAMIAGAFFFAVGAGLLPIPGGPSNLHGPLWLLLCVGLAFFLAGLAILIPMLGHANDSGDLPAGAPFWLRAMQYLIGLCIFACFGAISSWIAFG |
Ga0207634_100815 | Ga0207634_1008152 | F004397 | MSEFDLKVALIIFVTKVIDPFAALPALVAGYFCRTWWQVVISAAVVGIFVEMILVLFEPTPGVHQGRLLMAVLAAGVWSNLAFAFKTWRAKRA |
Ga0207634_100900 | Ga0207634_1009002 | F038225 | VGAFFVLTPAQAADYRVVQYNDTKICQVVDMAGLFKPIRTNYTVLTKKSIPTFDAAMKARADVSKKAKCTFL |
Ga0207634_101197 | Ga0207634_1011973 | F064047 | MTLMKSSRAGLFALAFGILLSLFLSVPSYDVSYAQTQAPAAAPAAPAAPPPA |
Ga0207634_101453 | Ga0207634_1014531 | F025530 | MSDMQISTGTMMRISEAARDNIAAGIWFAVLAGSLFLYAQSILMTTGLMLELTAAYSTFVLCGKSARSPFVHAIPYAFALAGAVFLCLAPDFHNAIEASLVFLGVTALMHGSVVYSALKNPRETEDPVYASAT |
Ga0207634_101834 | Ga0207634_1018343 | F083399 | MVEESAVSGDARPWGFFATFVLGAIALLAGQLAGMAALVGWYDFDLRNVPVLSQHGGAIILFIFVSAPVQVAILALAAGYKGNIADYLGYKLPRRGEVVLCLAI |
Ga0207634_102173 | Ga0207634_1021732 | F025757 | TRAPMSYSTSISLFWLEMAVLIGCVVLSIRMRSRAAMWVALAIVAHCAVWLAIHDEEILIRLVASALVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA |
Ga0207634_102434 | Ga0207634_1024342 | F034172 | MRTIFVFLVAGLIVLLGGASYAADVRLDHKRAHARGAGTFDQRLRVVEQRPYCGNCEAPFGRTHSANVVRLRFINWPYWQESCAAGACGEY |
Ga0207634_102466 | Ga0207634_1024662 | F020518 | MARMQPLGMNEVDVEIRHLCEESERQIGTSASTRTYARNPAVVKALAAFRGALVREG |
Ga0207634_102727 | Ga0207634_1027271 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRVHAAEAASDRREATVALSIKQIEFLNTELTAAAAERFRLVAAMQGEQRRQRSVFSQQKSILEDKLQE |
Ga0207634_102809 | Ga0207634_1028091 | F106123 | QQSGREFYAAVFYAFERRWLQERGGDYTAVRWRTIDQVTPFGLIRLPVRVVRERGAQKGGYLSLSKALLKPKATRLLSPWVEKGVLEAATCSNYRPAAAELWRWVRVKVSAWLIWKCVQFHGARLCEQLERQWWPDRALPRKADVVLTEIDSTYLKAQRRGRAARGHPTAHFAIHLGLHYSGRERRYQKRGSCSVRLRSKRWILSTEPLSIFGRRLAWQRMRH |
Ga0207634_103227 | Ga0207634_1032271 | F032670 | PNLFMKCGICKEEIIKEKRREHLRYHKLDDTLVEWIIETDDDLISSYEKH |
Ga0207634_103378 | Ga0207634_1033782 | F032172 | LDHQVAELAMFVSHMDGIGRAMTMMQLEGEEPVQIIRGRFGPSGGIVPELDKDGQVVPTGHFNNRLGFHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGE |
Ga0207634_103438 | Ga0207634_1034381 | F010272 | MYMNPERKLILIAIGAFVTLSLFVFPSTNMIAYSQSENKTQGKEQAQEKLFELAQKFNKILKDSNVNLTLPQNGDLSAKLQELKDSGAFKELSEKFSQAVQELGQGNKTEVLKEQAGADLGKLIQKLQDLRNNSTQ |
Ga0207634_103480 | Ga0207634_1034801 | F082662 | LYSTTSALATLLTDEREALFGRIRPLLAGPYRLPLKHDLTWTRLAA |
Ga0207634_103622 | Ga0207634_1036221 | F051990 | MRLRIRTLLCCTAIFSAVSFFTITFAETNPLSKAQVADQIRKVENGVDDFEKYLTSRGENAKDQAGSAKSSGEAKRGKGANSAKKEAGKEK |
Ga0207634_103779 | Ga0207634_1037791 | F103641 | HFFETYKRTICVDIRSQRTIFGHAGTVKPQQPAVWWKRLLTVLRIRKVKAIEI |
Ga0207634_104459 | Ga0207634_1044591 | F089374 | MTDILDNARVSDEEPKLIVRKASHAPIWSVWAVLEGTPSEEIFEGTSEVDASSWIDT |
Ga0207634_104657 | Ga0207634_1046571 | F047102 | MPFSIAIPESRRCRERALECRTVADRLRLQDARAQMLKVAADYERMARDAKQREIAQGLSYLSVLVVRQRH |
Ga0207634_104855 | Ga0207634_1048552 | F049763 | MNMVVDNRITETPELTGQFMVFPKSFSSEEASQKTLSRMSREQLATFLQA |
Ga0207634_105271 | Ga0207634_1052712 | F023626 | MTLALLVVAGVTFSLLFGLVHKVSERRTKRRLQQQWEERERALNR |
Ga0207634_105726 | Ga0207634_1057261 | F032172 | FVSHMDGVGRAMTMMKLEAEAPVQIIRGRFGPSGGIIPELDKDRQVIPTGYFNNRLGFHALMRAVGVDERVVTLNELFANPKLNLEITRRIEAGQKSVSISGEDAAKTGEF |
⦗Top⦘ |