| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300027116 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055661 | Ga0207539 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A4-11 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 23624174 |
| Sequencing Scaffolds | 17 |
| Novel Protein Genes | 18 |
| Associated Families | 18 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 8 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1 |
| All Organisms → cellular organisms → Bacteria | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.2958 | Long. (o) | -89.3799 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000268 | Metagenome / Metatranscriptome | 1411 | Y |
| F001757 | Metagenome / Metatranscriptome | 641 | N |
| F016050 | Metagenome / Metatranscriptome | 250 | Y |
| F030160 | Metagenome / Metatranscriptome | 186 | Y |
| F032172 | Metagenome / Metatranscriptome | 180 | Y |
| F047321 | Metagenome / Metatranscriptome | 150 | Y |
| F048418 | Metagenome / Metatranscriptome | 148 | Y |
| F049092 | Metagenome | 147 | N |
| F054170 | Metagenome | 140 | Y |
| F058224 | Metagenome / Metatranscriptome | 135 | Y |
| F059114 | Metagenome | 134 | N |
| F066928 | Metagenome / Metatranscriptome | 126 | Y |
| F070493 | Metagenome / Metatranscriptome | 123 | Y |
| F077482 | Metagenome | 117 | N |
| F079256 | Metagenome / Metatranscriptome | 116 | N |
| F080403 | Metagenome | 115 | Y |
| F088977 | Metagenome / Metatranscriptome | 109 | N |
| F090709 | Metagenome | 108 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207539_100519 | Not Available | 926 | Open in IMG/M |
| Ga0207539_100542 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 915 | Open in IMG/M |
| Ga0207539_100555 | Not Available | 911 | Open in IMG/M |
| Ga0207539_100670 | All Organisms → cellular organisms → Bacteria | 868 | Open in IMG/M |
| Ga0207539_100684 | All Organisms → cellular organisms → Bacteria | 864 | Open in IMG/M |
| Ga0207539_101016 | Not Available | 782 | Open in IMG/M |
| Ga0207539_101191 | All Organisms → cellular organisms → Bacteria | 750 | Open in IMG/M |
| Ga0207539_101596 | Not Available | 693 | Open in IMG/M |
| Ga0207539_101903 | Not Available | 660 | Open in IMG/M |
| Ga0207539_101979 | Not Available | 652 | Open in IMG/M |
| Ga0207539_102432 | Not Available | 619 | Open in IMG/M |
| Ga0207539_103107 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 575 | Open in IMG/M |
| Ga0207539_103607 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 546 | Open in IMG/M |
| Ga0207539_104286 | Not Available | 518 | Open in IMG/M |
| Ga0207539_104470 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 511 | Open in IMG/M |
| Ga0207539_104554 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 508 | Open in IMG/M |
| Ga0207539_104606 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 506 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207539_100519 | Ga0207539_1005192 | F054170 | AGNKRQLSAIDLEETEEILSWDVSDEALEIAGTAGQEIAGAYTLQFCTSMDCALAS |
| Ga0207539_100542 | Ga0207539_1005421 | F048418 | ARRQAAAHAIDWLGLWALSVAFCAIVLAGIAYFMIGDNTGASCILVGAAAIIAIVVRFGTDRDEARDEN |
| Ga0207539_100555 | Ga0207539_1005551 | F090709 | AIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK |
| Ga0207539_100670 | Ga0207539_1006702 | F080403 | ANQERELAAYVAKAQTEIQRRESEWWQKQLGSEEVSAA |
| Ga0207539_100684 | Ga0207539_1006841 | F016050 | MDIQPDANVEDQDRAKCGKNEAGGMESSGCWVRKHVGNRAADDRSDDAEHDCPKNRHGHVQYRFRDKARD |
| Ga0207539_101016 | Ga0207539_1010161 | F088977 | MRTTSLAGVLSLSLSAPLFSQTFNQTATYIALMRSSVGGLPPVATSTLQGDLQDGVALAIRYGYVPSSSRMDLPSMNNFGLTAVLPTGTASTVSITGGLSSLSRGGSDAWIIGAGGDLRLTDWAFSQGRSAPHLRVAVNGQLDYSKPRESALIAGSVGLPLSIIRPNRPKQEMQVVPFVTPSFAFGNFNPDDDTGLSHESGARFMIGGGLGLY |
| Ga0207539_101191 | Ga0207539_1011911 | F000268 | MRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYRDSTTHPKTDIETFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGTSGVTFGFDKNGRMTFPDSFDR |
| Ga0207539_101596 | Ga0207539_1015961 | F079256 | PGGLNKVAHNVSKTMKKAGRDTKAEVHRDASKTHQTLTKAGNDTKAQLKRTTGVTTHSPDANHKPGGVNKLARDVSHTSKTVGAKAKHSVKSASGEVHRDLTKAGKHAKEVVKDSVKKP |
| Ga0207539_101903 | Ga0207539_1019031 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIIILESEKAVLQAQLDVALEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRR |
| Ga0207539_101979 | Ga0207539_1019791 | F077482 | HQWYASTLVLPIVILAYPSLRCGEAYELLLDRELSALDQFYAGWGTVIRMDENSEGQGPKDTMQDNEYRPAGGIKKFAEPTIQLRGTAEKIGLDRQSLSSFIGTKFLNEFAFLQSDFVFEKTYQTWEIGIFECETWTVGVNYPIAFHVQCAGGSMDEPREWHYASLGYGPADKTSETVRGTLDAIIQEYATFVRKASGKD |
| Ga0207539_102432 | Ga0207539_1024321 | F032172 | VQIIRGRFGPSGGIVPELDKDGQVVPTGHFNNRLGFHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF |
| Ga0207539_103107 | Ga0207539_1031072 | F066928 | MSPADRYRALAAYLRTCAAREQRPEVRTQWTKLAQCYLRLAEQADQNSRADIVYEFDRSA |
| Ga0207539_103607 | Ga0207539_1036072 | F058224 | MSELLGEVNRSIRDLASTVDPHDSSTWEFVCECGEEGCTERFGLPLARYDELKHTGVALLAPGHPPRRAEG |
| Ga0207539_103979 | Ga0207539_1039791 | F059114 | GARMRPHLIYLVSRAVLLFLSVVAAIASMQATSFGFISVTMSPSREGFLWLSLAIVLVAFAALGIRVALDWRLSYWILFPAVFATLIFATFGTSILSLL |
| Ga0207539_104286 | Ga0207539_1042861 | F030160 | VEQRLPVSVDALRVESYSATDDSIVILLRPKYSTAERAYSIPVECLQDLIIDLRRLSFSAPNAPYEKADSQTEPLLPLELSVAAE |
| Ga0207539_104470 | Ga0207539_1044701 | F070493 | MESEQVQLGSVTFVLAETILRELCAKVTHHLVARHLRDYAGSSDA |
| Ga0207539_104554 | Ga0207539_1045541 | F001757 | RGNFPCPVDIPMKERAIMMRVLKPLQDKATTGPGKKLVLPEPRRVRFLIRGEGSVSAGAVTIECCPESTKAGKFWMELATIPVPDNGIAQYYTEEASGLLRARISQPVSNGAVTVTPLVSRGRPDRPDRTKVV |
| Ga0207539_104606 | Ga0207539_1046061 | F047321 | MKNKCSTRSTDPHYSLFTPRSPAIARRRRLGNSGFINLRALFALLLCFSGVALAILAGRDVSVRRASEPERYMPVPSAKGQSEAVGLERLEQYWHDRLTFPTGRFDPAWVRAAVAQHDRMATG |
| ⦗Top⦘ |