| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026720 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0072084 | Ga0207440 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A2w-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 17816435 |
| Sequencing Scaffolds | 14 |
| Novel Protein Genes | 14 |
| Associated Families | 14 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 4 |
| All Organisms → cellular organisms → Archaea | 4 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | 0 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000466 | Metagenome / Metatranscriptome | 1105 | Y |
| F001757 | Metagenome / Metatranscriptome | 641 | N |
| F005305 | Metagenome / Metatranscriptome | 405 | N |
| F007651 | Metagenome / Metatranscriptome | 347 | N |
| F015500 | Metagenome / Metatranscriptome | 254 | Y |
| F021340 | Metagenome | 219 | Y |
| F026602 | Metagenome / Metatranscriptome | 197 | N |
| F049708 | Metagenome / Metatranscriptome | 146 | Y |
| F055629 | Metagenome / Metatranscriptome | 138 | Y |
| F063987 | Metagenome / Metatranscriptome | 129 | N |
| F068085 | Metagenome | 125 | Y |
| F080668 | Metagenome | 115 | Y |
| F081936 | Metagenome / Metatranscriptome | 114 | N |
| F090038 | Metagenome | 108 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207440_100518 | Not Available | 884 | Open in IMG/M |
| Ga0207440_100682 | All Organisms → cellular organisms → Archaea | 826 | Open in IMG/M |
| Ga0207440_100855 | All Organisms → cellular organisms → Archaea | 772 | Open in IMG/M |
| Ga0207440_100915 | All Organisms → cellular organisms → Archaea | 759 | Open in IMG/M |
| Ga0207440_100932 | All Organisms → cellular organisms → Bacteria | 755 | Open in IMG/M |
| Ga0207440_101100 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 717 | Open in IMG/M |
| Ga0207440_101763 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 614 | Open in IMG/M |
| Ga0207440_101813 | Not Available | 609 | Open in IMG/M |
| Ga0207440_101874 | Not Available | 602 | Open in IMG/M |
| Ga0207440_101980 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → unclassified Nitrosopumilales → Nitrosopumilales archaeon | 591 | Open in IMG/M |
| Ga0207440_102831 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 529 | Open in IMG/M |
| Ga0207440_102993 | Not Available | 518 | Open in IMG/M |
| Ga0207440_103154 | All Organisms → cellular organisms → Archaea | 509 | Open in IMG/M |
| Ga0207440_103323 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 500 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207440_100518 | Ga0207440_1005181 | F049708 | GTVMRVKWVLIGFGILYLVFALSFIPTDKILPRPIEESWELVRDILWAGMTIISIGLLEVMSPLATPFASASGLRRGDVAEILLAVILVAAAVYSALRLRRKDLTPRWRGTHTFFLLLALTLVAIVRFTLYSWSHFA |
| Ga0207440_100682 | Ga0207440_1006821 | F026602 | MEGINEEECNKIFNCKIISEDVLKYPDIVNPFTKNEDIAKTLVNNANDSKIMTEHTCQKLMDVDIVKKKDQKIGEQAPKYLVCLPXDTFALLT |
| Ga0207440_100855 | Ga0207440_1008551 | F005305 | MEDLEKELGPLIENFQNLVKDAKSKKLESLREDEDFKKEFNELSKDVIEPVMRKFESYLESKDVNSSVHIQSQIVAGKNPSIEFSLHLKLTHESRYPNIKFSSSGEKISVQEDRLMTKDEVSQDMIPEYYDKELITEEFVKERLIRLIKSCFDKDWQSFYS |
| Ga0207440_100915 | Ga0207440_1009151 | F090038 | MAFLSKKLSQELLNLDIDEGIRIESNKNIKCKMYINKRTSGYFVLEIEDTNSTNIKEFRFYRDIKPIHRLIDKIFGKQY |
| Ga0207440_100932 | Ga0207440_1009321 | F021340 | LQTFQAQRTIARSNEARSQFCHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQERPATFTWGK |
| Ga0207440_101100 | Ga0207440_1011002 | F001757 | MMRILKPLQDKATTGAGKRLVLPEPRRVRFLIRGEGSVSAGAVTIECCPESTKAGKFWMELATIAVPADYGLAQYYTEEASGLFRARISRPVSNGTVTVTPLVS |
| Ga0207440_101763 | Ga0207440_1017631 | F000466 | KSWKTAPSQGVVKRDWVPTGRIDFATRLNGTLEESDQPSEFKLLVEERRIVESITGNENLEIQWRLATLNEAKAVVAQYHKYLAENALIRSVSDETVSLPPPKKVQKIQETTAA |
| Ga0207440_101813 | Ga0207440_1018131 | F055629 | VGKSLWEDPWFFAAVAPNDGQEQSELCPVTGYACEGDLSYLCEEYGCARKGGLSPRSEEN |
| Ga0207440_101874 | Ga0207440_1018742 | F068085 | MIVLKYILLFTCLGIGLALCVGVISILRSPPDNGPPAWFAAAFGAMFFWGAFALARW |
| Ga0207440_101980 | Ga0207440_1019801 | F007651 | MSNDLNELIARKDKLEGELHHELSSDYNELMKNLSESFRDMHENSVQYYKQKANEELDKMEKNIQSGNKLSAINQKLLADT |
| Ga0207440_102831 | Ga0207440_1028311 | F015500 | GLVEVEVCIVSVEEPPVPIDAGLKPPLVIPVGKPDSLPTLKFTVPVKPLRGVTVTVYVESPPGTTSCAAGPTVMEKSGLVGSTVIVRVGGLGSELPVASMTVSEVVYVPGPPNVMFPGFCAVDVAGFPPGNTQEYFDAVVLVPKLITLPAVIVISPDGAVMAPVGGTRE |
| Ga0207440_102993 | Ga0207440_1029932 | F080668 | RLGFHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF |
| Ga0207440_103154 | Ga0207440_1031541 | F081936 | KVIXLLDRKLNAKKPSPPNVEASVNPKVEQLAHVLEITLIVVPDSADLLNFNFRFLIIYRLKFTKIAIRTELIQVKINASIISLGT |
| Ga0207440_103323 | Ga0207440_1033231 | F063987 | MKFVSRTKSMAVPHQILGASTNEKRELLMCGHSLIARLTIAVFVFQMLGVTSVGHAERPDSTAGTSNAGTRKLFIDPSSTSVALRGKASLIVSPLTHRDGNYVGDYQLKVKPYSFKSEKGSLLLAASDDAV |
| ⦗Top⦘ |