Basic Information | |
---|---|
IMG/M Taxon OID | 3300026767 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0072051 / Ga0207469 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A1-11 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 11614497 |
Sequencing Scaffolds | 12 |
Novel Protein Genes | 12 |
Associated Families | 12 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
Not Available | 4 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → unclassified Pseudorhodoplanes → Pseudorhodoplanes sp. | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → cellular organisms → Archaea | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | 0 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F010126 | Metagenome / Metatranscriptome | 308 | Y |
F015505 | Metagenome | 254 | Y |
F019338 | Metagenome / Metatranscriptome | 230 | Y |
F030160 | Metagenome / Metatranscriptome | 186 | Y |
F038480 | Metagenome | 166 | Y |
F040398 | Metagenome / Metatranscriptome | 162 | Y |
F054151 | Metagenome / Metatranscriptome | 140 | N |
F057709 | Metagenome | 136 | Y |
F060880 | Metagenome / Metatranscriptome | 132 | N |
F067149 | Metagenome / Metatranscriptome | 126 | N |
F071715 | Metagenome | 122 | N |
F089166 | Metagenome / Metatranscriptome | 109 | Y |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0207469_100188 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 988 |
Ga0207469_100237 | Not Available | 928 |
Ga0207469_100392 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → unclassified Pseudorhodoplanes → Pseudorhodoplanes sp. | 821 |
Ga0207469_100463 | Not Available | 788 |
Ga0207469_100471 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 786 |
Ga0207469_100580 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 745 |
Ga0207469_100723 | All Organisms → cellular organisms → Bacteria | 700 |
Ga0207469_100730 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 697 |
Ga0207469_100844 | Not Available | 667 |
Ga0207469_101031 | All Organisms → cellular organisms → Archaea | 627 |
Ga0207469_101048 | All Organisms → cellular organisms → Bacteria | 623 |
Ga0207469_102145 | Not Available | 500 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207469_100188 | Ga0207469_1001881 | F054151 | MRKADQYTLADHFRALADGLSVRAVTERAPAQRAELQRLAECYAELAKQQSPADHFARGVGPR |
Ga0207469_100237 | Ga0207469_1002371 | F030160 | MASSGAKLAGTMSEKVEQRVSVSVVAVRAESYSAADDSIVISLRTKYSTAERAYSIPVECLQDLIVGLRQLSYLHPQWRLKKPTGRTESLLPLEPPVAVD |
Ga0207469_100392 | Ga0207469_1003923 | F015505 | MGTINDQTREVSSGTYFAHLFKLDADQYQLLSQKPRFHRDCEYLNRPYELSDLMRQASDAGVWPR |
Ga0207469_100463 | Ga0207469_1004631 | F071715 | MRKLAIGILAAAGVALSVPASAQGVWIGAGPVGVGVGVGPGYYG |
Ga0207469_100471 | Ga0207469_1004711 | F019338 | GSHYGFDADRQWRPIFRLGVLGSMPAMRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK |
Ga0207469_100580 | Ga0207469_1005802 | F038480 | MKTVLVSIGLCLAATAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPF |
Ga0207469_100723 | Ga0207469_1007231 | F040398 | GKVLSERRDDLVKFVAAEMDAYKFALANRAETIKVSQEMTHAKPDDKRAEFITDEAIKDKQIDPTLSIPLDRLDWMQNLFLKAGVIKQTVPIESIVDKSVNADAAKIAGK |
Ga0207469_100730 | Ga0207469_1007301 | F067149 | MRKTIILAAATLLLACVTVVTGVARTIEGPSANLASLVVSPHTIGGLT |
Ga0207469_100844 | Ga0207469_1008442 | F057709 | MPEFDLKVALIIFVTKVIDPFAALPALVAGYFCRTWWQVVISAAVVGIFVEMVLVLFEPTPGIH |
Ga0207469_101031 | Ga0207469_1010311 | F010126 | GLISTALLFQIMISSNLVVAIESSESNTNKISSGSELESIVTNGSSTNQLAYRNSQHGIFMLFPSNWTFSDSGLPEYTQVAAFYGPLQNLSDPIPPRLTITVMNYQQNVSLKDFTNMTLSSLNQTNQVKILSSEPTTLAGQPGYQVVFSTLPNIGNPVSFEIMHSWISMGKKIYVLQYSAESSKFDTYLPTIKQILQSLRIDTTR |
Ga0207469_101048 | Ga0207469_1010481 | F060880 | MHTSADFRKTQRRRAELHSRSEAMVATIWLVFYVLGIAVAVSSPIISRAIELAAQ |
Ga0207469_102145 | Ga0207469_1021452 | F089166 | GYWAMVDALEAERNAAKKSIAVEDPSFTCEPGAEEAEPSSALRVQRTVDEPAP |