Basic Information | |
---|---|
IMG/M Taxon OID | 3300027397 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055665 | Ga0207463 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-11 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 24334690 |
Sequencing Scaffolds | 13 |
Novel Protein Genes | 13 |
Associated Families | 13 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1 |
All Organisms → cellular organisms → Archaea | 1 |
Not Available | 5 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | Wisconsin, United States | |||||||
Coordinates | Lat. (o) | 43.2958 | Long. (o) | -89.3799 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000569 | Metagenome / Metatranscriptome | 1018 | Y |
F002896 | Metagenome / Metatranscriptome | 522 | N |
F034564 | Metagenome / Metatranscriptome | 174 | Y |
F034995 | Metagenome | 173 | N |
F042319 | Metagenome / Metatranscriptome | 158 | Y |
F052029 | Metagenome | 143 | Y |
F055839 | Metagenome / Metatranscriptome | 138 | Y |
F059558 | Metagenome / Metatranscriptome | 133 | Y |
F067154 | Metagenome / Metatranscriptome | 126 | Y |
F071766 | Metagenome | 122 | Y |
F089138 | Metagenome | 109 | N |
F090480 | Metagenome / Metatranscriptome | 108 | Y |
F097297 | Metagenome / Metatranscriptome | 104 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0207463_100024 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1951 | Open in IMG/M |
Ga0207463_100359 | All Organisms → cellular organisms → Archaea | 1116 | Open in IMG/M |
Ga0207463_100556 | Not Available | 994 | Open in IMG/M |
Ga0207463_100631 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 963 | Open in IMG/M |
Ga0207463_100716 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 926 | Open in IMG/M |
Ga0207463_100903 | Not Available | 861 | Open in IMG/M |
Ga0207463_100944 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae | 850 | Open in IMG/M |
Ga0207463_101056 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 818 | Open in IMG/M |
Ga0207463_101256 | Not Available | 769 | Open in IMG/M |
Ga0207463_101537 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 721 | Open in IMG/M |
Ga0207463_101661 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici | 702 | Open in IMG/M |
Ga0207463_101872 | Not Available | 672 | Open in IMG/M |
Ga0207463_102361 | Not Available | 624 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207463_100024 | Ga0207463_1000244 | F055839 | MRVVESMKHLVLASALILAGSTAAKPSDINFGETRTRFQIQNELTAWERLHPWDVDWRHTTLWQHGRALRTEFAPPGCTITRLVTTRSGT |
Ga0207463_100359 | Ga0207463_1003591 | F002896 | MDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGEISPVEARKYLVNRKQDECRVSYDHNNVEYILWVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDKNPRLCKEGTIFHQDLYNECVKTFKDLLIHSDAQHN |
Ga0207463_100556 | Ga0207463_1005561 | F034995 | IAGIFLIGFPAWMMKEMSVPLPEYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFGLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYRPPGSAGLGTPRS |
Ga0207463_100631 | Ga0207463_1006313 | F052029 | ALNFRSILPVDSVAEQEINNTGALSEQFVTIGAASLALLVVAAIAVLIGMA |
Ga0207463_100716 | Ga0207463_1007161 | F059558 | EAFYHPTDGLLLISSLKGEVKNKHNRNGWIFSCFAAHLFEQDLF |
Ga0207463_100903 | Ga0207463_1009031 | F097297 | MLVTERMRSLGDHAKSSRQPPLKERKLTKSDASAATQDVINEKEFKKGAAHESAAVVIVDNPAPQTVVHENRNAAVAEITPDEEAAVLATVAAVLAKIDPQPPTADDNVTRPEPVREDAELSAAKRAIIHEWESWSALHSDELEDPNTSEYFFRHLQLKKADLLIFGLQNDLDAVRALIASQLR |
Ga0207463_100944 | Ga0207463_1009442 | F042319 | YQTTTIGIADQSALLSRIAAWCTHFVEGVREGQEIAARYHALAQLSSPELARRGLNRQTIARAALTSY |
Ga0207463_101056 | Ga0207463_1010561 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFLAIPYLVLPFVGALAAYMSRRMKGSVLERIESALFPVFAFVALFAVRIVYGLFFEGQPYTLPHFLGGFAVTLVFIVVGGLLLVLGAWPFCRPHLREQLP |
Ga0207463_101256 | Ga0207463_1012561 | F090480 | VDCKMNKIFACIVFTATSGFSFAVATAMPLVPMGTEQARLTIPVADGCGFNRYRDARGICRKKYVITRHQGRQPVYTGCGGLNSHRVCNLYGHCWMVCD |
Ga0207463_101537 | Ga0207463_1015371 | F089138 | MLKEGMAKAEIVTVLRRYAYDINRISALMPDGGTAKDAQSRLKQLKDAIHSDYKHRNAITRSAQLTPLEQSNLASAILDVFSALQSIGVNTTPGREWRNALFGADMYIQRYLNELRDSEKSVESDTTEWL |
Ga0207463_101661 | Ga0207463_1016611 | F034564 | LLSHSLTVRRGIIWCDFPRHNGRTQTGGMNRSGRLSGYWYGAVCIAGLGFGLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFLAGNFIAAHLIGR |
Ga0207463_101872 | Ga0207463_1018721 | F067154 | SQAGSDRPPRDASVVARSILLVVAAGSVIATSNYFAYMIGRKQVTYEELRRQVDLVAQFIDQKRVEEEPAPVALKQRQERLEQNASGLTGLTTSTINGKTTAIAPERAPAIPAALFDDTAVARPDHEVRSQPPNAITKHRDMKRRQKVSPASAAAKRAPAEPAIAAQPGQSTTATATPDVGLAGQ |
Ga0207463_102361 | Ga0207463_1023611 | F071766 | MDEDFITLEVEEEGRGRLQFELPLDITDEEIAYIKSLVPRAESGLLELLDPDTGEVVFSC |
⦗Top⦘ |