| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026784 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0091549 | Ga0207453 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01.2K2-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 23738785 |
| Sequencing Scaffolds | 10 |
| Novel Protein Genes | 10 |
| Associated Families | 10 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 2 |
| All Organisms → cellular organisms → Bacteria | 1 |
| Not Available | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F002471 | Metagenome / Metatranscriptome | 556 | Y |
| F003406 | Metagenome / Metatranscriptome | 488 | Y |
| F007409 | Metagenome / Metatranscriptome | 351 | Y |
| F017166 | Metagenome / Metatranscriptome | 242 | Y |
| F034688 | Metagenome / Metatranscriptome | 174 | Y |
| F037171 | Metagenome / Metatranscriptome | 168 | N |
| F054170 | Metagenome | 140 | Y |
| F055629 | Metagenome / Metatranscriptome | 138 | Y |
| F060201 | Metagenome / Metatranscriptome | 133 | Y |
| F063479 | Metagenome / Metatranscriptome | 129 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207453_100006 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta | 8219 | Open in IMG/M |
| Ga0207453_100024 | All Organisms → Viruses → Predicted Viral | 2973 | Open in IMG/M |
| Ga0207453_100040 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 1798 | Open in IMG/M |
| Ga0207453_100586 | All Organisms → cellular organisms → Bacteria | 853 | Open in IMG/M |
| Ga0207453_100658 | Not Available | 827 | Open in IMG/M |
| Ga0207453_100982 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 755 | Open in IMG/M |
| Ga0207453_101292 | Not Available | 706 | Open in IMG/M |
| Ga0207453_101722 | Not Available | 652 | Open in IMG/M |
| Ga0207453_102107 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 620 | Open in IMG/M |
| Ga0207453_103469 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 542 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207453_100006 | Ga0207453_1000065 | F037171 | MYVCARIENDPVEEPEEFAGEAPEQQSVGGGKCPLTYLCPIHSLIHLPHYTFMPKD |
| Ga0207453_100024 | Ga0207453_1000242 | F002471 | IDACSEHIASIAKLNDEMASLNAQLKASKSDFDKLKFARDAYTIGRHPSIKDGLGYKKEAKNLTSHKAPISAKEKGKAPMASSVQKNHAFMYHDRRQSRNAYRSCNAYNAFDSHAMFASSSSYVHDRNVGRKNVVHNMPRRNVVNAHRKAHGPSTIYHALNASFAICRKDRKIVARKLGAKCKGDKTCIWVPKDICTNLVGPNMSWVPKTQA |
| Ga0207453_100040 | Ga0207453_1000403 | F007409 | MSEAAKKAAAEMKLSLDEEKNLGFLIAMSKTNTEKITREILEGLSEDTGDSDSYDVDSGGEDSEDRPWRPSHSVYGKSTI |
| Ga0207453_100586 | Ga0207453_1005862 | F055629 | MEAVVGKSLWEDPWFFAAVAPNGGQEQREPCPVTGYACEGDLSYLCEEYGCARKGG |
| Ga0207453_100658 | Ga0207453_1006581 | F063479 | MSARNWAGTHWIARSTRGGGRPKSGEVDLGPPVKSGRVRGLGELHGLLAELAEALAGLEGGWSGLATVAVALAAMAGGIELAGAKERWLADEGEC |
| Ga0207453_100982 | Ga0207453_1009821 | F034688 | FFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLRDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKRKQERPATFTWGK |
| Ga0207453_101292 | Ga0207453_1012922 | F054170 | MNVTIGFQETEEILSWDVSDEALESAGTAGQEIAGAYTLQFCTSQDCALVS |
| Ga0207453_101722 | Ga0207453_1017222 | F017166 | LLALAVMVFTGWLYLGEKDAVRSTKADVAAAADRTAVALSQSADPERNTDADAEDVFKKHVQTPSALEDLAVKQSVESISAGRLRQSVKISARARTTLSEFFSMQGAEIEITATHDFDRK |
| Ga0207453_102107 | Ga0207453_1021071 | F003406 | LKTREDIKGIIMRPIWQSFGLRRPKVEMNEAAEECQRAFGVICSFIGTRDLVQEHIAFRVWPLAEKWEMPQETIKEADEGGLIRLKYTFKFGDKFVEPDDDWLKSIENLSDELLGAYSKAEDTAMSAAFGGRKKKRLNRVFDAIGFVYPDYCYPIRRQKRKNTTSAKEETAAAPSEPEPKRKKIKVLTYRPRYIEPASVPEFTGET |
| Ga0207453_103469 | Ga0207453_1034691 | F060201 | RGPHGDALAALLQLGPRTRLDASTLSAWIASVEASARQAGGGWSAPALSLADLDVEFPVEHDALAAIFANLLRNAQAAVAGQEDGRVIVRIDRARDVTGRQEVSLEMGDSATTPLSLETIEARESGRGLAIVRDLVREWRGHLVVRPEAVPFTKVVGACFPL |
| ⦗Top⦘ |