| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026940 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0055680 / Ga0207521 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 21917726 |
| Sequencing Scaffolds | 22 |
| Novel Protein Genes | 23 |
| Associated Families | 23 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2 |
| Not Available | 12 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
| All Organisms → cellular organisms → Archaea | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | |
|---|---|
| Location | Wisconsin, United States |
| Latitude (°) | 43.2958 |
| Longitude (°) | -89.3799 |
| Altitude (m) | N/A |
| Depth (m) | N/A |
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F005305 | Metagenome / Metatranscriptome | 405 | N |
| F010961 | Metagenome / Metatranscriptome | 297 | Y |
| F019338 | Metagenome / Metatranscriptome | 230 | Y |
| F021340 | Metagenome | 219 | Y |
| F031610 | Metagenome / Metatranscriptome | 182 | N |
| F033920 | Metagenome | 176 | N |
| F038225 | Metagenome / Metatranscriptome | 166 | Y |
| F038480 | Metagenome | 166 | Y |
| F042319 | Metagenome / Metatranscriptome | 158 | Y |
| F051273 | Metagenome / Metatranscriptome | 144 | N |
| F051439 | Metagenome / Metatranscriptome | 144 | N |
| F052029 | Metagenome | 143 | Y |
| F062518 | Metagenome | 130 | N |
| F067965 | Metagenome / Metatranscriptome | 125 | Y |
| F068796 | Metagenome / Metatranscriptome | 124 | Y |
| F071715 | Metagenome | 122 | N |
| F073765 | Metagenome / Metatranscriptome | 120 | N |
| F083399 | Metagenome | 113 | N |
| F085659 | Metagenome / Metatranscriptome | 111 | Y |
| F085779 | Metagenome / Metatranscriptome | 111 | N |
| F088579 | Metagenome / Metatranscriptome | 109 | Y |
| F089374 | Metagenome / Metatranscriptome | 109 | Y |
| F095448 | Metagenome / Metatranscriptome | 105 | N |
| Scaffold | Taxonomy | Length |
|---|---|---|
| Ga0207521_100118 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1820 |
| Ga0207521_100251 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1516 |
| Ga0207521_100785 | Not Available | 1040 |
| Ga0207521_101103 | Not Available | 918 |
| Ga0207521_101152 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 901 |
| Ga0207521_101325 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 849 |
| Ga0207521_101547 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 799 |
| Ga0207521_101720 | Not Available | 764 |
| Ga0207521_102055 | Not Available | 710 |
| Ga0207521_102155 | Not Available | 695 |
| Ga0207521_102217 | Not Available | 687 |
| Ga0207521_102367 | Not Available | 671 |
| Ga0207521_102394 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 668 |
| Ga0207521_102574 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 649 |
| Ga0207521_102605 | Not Available | 645 |
| Ga0207521_103053 | Not Available | 602 |
| Ga0207521_103206 | All Organisms → cellular organisms → Archaea | 591 |
| Ga0207521_103551 | Not Available | 563 |
| Ga0207521_103648 | Not Available | 557 |
| Ga0207521_103883 | Not Available | 543 |
| Ga0207521_104109 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 532 |
| Ga0207521_104205 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 528 |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207521_100118 | Ga0207521_1001183 | F021340 | IARANVARSQFWHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK |
| Ga0207521_100251 | Ga0207521_1002511 | F083399 | MAEESAVSGDTRSWGFFATFVLGAIAFLAGQLAGMAALVGWYGFDLRNVPVLSQHGGAIILFIFVSAPVQVAILALAAGYKGNIADYLGYKLPRRGEVVLCLAILAAMIAVGDAMSWLAGRSVVDRFQTDIYHAANSVGQLPLLLAAVVVLIPI |
| Ga0207521_100785 | Ga0207521_1007851 | F068796 | VRKTKLLSTVAMALLLGGVAASAQGMGKEPPERAPAAQQSAPAEKVAPSIKAGEQKSPQTTSQAAPDSKPTGKGHETTGQSPKSQATDKPGAMDKDK |
| Ga0207521_101103 | Ga0207521_1011034 | F052029 | LALNFRSLLPANPLAQQEINNTGALSEQLVTIGAASL |
| Ga0207521_101152 | Ga0207521_1011521 | F051273 | MRRFIPLLILLGLIFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLEGVVNTLKT |
| Ga0207521_101325 | Ga0207521_1013251 | F010961 | NKLSFAVEAKMQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATLLFLMLHA |
| Ga0207521_101547 | Ga0207521_1015472 | F038480 | TAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACVVSGTYCVDLCH |
| Ga0207521_101720 | Ga0207521_1017201 | F062518 | NNVVNSPGASPTGAAVVDPTGPNLTPDTPWVQGGQLVVHTGSSSFTGISWFFLGSESGFQVTFHSPALADFTEGNQNNSAYSGSPPLTVGFLGSKLNVGSGPIQFSLTWATGSVDNSALQPNPGSGVPNLIFSYATFVEPDLLMLTKSVSDIIIFALNDGGADNDHDDFIGAAIVTERADCECALQQATTPIPGALPLFGSVLGGGLLLHRWRKRRSARASRSFTASYLRG |
| Ga0207521_102055 | Ga0207521_1020551 | F095448 | GAACEPEANIERGFVMVASPHTRQSDPAIPNIVCPKCGLRMQVAAIEPAGTDDRTVTFGCDCGHRYDLSERAIVALARDSSDRW |
| Ga0207521_102155 | Ga0207521_1021552 | F073765 | MRQLIGKIPIKGQRVVEEVYLSEQIRCRECQKTAPIGVEVVTVQKDGPSKKVLKRAFYCRSHAGDYE |
| Ga0207521_102217 | Ga0207521_1022171 | F089374 | MTNILDSARASDEEPRLIVRKASHAPIWSVWAVLEGTPSEEIFEGSSE |
| Ga0207521_102367 | Ga0207521_1023672 | F051439 | MPYHVRITTDTDGLINDAKPYPTLAAAMQVAGLSLKTGSATTAWIEDVHGNVCANTDDVKKHCGLT |
| Ga0207521_102394 | Ga0207521_1023941 | F085659 | VTYDETWDDAPARRLPLAAIAVVVALGALLTSSYVLYRQSRVVDSERSARRAEIGRLEKQVTLLQSRGAALAGRVGSAEKTLKRRDSGIAPLASRVLKSVFTVETNTGLGSGFIAWRDADASYLLTANHVVEGHLIGDVTVSRKGGSWSGEV |
| Ga0207521_102574 | Ga0207521_1025741 | F067965 | GSVKVKADRSLQGRKVYLQKFSRFHEWVKVRGVILGNGSAKRFRLGLPFGGHRYAVRIFMSLNQAGAGYLDGFSQTVVVRTPRR |
| Ga0207521_102605 | Ga0207521_1026051 | F085779 | MKNSHCLLGTSGVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALELASELSPNQALNGMRPLPESV |
| Ga0207521_103053 | Ga0207521_1030531 | F019338 | GSMPAMRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGR |
| Ga0207521_103206 | Ga0207521_1032061 | F005305 | MEDLEKELGPLIENFQNLVKDAKSKKLESLREDEDFKKEFNQLSKYVIEPVMRKFESYLESKDVNSSVHIQSQIVAGKNPSIEFSLHLKLTHESRYPNIKFSSSGEKISVQEERLMTKDEVSQDMIPEYYDNELITEEFVKERLIRLIKSCFDKDWQSF |
| Ga0207521_103551 | Ga0207521_1035511 | F071715 | MRKLAIGILAAAGVALSVPASAQGVWIGAGPVGVGVGVGPGYY |
| Ga0207521_103648 | Ga0207521_1036481 | F038225 | MKRLSLALLGAVGAFFVLTPVQAADYRVIQYNDTKICQVVDMAGPFKPISSKYTVLTKKSLPSFADAMKARADVGAKAKCIL |
| Ga0207521_103883 | Ga0207521_1038831 | F033920 | RSPIPYTLTGSEIAAVENGVRSVRRDLDNSAFRGFRATQHEDGQIDVCGWILPTGNLSEQPFIGTLFAGTFAPERIGGNEVDNAQIISDCQNRGARIA |
| Ga0207521_104109 | Ga0207521_1041092 | F088579 | ADHARTDTPVVKPETELNGFKVAKQLDPTDHHVRATIVVLEGDKKVVGFIPHPEEVPGIVFA |
| Ga0207521_104205 | Ga0207521_1042051 | F042319 | IGIADQSALLSRIAAWCTHFVEGVREGQEIAARYHALSRLSTPELARRGLNRHMIARVALTGY |
| Ga0207521_104251 | Ga0207521_1042513 | F031610 | MLAIGRAAGDVAGYPGKITLITIGMGEAAIAANNAIAQIRGEKVQPKYSTD |