Basic Information | |
---|---|
IMG/M Taxon OID | 3300026940 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0055680 / Ga0207521 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-12 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size (bp) | 21917726 |
Sequencing Scaffolds | 22 |
Novel Protein Genes | 23 |
Associated Families | 23 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2 |
Not Available | 12 |
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
All Organisms → cellular organisms → Archaea | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.2958 |
Longitude (°) | -89.3799 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F005305 | Metagenome / Metatranscriptome | 405 | N |
F010961 | Metagenome / Metatranscriptome | 297 | Y |
F019338 | Metagenome / Metatranscriptome | 230 | Y |
F021340 | Metagenome | 219 | Y |
F031610 | Metagenome / Metatranscriptome | 182 | N |
F033920 | Metagenome | 176 | N |
F038225 | Metagenome / Metatranscriptome | 166 | Y |
F038480 | Metagenome | 166 | Y |
F042319 | Metagenome / Metatranscriptome | 158 | Y |
F051273 | Metagenome / Metatranscriptome | 144 | N |
F051439 | Metagenome / Metatranscriptome | 144 | N |
F052029 | Metagenome | 143 | Y |
F062518 | Metagenome | 130 | N |
F067965 | Metagenome / Metatranscriptome | 125 | Y |
F068796 | Metagenome / Metatranscriptome | 124 | Y |
F071715 | Metagenome | 122 | N |
F073765 | Metagenome / Metatranscriptome | 120 | N |
F083399 | Metagenome | 113 | N |
F085659 | Metagenome / Metatranscriptome | 111 | Y |
F085779 | Metagenome / Metatranscriptome | 111 | N |
F088579 | Metagenome / Metatranscriptome | 109 | Y |
F089374 | Metagenome / Metatranscriptome | 109 | Y |
F095448 | Metagenome / Metatranscriptome | 105 | N |
Scaffold | Taxonomy | Length (bp) | IMG/M Link |
---|---|---|---|
Ga0207521_100118 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1820 | Open in IMG/M |
Ga0207521_100251 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1516 | Open in IMG/M |
Ga0207521_100785 | Not Available | 1040 | Open in IMG/M |
Ga0207521_101103 | Not Available | 918 | Open in IMG/M |
Ga0207521_101152 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 901 | Open in IMG/M |
Ga0207521_101325 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 849 | Open in IMG/M |
Ga0207521_101547 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 799 | Open in IMG/M |
Ga0207521_101720 | Not Available | 764 | Open in IMG/M |
Ga0207521_102055 | Not Available | 710 | Open in IMG/M |
Ga0207521_102155 | Not Available | 695 | Open in IMG/M |
Ga0207521_102217 | Not Available | 687 | Open in IMG/M |
Ga0207521_102367 | Not Available | 671 | Open in IMG/M |
Ga0207521_102394 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 668 | Open in IMG/M |
Ga0207521_102574 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 649 | Open in IMG/M |
Ga0207521_102605 | Not Available | 645 | Open in IMG/M |
Ga0207521_103053 | Not Available | 602 | Open in IMG/M |
Ga0207521_103206 | All Organisms → cellular organisms → Archaea | 591 | Open in IMG/M |
Ga0207521_103551 | Not Available | 563 | Open in IMG/M |
Ga0207521_103648 | Not Available | 557 | Open in IMG/M |
Ga0207521_103883 | Not Available | 543 | Open in IMG/M |
Ga0207521_104109 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 532 | Open in IMG/M |
Ga0207521_104205 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 528 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207521_100118 | Ga0207521_1001183 | F021340 | IARANVARSQFWHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK |
Ga0207521_100251 | Ga0207521_1002511 | F083399 | MAEESAVSGDTRSWGFFATFVLGAIAFLAGQLAGMAALVGWYGFDLRNVPVLSQHGGAIILFIFVSAPVQVAILALAAGYKGNIADYLGYKLPRRGEVVLCLAILAAMIAVGDAMSWLAGRSVVDRFQTDIYHAANSVGQLPLLLAAVVVLIPI |
Ga0207521_100785 | Ga0207521_1007851 | F068796 | VRKTKLLSTVAMALLLGGVAASAQGMGKEPPERAPAAQQSAPAEKVAPSIKAGEQKSPQTTSQAAPDSKPTGKGHETTGQSPKSQATDKPGAMDKDK |
Ga0207521_101103 | Ga0207521_1011034 | F052029 | LALNFRSLLPANPLAQQEINNTGALSEQLVTIGAASL |
Ga0207521_101152 | Ga0207521_1011521 | F051273 | MRRFIPLLILLGLIFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLEGVVNTLKT |
Ga0207521_101325 | Ga0207521_1013251 | F010961 | NKLSFAVEAKMQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATLLFLMLHA |
Ga0207521_101547 | Ga0207521_1015472 | F038480 | TAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACVVSGTYCVDLCH |
Ga0207521_101720 | Ga0207521_1017201 | F062518 | NNVVNSPGASPTGAAVVDPTGPNLTPDTPWVQGGQLVVHTGSSSFTGISWFFLGSESGFQVTFHSPALADFTEGNQNNSAYSGSPPLTVGFLGSKLNVGSGPIQFSLTWATGSVDNSALQPNPGSGVPNLIFSYATFVEPDLLMLTKSVSDIIIFALNDGGADNDHDDFIGAAIVTERADCECALQQATTPIPGALPLFGSVLGGGLLLHRWRKRRSARASRSFTASYLRG |
Ga0207521_102055 | Ga0207521_1020551 | F095448 | GAACEPEANIERGFVMVASPHTRQSDPAIPNIVCPKCGLRMQVAAIEPAGTDDRTVTFGCDCGHRYDLSERAIVALARDSSDRW |
Ga0207521_102155 | Ga0207521_1021552 | F073765 | MRQLIGKIPIKGQRVVEEVYLSEQIRCRECQKTAPIGVEVVTVQKDGPSKKVLKRAFYCRSHAGDYE |
Ga0207521_102217 | Ga0207521_1022171 | F089374 | MTNILDSARASDEEPRLIVRKASHAPIWSVWAVLEGTPSEEIFEGSSE |
Ga0207521_102367 | Ga0207521_1023672 | F051439 | MPYHVRITTDTDGLINDAKPYPTLAAAMQVAGLSLKTGSATTAWIEDVHGNVCANTDDVKKHCGLT |
Ga0207521_102394 | Ga0207521_1023941 | F085659 | VTYDETWDDAPARRLPLAAIAVVVALGALLTSSYVLYRQSRVVDSERSARRAEIGRLEKQVTLLQSRGAALAGRVGSAEKTLKRRDSGIAPLASRVLKSVFTVETNTGLGSGFIAWRDADASYLLTANHVVEGHLIGDVTVSRKGGSWSGEV |
Ga0207521_102574 | Ga0207521_1025741 | F067965 | GSVKVKADRSLQGRKVYLQKFSRFHEWVKVRGVILGNGSAKRFRLGLPFGGHRYAVRIFMSLNQAGAGYLDGFSQTVVVRTPRR |
Ga0207521_102605 | Ga0207521_1026051 | F085779 | MKNSHCLLGTSGVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALELASELSPNQALNGMRPLPESV |
Ga0207521_103053 | Ga0207521_1030531 | F019338 | GSMPAMRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGR |
Ga0207521_103206 | Ga0207521_1032061 | F005305 | MEDLEKELGPLIENFQNLVKDAKSKKLESLREDEDFKKEFNQLSKYVIEPVMRKFESYLESKDVNSSVHIQSQIVAGKNPSIEFSLHLKLTHESRYPNIKFSSSGEKISVQEERLMTKDEVSQDMIPEYYDNELITEEFVKERLIRLIKSCFDKDWQSF |
Ga0207521_103551 | Ga0207521_1035511 | F071715 | MRKLAIGILAAAGVALSVPASAQGVWIGAGPVGVGVGVGPGYY |
Ga0207521_103648 | Ga0207521_1036481 | F038225 | MKRLSLALLGAVGAFFVLTPVQAADYRVIQYNDTKICQVVDMAGPFKPISSKYTVLTKKSLPSFADAMKARADVGAKAKCIL |
Ga0207521_103883 | Ga0207521_1038831 | F033920 | RSPIPYTLTGSEIAAVENGVRSVRRDLDNSAFRGFRATQHEDGQIDVCGWILPTGNLSEQPFIGTLFAGTFAPERIGGNEVDNAQIISDCQNRGARIA |
Ga0207521_104109 | Ga0207521_1041092 | F088579 | ADHARTDTPVVKPETELNGFKVAKQLDPTDHHVRATIVVLEGDKKVVGFIPHPEEVPGIVFA |
Ga0207521_104205 | Ga0207521_1042051 | F042319 | IGIADQSALLSRIAAWCTHFVEGVREGQEIAARYHALSRLSTPELARRGLNRHMIARVALTGY |
Ga0207521_104251 | Ga0207521_1042513 | F031610 | MLAIGRAAGDVAGYPGKITLITIGMGEAAIAANNAIAQIRGEKVQPKYSTD |