| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026774 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0091544 / Ga0207451 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01K1-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size (bp) | 24456696 |
| Sequencing Scaffolds | 18 |
| Novel Protein Genes | 20 |
| Associated Families | 20 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| Not Available | 10 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas → unclassified Sphingomonas → Sphingomonas sp. SE220 | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → unclassified Solirubrobacterales → Solirubrobacterales bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |

| Location Information | |
|---|---|
| Location | Wisconsin, United States |
| Latitude (°) | 43.3 |
| Longitude (°) | -89.38 |
| Altitude (m) | N/A |
| Depth (m) | N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000466 | Metagenome / Metatranscriptome | 1105 | Y |
| F000621 | Metagenome | 981 | Y |
| F002483 | Metagenome / Metatranscriptome | 555 | Y |
| F003059 | Metagenome / Metatranscriptome | 510 | Y |
| F017066 | Metagenome / Metatranscriptome | 243 | Y |
| F017236 | Metagenome / Metatranscriptome | 242 | Y |
| F019338 | Metagenome / Metatranscriptome | 230 | Y |
| F022446 | Metagenome / Metatranscriptome | 214 | Y |
| F025757 | Metagenome | 200 | N |
| F028554 | Metagenome / Metatranscriptome | 191 | N |
| F034564 | Metagenome / Metatranscriptome | 174 | Y |
| F037212 | Metagenome / Metatranscriptome | 168 | Y |
| F051771 | Metagenome / Metatranscriptome | 143 | Y |
| F056824 | Metagenome | 137 | N |
| F063479 | Metagenome / Metatranscriptome | 129 | Y |
| F077412 | Metagenome / Metatranscriptome | 117 | Y |
| F081299 | Metagenome / Metatranscriptome | 114 | Y |
| F085355 | Metagenome / Metatranscriptome | 111 | Y |
| F095123 | Metagenome / Metatranscriptome | 105 | N |
| F099776 | Metagenome | 103 | Y |

| Scaffold | Taxonomy | Length (bp) |
|---|---|---|
| Ga0207451_100037 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1885 |
| Ga0207451_100411 | All Organisms → cellular organisms → Bacteria | 1099 |
| Ga0207451_101456 | Not Available | 791 |
| Ga0207451_101999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 721 |
| Ga0207451_102291 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas → unclassified Sphingomonas → Sphingomonas sp. SE220 | 694 |
| Ga0207451_102317 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 691 |
| Ga0207451_102769 | Not Available | 651 |
| Ga0207451_103481 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → unclassified Solirubrobacterales → Solirubrobacterales bacterium | 607 |
| Ga0207451_103603 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 601 |
| Ga0207451_103669 | Not Available | 597 |
| Ga0207451_104110 | Not Available | 577 |
| Ga0207451_104141 | Not Available | 575 |
| Ga0207451_104446 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 563 |
| Ga0207451_104577 | Not Available | 557 |
| Ga0207451_104927 | Not Available | 543 |
| Ga0207451_105229 | Not Available | 533 |
| Ga0207451_105282 | Not Available | 532 |
| Ga0207451_105953 | Not Available | 510 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207451_100037 | Ga0207451_1000371 | F034564 | MNRSGRLSGYWYGAVCIAGLGFGLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFL |
| Ga0207451_100411 | Ga0207451_1004112 | F099776 | MSTLAERRRHVHSHVFKTAQIVVAERAPTIECRARDLSGYGARLCLSTTYGLPQQFDVIIDGKRRSVRPVWMTYTEMGVMFAEASQKSADLVECERDIASLIELLKMAEEKWPSSESYEISETEMLCRDQALLDMWPEACRRIGFSKREFPIDVIKLWQKQMGWPN |
| Ga0207451_101456 | Ga0207451_1014562 | F025757 | DTYRPSFCAQLIENHRLCGFSRPHRTEGHPPFKAIESTVRVGLALSATRAPMSYSASLSLFWLEMAVLIGCVALSIGMRSRPAMWVALGIVAHCAMWLAMHDEEILIRLVASTLVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA |
| Ga0207451_101999 | Ga0207451_1019991 | F003059 | MVRAGIAAGMLAGAAIVMAAMPAAAQVRDAVYRGTLICDKLPFSAGKGREAIEVTIAGGAVRYSHVVRLRDAAEPVSEQGKGSLNGQDIELQGSWKAGNRQYEAKYSGAFVRRHADLKGTQTWTDGGKSFTRACTGTIKRPFRVFLPGEKK |
| Ga0207451_102291 | Ga0207451_1022911 | F051771 | VGVSAAFAVDNVRDARRDEVRRQAVYRALDLELRQMAETHGPVFQREMTAELAQWDQAVAHGERPVPPAFRLLGAERPPTGVWDAAVATGSIELVDPELFYELARFYNRANSAGILYQRYSAGAEAHVWPYIDDGPQAFWDSSGKLRPEIKAHVQRLRDFHDRQGELGREARDLRMKIERAERG |
| Ga0207451_102317 | Ga0207451_1023171 | F000466 | KTAPSQGVVKRDWVPTGRIDFATRLNGTLEESDQPSEFKLLVEERRIVESITGNENLEIQWRLATLNEAKAVVAQYHKYLAENALIRSVSDETVSLPPPKKVQKIQETTAA |
| Ga0207451_102769 | Ga0207451_1027692 | F028554 | GQMRKAKQVRNRALSAGEGRRAIIVTAALTGLFLLAAFVTAGSFLSTDPQAMSSVAKVTPLPRTEGGEAASRVASIVMETDKKGRCEERQFDNRTGKMVSANYVNCDARLEPERDTTPSENINRERIRAILGAFKK |
| Ga0207451_103481 | Ga0207451_1034811 | F002483 | DAELERLRELPAFADGPLVERRPQLKVRRASRKPNRLGFAVPSEFRLSVTAYPGIRPGDVLETLLHELVHLHVGRAKEAHAWHGPTFKHVLKRAMREAYGIAIPTPRSTLHGPYAEAIEASIRRDRKNPP |
| Ga0207451_103481 | Ga0207451_1034812 | F017066 | MTLPLAHHSALVALPVFAPALVVILVLLVHRLREGRRWDEEEANGNN |
| Ga0207451_103603 | Ga0207451_1036031 | F037212 | MTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLR |
| Ga0207451_103669 | Ga0207451_1036691 | F017236 | MQHLPKQAVFLGRGLQARFDCCYQPLPDELNVLLWFLDGAERRGRIQATLNARQRVKLDLPADVHWMPADQVPEARADWRGHERHFESLRRAIDFVMQELTIADRANVTITTEDGNLTIEQIEKL |
| Ga0207451_104110 | Ga0207451_1041101 | F077412 | MPVAAITGLGKTVLLIVAITFIAWALITAIFVPKRSPGFPIRLDAYILVSVLLFIAQMSAVVWVTGTQETEEETHAAE |
| Ga0207451_104141 | Ga0207451_1041411 | F000621 | MPTTKHELLDWLMDVPEDAEIGTDGAGLALLAILGTNVHLLEIGRIPNADELYAEAINQAMIERLRRIHAAGGETETGVIIVTFQGFISGGPRLFSTDFNTAFIFKNTEQAESFIKEFADELHNPQILDCP |
| Ga0207451_104446 | Ga0207451_1044462 | F095123 | VVGGLMSYASLVTCEGAMNALSREGCRHFEAFDQADEDFDRDIYKVEDPVVKESAGALYDRMWGSYGREVVRARAEAARAQVTLSFTRCLLDAVCACVC |
| Ga0207451_104577 | Ga0207451_1045772 | F019338 | AMRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK |
| Ga0207451_104927 | Ga0207451_1049271 | F085355 | MDSLKAALTSVKDLLDWLPDLVVALLILAIAVLFALALHRWARKLFRRAIAGRYPFVFSVFT |
| Ga0207451_105229 | Ga0207451_1052292 | F022446 | MSAALPSPPAPSPRGRNPLPVEALQNHLGGLLREREELREAAFPLALERNRREIVRAQWELSHALIERYASG |
| Ga0207451_105282 | Ga0207451_1052821 | F056824 | MPRYQFLNIETFHLIIAEAKEAAWQLGLDEGPVVDATRDVNFGSPSTILVEDRIYE |
| Ga0207451_105498 | Ga0207451_1054981 | F063479 | MSARNWAGTHWIARSTRGGGRPKSGEVDLGPPVKSGRVRGLGKLHGLLAELAEALVGLEGGWSGLATAAVALAAMAGG |
| Ga0207451_105953 | Ga0207451_1059531 | F081299 | DRFLAGEPFGLDVALAGSVAAPSLHLEVRDASGLLVAEDLVETARLGWDPAGDGLGLRLDVGAPPLQFGRFEVTLALIGDDGRLLDRLARPIPLLVYPDDESRGLVRLEGTWRRGPNEAE |