| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026812 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055861 | Ga0207518 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-10 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 29692333 |
| Sequencing Scaffolds | 22 |
| Novel Protein Genes | 25 |
| Associated Families | 25 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria | 3 |
| Not Available | 8 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.2958 | Long. (o) | -89.3799 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001033 | Metagenome / Metatranscriptome | 799 | Y |
| F001436 | Metagenome / Metatranscriptome | 695 | Y |
| F003758 | Metagenome / Metatranscriptome | 470 | Y |
| F004886 | Metagenome / Metatranscriptome | 420 | Y |
| F014308 | Metagenome / Metatranscriptome | 264 | Y |
| F015418 | Metagenome / Metatranscriptome | 255 | Y |
| F015863 | Metagenome / Metatranscriptome | 251 | Y |
| F018801 | Metagenome / Metatranscriptome | 233 | Y |
| F020952 | Metagenome / Metatranscriptome | 221 | Y |
| F024822 | Metagenome / Metatranscriptome | 204 | N |
| F037759 | Metagenome / Metatranscriptome | 167 | N |
| F040358 | Metagenome / Metatranscriptome | 162 | N |
| F041232 | Metagenome | 160 | N |
| F050714 | Metagenome | 145 | N |
| F053375 | Metagenome | 141 | N |
| F053376 | Metagenome / Metatranscriptome | 141 | Y |
| F058266 | Metagenome | 135 | N |
| F060887 | Metagenome / Metatranscriptome | 132 | Y |
| F073679 | Metagenome / Metatranscriptome | 120 | Y |
| F082749 | Metagenome / Metatranscriptome | 113 | Y |
| F085279 | Metagenome / Metatranscriptome | 111 | Y |
| F087349 | Metagenome / Metatranscriptome | 110 | Y |
| F090709 | Metagenome | 108 | Y |
| F094081 | Metagenome / Metatranscriptome | 106 | Y |
| F103330 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207518_100041 | All Organisms → cellular organisms → Bacteria | 1729 | Open in IMG/M |
| Ga0207518_100246 | Not Available | 1184 | Open in IMG/M |
| Ga0207518_100724 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 886 | Open in IMG/M |
| Ga0207518_100819 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 855 | Open in IMG/M |
| Ga0207518_101156 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 776 | Open in IMG/M |
| Ga0207518_101207 | Not Available | 767 | Open in IMG/M |
| Ga0207518_101308 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 748 | Open in IMG/M |
| Ga0207518_101402 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 733 | Open in IMG/M |
| Ga0207518_101406 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 732 | Open in IMG/M |
| Ga0207518_101603 | Not Available | 708 | Open in IMG/M |
| Ga0207518_101822 | All Organisms → cellular organisms → Bacteria | 683 | Open in IMG/M |
| Ga0207518_101850 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 680 | Open in IMG/M |
| Ga0207518_102392 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 634 | Open in IMG/M |
| Ga0207518_102635 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 617 | Open in IMG/M |
| Ga0207518_103009 | Not Available | 597 | Open in IMG/M |
| Ga0207518_103288 | Not Available | 583 | Open in IMG/M |
| Ga0207518_103508 | All Organisms → cellular organisms → Bacteria | 574 | Open in IMG/M |
| Ga0207518_103951 | Not Available | 555 | Open in IMG/M |
| Ga0207518_104221 | Not Available | 545 | Open in IMG/M |
| Ga0207518_104469 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 537 | Open in IMG/M |
| Ga0207518_104485 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 537 | Open in IMG/M |
| Ga0207518_104650 | Not Available | 532 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207518_100041 | Ga0207518_1000411 | F018801 | MGLTILSIPFIALGVFLKPYALENSRCIGFAGLGPYCFEQASSMPEVIKYGSVAVGFALL |
| Ga0207518_100246 | Ga0207518_1002462 | F090709 | NRHAIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK |
| Ga0207518_100724 | Ga0207518_1007242 | F040358 | MRIAIFALAILAGSGAAEARQVEVVSTSPRHIEIAAWCTAGSNCQQEASDVAQGYCHDPYPRRALYVRSGFVERGFFGGRVIFVYRCNRRSINCEAGSCN |
| Ga0207518_100819 | Ga0207518_1008191 | F085279 | AYGLGVGQVLDFLEVPAASVVERRELPGVDPRLVKAVGLESNRHFILLDVDAVFAPIIGS |
| Ga0207518_101156 | Ga0207518_1011561 | F053375 | MHVTPLSAATILAFASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDVRNGLGIKVDVS |
| Ga0207518_101207 | Ga0207518_1012072 | F053376 | MLTLRTLLTSGGEPYQYEVIGLPNGSGAMLARLSGVWKVLKIERGKGHAWNGSFASAEDALGSLDGLVKGALLVPVRS |
| Ga0207518_101308 | Ga0207518_1013082 | F024822 | VRQLFIRVAGVILCALALSGCVDSAGPLLSEAQPVLGERLRLQFYSLSKGTADEPEQATYKWDRGTYQRTGGGMTDIGSFSVHPLARDIFVVQSASAKRPGMFEYAIARRLVDGVYQVIAVDEADAGQVTRARFCKRASDSSCRIEKRNQL |
| Ga0207518_101402 | Ga0207518_1014022 | F014308 | LIFPPELEVAFMSGSTIGIQLIMPKTGTSPKATPEQQASEREAATQPVVKAPPPPGMGKIVDKVA |
| Ga0207518_101406 | Ga0207518_1014062 | F020952 | MPIIEKTIRIAAIALAFLFIGVSLLGIFGAWFVDGKATDVALKGFGLI |
| Ga0207518_101603 | Ga0207518_1016031 | F015418 | MSIESRRTFTKGLLASALVPGTSAFGQPNDPASIAIIDTPQNAAKVAAKLAAQNVKVVVRFFAR |
| Ga0207518_101822 | Ga0207518_1018221 | F050714 | MMRVLKPLQDKATTGAGKRLVLPEPRRVRFLIRGEGSI |
| Ga0207518_101850 | Ga0207518_1018501 | F037759 | LLSGAARAQSPELGAPSIGILPPSDILASVSYLGLDPSGEPVRRGAYYMLHAFDRAGIELLVVVDAQFGDVLFMAPALNTSLTPPYTRAARIIQVAPESGDQQKR |
| Ga0207518_102346 | Ga0207518_1023462 | F087349 | LEYGMSWSETMKGNATVASLEHVANDESVFRPTQATSWDPFEVWLTRIKQPRDRAAKSALADTSSGKGRV |
| Ga0207518_102392 | Ga0207518_1023921 | F073679 | MASEGREYFLARAAEERDAATSAPNPSLARVHDDLAAMYDRMAREAE |
| Ga0207518_102635 | Ga0207518_1026352 | F103330 | MESDEVFHLTAKISSNSNVADKHPILVNFKSRAALLAAGARIDRANIDLQTVEGGI |
| Ga0207518_103009 | Ga0207518_1030092 | F058266 | MNNDRLLVVFVGLYLIVIVALLFKDAPRPVAPEVKTPVAKVETAPIAKEDKPDAARPQADDGGKPGATPNCEKELRRTADLLRFFANRIQGGEDAQSVV |
| Ga0207518_103288 | Ga0207518_1032881 | F015863 | MRRFIPLLILLGLVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVKRFYTEQLTAAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQ |
| Ga0207518_103508 | Ga0207518_1035082 | F001436 | MNHEQDVATLIELLKMAAERWPRSEADQASQSELFHEDHSLLEMWPEACRRTGVGSREFPPGVIKLWK |
| Ga0207518_103951 | Ga0207518_1039512 | F003758 | VLRAFENAEDEILMNTLKGAPRLDVPSLFRPQFMAEVQKEHPEYFADLPPLK |
| Ga0207518_104221 | Ga0207518_1042211 | F060887 | MNAAAWICLALPLGATVAIALAGTLISRRLAGYLATASVL |
| Ga0207518_104469 | Ga0207518_1044691 | F004886 | RLEVLLEIEKGPRSRSWFAGKAPQDLERPAAIKRLLDTGLIEPAPAPQHYRVTADGWEFLQALRARVGGSSGLDWTRVDEIDFSKL |
| Ga0207518_104485 | Ga0207518_1044851 | F001033 | EKRELLMCGNSLIARLTIAVFAFQMLGVTSVVHAESPDSTAGTSTAGTRKLFINPSSTSVALGKASLIVSPLTHRDGNYVGNYQLKVRPYFFKSEKGSLLLTASDDAVRKLQAGTAINFTGKAVTHKDGRTHIVLGRATPSSGNRGGVTFSIITDDAKIVFNTSYHFGTQPGT |
| Ga0207518_104650 | Ga0207518_1046502 | F041232 | RLLLVAASLLLTYGIALVTFMVRWIRFWMRGEYFLPYLFAFVTTLALLLFWWTSRRPHAPLPILSTIVYSVVAGYVAGLIAMVLYPFFQSDGLQHMIEALRFPTIEAAIAFFWFPIRLLTWLFGGITGVTMLVLSRRWRRMTC |
| Ga0207518_105025 | Ga0207518_1050251 | F094081 | MADDHRKTLKEEFTDRLEKAKGRLQQSFPEIQQSIKTSGAVEAARKIIDPAQSIFKQFADDIQLKDLFAKAEALVANLTVAKSVSRD |
| Ga0207518_105215 | Ga0207518_1052151 | F082749 | MDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGY |
| ⦗Top⦘ |