| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026731 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0072043 | Ga0207523 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A4-10 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 19672863 |
| Sequencing Scaffolds | 16 |
| Novel Protein Genes | 18 |
| Associated Families | 18 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 9 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Archaea | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | 0 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000268 | Metagenome / Metatranscriptome | 1411 | Y |
| F002896 | Metagenome / Metatranscriptome | 522 | N |
| F003755 | Metagenome / Metatranscriptome | 470 | Y |
| F014308 | Metagenome / Metatranscriptome | 264 | Y |
| F019867 | Metagenome | 227 | Y |
| F020379 | Metagenome / Metatranscriptome | 224 | Y |
| F022740 | Metagenome / Metatranscriptome | 213 | Y |
| F034564 | Metagenome / Metatranscriptome | 174 | Y |
| F040358 | Metagenome / Metatranscriptome | 162 | N |
| F045732 | Metagenome / Metatranscriptome | 152 | N |
| F061030 | Metagenome / Metatranscriptome | 132 | Y |
| F064778 | Metagenome / Metatranscriptome | 128 | Y |
| F072223 | Metagenome / Metatranscriptome | 121 | Y |
| F081936 | Metagenome / Metatranscriptome | 114 | N |
| F084328 | Metagenome | 112 | N |
| F086321 | Metagenome | 111 | N |
| F088977 | Metagenome / Metatranscriptome | 109 | N |
| F097305 | Metagenome / Metatranscriptome | 104 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207523_100020 | Not Available | 1598 | Open in IMG/M |
| Ga0207523_100559 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 815 | Open in IMG/M |
| Ga0207523_100762 | All Organisms → cellular organisms → Archaea | 753 | Open in IMG/M |
| Ga0207523_100954 | Not Available | 703 | Open in IMG/M |
| Ga0207523_101343 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 640 | Open in IMG/M |
| Ga0207523_101673 | Not Available | 601 | Open in IMG/M |
| Ga0207523_101678 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 601 | Open in IMG/M |
| Ga0207523_101853 | Not Available | 586 | Open in IMG/M |
| Ga0207523_102025 | Not Available | 574 | Open in IMG/M |
| Ga0207523_102059 | All Organisms → cellular organisms → Archaea | 571 | Open in IMG/M |
| Ga0207523_102509 | All Organisms → cellular organisms → Bacteria | 540 | Open in IMG/M |
| Ga0207523_102752 | Not Available | 526 | Open in IMG/M |
| Ga0207523_102872 | Not Available | 519 | Open in IMG/M |
| Ga0207523_102874 | Not Available | 519 | Open in IMG/M |
| Ga0207523_103105 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 508 | Open in IMG/M |
| Ga0207523_103208 | Not Available | 504 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207523_100020 | Ga0207523_1000202 | F022740 | QAALNHAQAVDKAAGAGSRNVGFYHRRRAQILAARSNELGGLVSFISVGGQPVNTTRNGTIVAAFTFDDIVWTDIQQKTFAAATAQIRQIRPGSTPVLAATGTITPLADTEIKKLGWKIVQLKPER |
| Ga0207523_100559 | Ga0207523_1005591 | F064778 | SLNPSSKPMEKAAFSAACERGKPPGVPGAANGAGYSGDGRKGQEITNLGRQ |
| Ga0207523_100696 | Ga0207523_1006962 | F045732 | VRAAACGCVCAGHVVCNMRHRRRRVFIDTQKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWIATVIDWARRAA |
| Ga0207523_100762 | Ga0207523_1007621 | F081936 | PPNVEASVNPKVEQLAHVLEITLIVVPDSADLLNFNLRFLIIYRLKFTKTAIRTELIQVKINASIISLGT |
| Ga0207523_100954 | Ga0207523_1009541 | F088977 | MRTTSLAGVLSLSLSAPLFSQTFNQTATYIALMRSSVGGLPPVATSTLQGDLQDGVALAIRYGYVPSSSRMDLPSMNNFGLTAVLPTGTASTVSITGGLSSLSRGGSDAWIIGAGGDLRLTDWAFSQGRSAPHLRVAVNGQLDYSKPRESALIAGSVGLPLSIIRPNRPKQEMQVVPFVTPSFAFGN |
| Ga0207523_101343 | Ga0207523_1013433 | F003755 | SENSVMNATVIELPTVESLSDEIRGVVYERQTLRAVGAGREELERNRAELVRLQQALVDALIRRHLPANAA |
| Ga0207523_101561 | Ga0207523_1015611 | F072223 | RPNVKFATFFQTRFENYGTQCTGPSPSGLPVKCAGEDIAAVTSNQLKVLKPKKKRR |
| Ga0207523_101673 | Ga0207523_1016731 | F097305 | NEKSGKDKKSDSRSQRLAAELRENLRRRKAQVRGRKAPAAGEGTKRPGLPPRGG |
| Ga0207523_101678 | Ga0207523_1016782 | F014308 | MSGSTIGIQLIMPKTGTSPKATPEQQASEREAAQPVVKAPPPPGMGKIVDKVA |
| Ga0207523_101853 | Ga0207523_1018531 | F019867 | MHVTPLSAATILAFASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDVRNGLGIKVDASGYSDTFPPGPAAYCQSDGSAAGIACGTGLTFHAIGRAFYVTAGPEWKIRRGKRFAPFAQTLVGIVSTRSTFMMNGSDVQYTNPFTGGVLLFTSAGFLPDRSIHYADAHADAGL |
| Ga0207523_102025 | Ga0207523_1020251 | F040358 | MRIAIFALAILAGSGAAEARQVEVVSTSPRHIEIAAWCTAGSNCQQEASDVAQGYCHGPDYPRRALYVRSGLVERGFFSERVIFVYKCNRRSI |
| Ga0207523_102059 | Ga0207523_1020591 | F002896 | LYVIKTIYGMDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGEISPVEARKYLVNRKQDECRVSYDHNNVEYILRVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDKNPRLCKEGTIFHQELYNECVKTFKDLLIHSDAQHH |
| Ga0207523_102509 | Ga0207523_1025091 | F000268 | MLMRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFYRDSTTRPKTDIESFTVSMRTADDRWRAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGSSGVTFGFDKNGRMIFPDSFDSMSGE |
| Ga0207523_102752 | Ga0207523_1027521 | F084328 | MRPKNAAPSVSDACPDWEAPAFTKLPIASRTGSGANVANDAPRAIEPEAPGQPLSKLGFSFEMSFPM |
| Ga0207523_102872 | Ga0207523_1028721 | F061030 | DIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK |
| Ga0207523_102874 | Ga0207523_1028741 | F034564 | RSGRLSGYWYGAVCIAGLGFSLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFLAGNFIAAHLIGR |
| Ga0207523_103105 | Ga0207523_1031052 | F020379 | MLEVQRELAALKFIDGLSAHLKEVREPHKALRHALRDTREFFQATGGCIATLRAGRPQADLLFALPRRGAWDLGVLTRYIRHTHPPI |
| Ga0207523_103208 | Ga0207523_1032081 | F086321 | MNPSKKEIYKILERFTSQKGGILFILHKSFSSDSKPPQEQTSVVRDDERKSAIFEINIGTVAGLSMLCECEKVLADQVMSLDFLDSGIEGDVIWFGGILDKSGSEFIG |
| ⦗Top⦘ |