Basic Information | |
---|---|
IMG/M Taxon OID | 3300027472 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0091531 | Ga0207449 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-HINK07-D (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size (bp) | 18054749 |
Sequencing Scaffolds | 21 |
Novel Protein Genes | 21 |
Associated Families | 21 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea | 8 |
Not Available | 3 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 4 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F001406 | Metagenome / Metatranscriptome | 703 | Y |
F001437 | Metagenome / Metatranscriptome | 695 | Y |
F001497 | Metagenome / Metatranscriptome | 683 | Y |
F002588 | Metagenome / Metatranscriptome | 546 | Y |
F005305 | Metagenome / Metatranscriptome | 405 | N |
F005950 | Metagenome / Metatranscriptome | 385 | Y |
F010126 | Metagenome / Metatranscriptome | 308 | Y |
F011016 | Metagenome / Metatranscriptome | 296 | Y |
F013020 | Metagenome / Metatranscriptome | 275 | N |
F016471 | Metagenome / Metatranscriptome | 247 | Y |
F021150 | Metagenome / Metatranscriptome | 220 | Y |
F029190 | Metagenome / Metatranscriptome | 189 | Y |
F033177 | Metagenome / Metatranscriptome | 178 | Y |
F033996 | Metagenome / Metatranscriptome | 176 | N |
F053116 | Metagenome | 141 | N |
F070478 | Metagenome / Metatranscriptome | 123 | N |
F080180 | Metagenome / Metatranscriptome | 115 | Y |
F087961 | Metagenome | 110 | Y |
F095285 | Metagenome / Metatranscriptome | 105 | Y |
F095448 | Metagenome / Metatranscriptome | 105 | N |
F099830 | Metagenome / Metatranscriptome | 103 | N |
Scaffold | Taxonomy | Length (bp) |
---|---|---|
Ga0207449_100022 | All Organisms → cellular organisms → Archaea | 1880 |
Ga0207449_100070 | All Organisms → cellular organisms → Archaea | 1334 |
Ga0207449_100276 | All Organisms → cellular organisms → Archaea | 968 |
Ga0207449_100370 | All Organisms → cellular organisms → Archaea | 899 |
Ga0207449_100541 | Not Available | 813 |
Ga0207449_100978 | All Organisms → cellular organisms → Archaea | 695 |
Ga0207449_101028 | All Organisms → cellular organisms → Archaea | 688 |
Ga0207449_101083 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 682 |
Ga0207449_101138 | Not Available | 674 |
Ga0207449_101256 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 655 |
Ga0207449_101265 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 655 |
Ga0207449_101327 | Not Available | 646 |
Ga0207449_101341 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 645 |
Ga0207449_101437 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 634 |
Ga0207449_101454 | All Organisms → cellular organisms → Bacteria | 632 |
Ga0207449_101589 | All Organisms → cellular organisms → Archaea | 618 |
Ga0207449_102242 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 565 |
Ga0207449_102428 | All Organisms → cellular organisms → Bacteria | 554 |
Ga0207449_102627 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 543 |
Ga0207449_102662 | All Organisms → cellular organisms → Archaea | 541 |
Ga0207449_103017 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 523 |
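
The per-group scaffold counts in the Dataset Phylogeny table can be cross-checked against the scaffold list above by tallying its Taxonomy column. The following is a minimal sketch, not part of the IMG/M record: it assumes the rows are available as pipe-delimited text, and the names `ROWS` and `tally_taxonomy` are hypothetical. Only four rows are copied in for brevity.

```python
from collections import Counter

# A few rows copied from the scaffold table (scaffold ID, taxonomy, length).
# ROWS is an illustrative sample, not the full table.
ROWS = """\
Scaffold | Taxonomy | Length
---|---|---
Ga0207449_100022 | All Organisms → cellular organisms → Archaea | 1880
Ga0207449_100541 | Not Available | 813
Ga0207449_101341 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 645
Ga0207449_102662 | All Organisms → cellular organisms → Archaea | 541
"""


def tally_taxonomy(text):
    """Count scaffolds per taxonomy string in pipe-delimited rows."""
    counts = Counter()
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        # Skip the header, the separator row, and blank lines:
        # data rows start with a scaffold ID such as "Ga0207449_100022".
        if len(fields) < 3 or not fields[0].startswith("Ga"):
            continue
        counts[fields[1]] += 1
    return counts


counts = tally_taxonomy(ROWS)
for taxonomy, n in counts.most_common():
    print(n, taxonomy)
```

Run over all 21 rows of the scaffold list, the same tally should reproduce the Dataset Phylogeny counts (for example, 8 scaffolds assigned to Archaea and 3 marked Not Available).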
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207449_100022 | Ga0207449_1000221 | F016471 | MIEADKFNAVKHATVAIGLANKDTKDVISSFGTGFFIGGEYIVSSAHIFSQCIKYNAQYKEKNKGTEGIYSAFNITTKGDQLELKTYHIIKAIRLPPVKEVKGFTGSVDLDIGIGKLDCHSDNFLHIKEPTQLKLYDEIVL |
Ga0207449_100070 | Ga0207449_1000702 | F087961 | LISNSHSLKLEETAKGVRISVHVYTNDKQTAINEAIATYLETKQICEKEKIQIAPMEKSK |
Ga0207449_100276 | Ga0207449_1002761 | F010126 | METIKGPASFSIPSMKSSIGSLMVCTCICTAALLLVIIESQTAIAIDTPLSNMSGTNSGSQLESNVTNRTSSDQIAYRNPQYGIFLLFPSNWTFSTSGLPEYTQIAGFYAPLQNLSDPIPARFTISVMSYQQNVSLKDFTNVTLSSLNQTNQIKILSSGPTTLAGRPGYQVVFSTLPNIGNPVSFEIMHSWTAVDNKIYVFQYSVESSKFDTYLPTVKQILDSLRINGKG |
Ga0207449_100370 | Ga0207449_1003701 | F013020 | MSAHNNNNDKQEQERLKLDVLNKIFGWIEDKETKAVMINKYYNNKEHRAALKAFLDDMVKALDESTAETNSKEEIKRQLSYIIREIDPLY |
Ga0207449_100541 | Ga0207449_1005411 | F053116 | VLGDIYIKMNILKRNRFDRRSDSTLIVLIAISLLYIMVTITALYPFDRLQIVAAATLQENNNNTKPVFLSTISTTIATGVGATGAILTVPGYLRARKQPKFLAAYLLKIHNKHDELCRYPKLSDKSKNEYRNFLDSLRCDIIYSLKNGDINENQYRLIEDRIAEYLNRLNYPR |
Ga0207449_100978 | Ga0207449_1009782 | F005950 | MTYAKQKVKAKIHRTSDYDDKHTGIRDFPDEKAMLHYGLSKVHKVIIKKYAKDDEFMAIARKTRGIKFDYDMELYDYIGTGVRESKR |
Ga0207449_101028 | Ga0207449_1010281 | F005305 | VEDLEKELGPIIENFQNLLKDAKAKKIDSLREDEDLKKEFNKLSKDVIEPVMKKFESYLKSKDVNSSVNVRSEIVSGKNPSIEFILHFKLTHESRYPNIKFSSFGEKISIQEDRLITKGEVRQDMMPEYYDKEQITVEFIKERLIRLIKSCFDKNWQSIYS |
Ga0207449_101083 | Ga0207449_1010831 | F001406 | MTGGEEAAPRRGRAPLRQRSRKRWAIAGGVAAGWLVLEVMTGSAISATLVLVAIAALAGASLAGLRALGITSDHPWIQRMASRPWRDGQDVLRIAMRHLPDVFVITPSGSLLAPAVVELQLNPADLDSLREQMDLDVINSSLTEVYEEQVVAYQARPAAPGSRKDSISRLRPRARSTRRARPARSSCSGPAMSG |
Ga0207449_101138 | Ga0207449_1011381 | F080180 | GWYWSTGHYGNYTVDMFVLTTSAAFNQQKTVDAYLAKGNGPSKVLVETMKGVTAHASGKNLTSPGGIHTYPEILTLQWKNGTNSATLTLTNPSIVASPPTIVNTNATIYGNPQYMRLQGTGTLNVQWGGNNETASAPAIWEVSYLH |
Ga0207449_101256 | Ga0207449_1012561 | F021150 | DVDPQDGDTSFVLTVKHSNRHGRAYKATGTATILVDAKTRVRRQGAKTLGALAPNDRVHVTAKACKADLKNGGTPDLTARKIGAHPVAAPTQPSS |
Ga0207449_101265 | Ga0207449_1012652 | F033177 | WMSGDDDLRKLLATLDPQARNDLRRVLIHDQADRDAIASQLLRYRDEHGDDWTDIIDMLTMHPEVRRLLVRVLGELEADHRTG |
Ga0207449_101327 | Ga0207449_1013271 | F001497 | YSDGALTARGYRLPDGTARAGSWVLVDLATGNVQGVVPPGEFSRRFRPVDLFADVPGLPTYLGTVTASDGTALVDLALDRNPFMLANGPRRVARPHALIVPRSHRDGWSSATAAELEACHTAMTLVAAWYRSMDGGHVVFCANDSAPNRDYLRDVEAADGILGDGGTDAAVTKNPRQEVQHAHLHAFYAERGGTENHESSALDGYPVIGAGYRA |
Ga0207449_101341 | Ga0207449_1013412 | F095448 | MVASPHTRQSDPAIPNIVCPKCGLRMQVAAIEPAGNDDRTVTFGCDCGHRYDLSERAIVALARDSSDRW |
Ga0207449_101437 | Ga0207449_1014371 | F001437 | MAGFLFRLETVDGAPAEPPTFATAVPNWSPGDEIPLGHRALRVIGKRDDDADQPPVLVVEDD |
Ga0207449_101454 | Ga0207449_1014542 | F070478 | MSTNARKKNENHVRIYDPDDLEWNAEHQVCPFKGTTLSAAQQVVVKLCKRIKQLGGELPMNWLDCELIYLERFAQKLQRVLDRKVLGF |
Ga0207449_101589 | Ga0207449_1015891 | F033996 | MQFNFTAKDKHALKSLSGDLRDMFSMKRVEDRLENPEYRKVFDASFFRTVEGNYVDEMLNWVEDFRKRLDNQESKESKEQLYSETKELVDAGWIQNPNG |
Ga0207449_102242 | Ga0207449_1022421 | F095285 | MSVKAGQAQHCRCRFRAGKPGEIAACGCGFWWRVSKRGRWYAISKRHAFRALTPGLMRELGYLEVTGR |
Ga0207449_102428 | Ga0207449_1024281 | F002588 | SLIVGATLAVLIWFFPRWFHDHISDEMNAFVLGLVPLLAGASVFLVRWFVSPYPIYMQVRRKVDSLTDTKKEERAKAVQACFERSAAILKQHHSLLLSFHALSRAEGHRLESNKEIADVCDLIQEAGYDHPFEGISPGYVPEKDWLPFLKYVKHAPNINPEEGKDYIDAANRWRDDHGYPLPPG |
Ga0207449_102627 | Ga0207449_1026272 | F011016 | RVLVVLDGDTIVVTMPGTSYSVTYRKLHDSPLLVASDMRDDPDSPINKFAFRARAWIAANDKARELGWIV |
Ga0207449_102662 | Ga0207449_1026621 | F099830 | NADSQNLENDVNKFLKGYENYITSVSLMTFIKDQLPTFLAVVTIQEKIPPVKVETPNTK |
Ga0207449_103017 | Ga0207449_1030172 | F029190 | VQLVSHLADADFRRYVTGTVDPETERHVRVCVCCALRLADAAMQAYWWERRGPLGRLVRLNNTQAVDELLTEIAREQRRDAA |
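
For downstream tools, the protein rows above are often easier to handle as FASTA. The following is a minimal sketch under stated assumptions: the rows are available as pipe-delimited text in the column order shown (Scaffold ID, Protein ID, Family, Sequence), and the names `ROWS` and `rows_to_fasta`, as well as the header layout `>protein scaffold=… family=…`, are hypothetical choices rather than an IMG/M export format.

```python
# One row copied from the sequence table above, with its header lines.
ROWS = """\
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207449_101437 | Ga0207449_1014371 | F001437 | MAGFLFRLETVDGAPAEPPTFATAVPNWSPGDEIPLGHRALRVIGKRDDDADQPPVLVVEDD |
"""


def rows_to_fasta(text, width=60):
    """Convert pipe-delimited protein rows to FASTA text.

    Each record header carries the protein ID, with the scaffold ID and
    family kept as key=value annotations. Sequences wrap at `width`.
    """
    records = []
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        # Data rows have a protein ID in the second column; this skips
        # the header row, the separator row, and blank lines.
        if len(fields) < 4 or not fields[1].startswith("Ga"):
            continue
        scaffold_id, protein_id, family, seq = fields[:4]
        wrapped = "\n".join(seq[i:i + width] for i in range(0, len(seq), width))
        records.append(f">{protein_id} scaffold={scaffold_id} family={family}\n{wrapped}")
    return "\n".join(records) + "\n" if records else ""


fasta = rows_to_fasta(ROWS)
print(fasta, end="")
```

Keeping the scaffold and family in the header line means the link between a protein, its scaffold, and its family survives the conversion without a separate lookup table.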