| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300027407 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055679 | Ga0207520 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01A5-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 17522440 |
| Sequencing Scaffolds | 14 |
| Novel Protein Genes | 16 |
| Associated Families | 16 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| Not Available | 7 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Agrobacterium | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.2958 | Long. (o) | -89.3799 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001033 | Metagenome / Metatranscriptome | 799 | Y |
| F006812 | Metagenome / Metatranscriptome | 364 | Y |
| F009013 | Metagenome / Metatranscriptome | 324 | Y |
| F015418 | Metagenome / Metatranscriptome | 255 | Y |
| F022731 | Metagenome / Metatranscriptome | 213 | Y |
| F025530 | Metagenome | 201 | Y |
| F025757 | Metagenome | 200 | N |
| F038225 | Metagenome / Metatranscriptome | 166 | Y |
| F046248 | Metagenome | 151 | Y |
| F048297 | Metagenome / Metatranscriptome | 148 | Y |
| F068085 | Metagenome | 125 | Y |
| F070676 | Metagenome / Metatranscriptome | 123 | Y |
| F071480 | Metagenome / Metatranscriptome | 122 | Y |
| F075090 | Metagenome / Metatranscriptome | 119 | N |
| F099521 | Metagenome / Metatranscriptome | 103 | Y |
| F105676 | Metagenome / Metatranscriptome | 100 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207520_100119 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1144 | Open in IMG/M |
| Ga0207520_100123 | Not Available | 1137 | Open in IMG/M |
| Ga0207520_100172 | Not Available | 1079 | Open in IMG/M |
| Ga0207520_100616 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 824 | Open in IMG/M |
| Ga0207520_100994 | Not Available | 732 | Open in IMG/M |
| Ga0207520_101003 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 730 | Open in IMG/M |
| Ga0207520_101088 | Not Available | 715 | Open in IMG/M |
| Ga0207520_101134 | Not Available | 704 | Open in IMG/M |
| Ga0207520_101172 | All Organisms → cellular organisms → Bacteria | 698 | Open in IMG/M |
| Ga0207520_101905 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 609 | Open in IMG/M |
| Ga0207520_101974 | Not Available | 603 | Open in IMG/M |
| Ga0207520_102376 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Rhizobiaceae → Rhizobium/Agrobacterium group → Agrobacterium | 573 | Open in IMG/M |
| Ga0207520_102578 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 561 | Open in IMG/M |
| Ga0207520_102925 | Not Available | 540 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207520_100119 | Ga0207520_1001193 | F025530 | MQIGEAARDNIAAGIWFAVLAGSLFLYAQSILMTTGLMLELTAAYSTFVLCGKSARSPFVHAIPYAFALAGAVFLCLAPDFHNAIEASLVFLGVTALMHGSVVYSALKNPRETEDPVYASAT |
| Ga0207520_100123 | Ga0207520_1001231 | F038225 | MKRFSLALLGTVGAFFILTPAQAADYRVVQYNDTKICQVVDMAGPFKPISSNYTVLTKKSIPTFDAAMKARAEVG |
| Ga0207520_100172 | Ga0207520_1001722 | F025757 | MSYSASLSLFWLEMAVLIGCVALSIGMRSRPAMWVALGIVAHCAMWLAMHDEEILIRLVASTLVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVHRWTEPAQRRESQPGQKA |
| Ga0207520_100616 | Ga0207520_1006162 | F048297 | MSHPSYEYHEHDPEPGSGLAGYAAIKYTAIVIIVLAILAFLAWYVVPAFD |
| Ga0207520_100994 | Ga0207520_1009943 | F006812 | MYVLLIILLVVLPFGTAIVCGALMQWAGNTISGWSSICGLALGIYFFHKCMEVLPAA |
| Ga0207520_101003 | Ga0207520_1010033 | F068085 | LFTCLGIGLALCVGAISILRSPPDNGPPAWFAAAFGAMFFWGAFALARWRLQ |
| Ga0207520_101088 | Ga0207520_1010881 | F075090 | AIVAEGEIAKAAALKPDEIPDGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGDAQKCRSGDFFAGTMLDVVESATIARAYTTCQTQQSATSTYYFAMPRKQGGGLYLTKIIATGVEVPPTIERAIKELDAKVRGVITAALARL |
| Ga0207520_101134 | Ga0207520_1011341 | F015418 | MSILSRRTFTKGLLASALVPGTSAFGQPNDPASIAIIDTPHNAAKAASKLAAQNIKVVVRFFARKPQPGLREKVMASDGNMIDGVRE |
| Ga0207520_101172 | Ga0207520_1011721 | F070676 | YGGDMASKTKESKKGTVVTALGESALTGLPGKVGGAVAKPVSQRTRWSEQQIRTAIGLALFAWALYRVLRPAVRAIRNR |
| Ga0207520_101499 | Ga0207520_1014992 | F071480 | VLVESADVRTLLLVRRRSVGWQTIRLAKVTAPRFRLGWPGLVLTRAGLPVVAYTRWNGVNLNSQLLVTRVDGKGRLTTRRITDEGFPKSLVPPPAAPVLFGNVVHVVESYGYRGVLGALEWFPRGKTWTGFGLDAGVGDFPVGPVLAGLSPSGVLHAAWTESLTSFDTAPVT |
| Ga0207520_101905 | Ga0207520_1019051 | F022731 | MGTDAIDRVKKELLRAFDNTRAELDRIELLAAGLAAFNTPIPSYEPTFRHLPQLNLTV |
| Ga0207520_101974 | Ga0207520_1019742 | F009013 | MLGEREAVVLEVKPWDVHGVGYVDVTVVYADRMLETARLGPESLPEDLEAGERVMVTKAVNMIVGIRRA |
| Ga0207520_102376 | Ga0207520_1023761 | F105676 | ARQSPWGAQRLYVDDAHWQETVTMLARDADRIVLCIDASDGVRWEIAHVLQSGHAGKTLFFLNPSTDVQTRKRQLQEDFGVSAADLASIDVDRVLALRTTSTDQLILMVCDKPERDAYLVAARLAFEGRAGNLTMSAKGR |
| Ga0207520_102578 | Ga0207520_1025781 | F001033 | SLIARLTIAVFVFQILGVTSVIHAERPDSTAGTSNAGTRKLFIDPSSTSVALRGKASLIVSPLTHRDGNYVGDYNLKVRPYFFKSEKGSLLLAASDDAVRKLQTGTAINFTGKAVTHKDGRTHIVLGRATPSSRDRGSVTFSIVTDDARIVFNTSYHFGT |
| Ga0207520_102816 | Ga0207520_1028162 | F099521 | LCLQAGPNASDCIYGAARDRGNNYAGGREATRLCTAAPARYRARCYEGIGTILGSLHRSGAERRAACDAVTPKAYRRDCYRGAAVS |
| Ga0207520_102925 | Ga0207520_1029251 | F046248 | ICAIDASHSRMKQKYALRRALPILVAAVLLTAIFVDVVAAGVKAVSRRSRASYPVPDTGITVSVPNAMKAFPAELLPQ |
| ⦗Top⦘ |