Basic Information | |
---|---|
IMG/M Taxon OID | 3300027430 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0072028 / Ga0207561 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1-10 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 27157210 |
Sequencing Scaffolds | 16 |
Novel Protein Genes | 20 |
Associated Families | 20 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 9 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1 |
All Organisms → cellular organisms → Bacteria | 3 |
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae → Nitrososphaera → Candidatus Nitrososphaera evergladensis | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (BCSTs) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (BCSTs) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | 0 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000135 | Metagenome / Metatranscriptome | 1961 | Y |
F000569 | Metagenome / Metatranscriptome | 1018 | Y |
F002315 | Metagenome / Metatranscriptome | 572 | Y |
F015418 | Metagenome / Metatranscriptome | 255 | Y |
F017145 | Metagenome / Metatranscriptome | 242 | Y |
F017759 | Metagenome | 239 | N |
F018027 | Metagenome / Metatranscriptome | 237 | Y |
F019338 | Metagenome / Metatranscriptome | 230 | Y |
F021340 | Metagenome | 219 | Y |
F023931 | Metagenome | 208 | Y |
F037795 | Metagenome | 167 | Y |
F045732 | Metagenome / Metatranscriptome | 152 | N |
F050525 | Metagenome / Metatranscriptome | 145 | N |
F057709 | Metagenome | 136 | Y |
F058266 | Metagenome | 135 | N |
F059295 | Metagenome | 134 | Y |
F068281 | Metagenome | 125 | Y |
F080668 | Metagenome | 115 | Y |
F085664 | Metagenome / Metatranscriptome | 111 | Y |
F094081 | Metagenome / Metatranscriptome | 106 | Y |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0207561_100206 | Not Available | 1398 | Open in IMG/M |
Ga0207561_100254 | Not Available | 1342 | Open in IMG/M |
Ga0207561_100556 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1073 | Open in IMG/M |
Ga0207561_100874 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 923 | Open in IMG/M |
Ga0207561_100994 | Not Available | 883 | Open in IMG/M |
Ga0207561_101032 | Not Available | 871 | Open in IMG/M |
Ga0207561_101533 | Not Available | 754 | Open in IMG/M |
Ga0207561_101552 | Not Available | 750 | Open in IMG/M |
Ga0207561_101828 | All Organisms → cellular organisms → Bacteria | 711 | Open in IMG/M |
Ga0207561_102797 | All Organisms → cellular organisms → Bacteria | 609 | Open in IMG/M |
Ga0207561_103486 | Not Available | 566 | Open in IMG/M |
Ga0207561_103727 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae → Nitrososphaera → Candidatus Nitrososphaera evergladensis | 554 | Open in IMG/M |
Ga0207561_104111 | Not Available | 535 | Open in IMG/M |
Ga0207561_104156 | Not Available | 533 | Open in IMG/M |
Ga0207561_104415 | All Organisms → cellular organisms → Bacteria | 522 | Open in IMG/M |
Ga0207561_104554 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 518 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207561_100206 | Ga0207561_1002062 | F059295 | AAMGAVVYAAFFAALIAMPIYGGGAYDKNGYQPFNAPVPIFAKKWDANITAFSIQLLILVAGLLTVSGAFAG |
Ga0207561_100254 | Ga0207561_1002543 | F057709 | MSEFDLKVALIIFVTKFIDPFAAVPALVAGYFCRTWWQVVIAAAAVGIFVE |
Ga0207561_100556 | Ga0207561_1005562 | F019338 | ADGNRTGAFLIVGAIVCLSTAQFVFAQEVDPRCKDIYDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEAAGGTQTLDGGRVAFPKYYRREGLKFRRSDAIEGYLGCMRAAGRK |
Ga0207561_100874 | Ga0207561_1008742 | F017145 | WLAFGAALLIYSNDWHPSGWTALRKEATAPKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGVVNCYEAVSRSEKEQRGGMSSLRINAIGKAFNRRDE |
Ga0207561_100994 | Ga0207561_1009943 | F045732 | IRAIISHVQKLRGPRAESLVSKMRHRRRKVFIDTHKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWVATVIDWTRRAA |
Ga0207561_101032 | Ga0207561_1010322 | F015418 | MSIVSRRTFTKGLLASALVPGTSAFGQPNDPASIAIIDTPQNAAKVAAKLAAQNVKVVVRFF |
Ga0207561_101173 | Ga0207561_1011731 | F000135 | PELAEDFERLMTDDRDVVRGSLDTVSDWRLTRPADVPGQSIGQADYVLIAEIVEVDRFEQQASEHVQRLQDDLAHLVSSRGMLIVRPVL |
Ga0207561_101533 | Ga0207561_1015332 | F058266 | MKNDTLLIVFVTLYLMVIAALLIKDAPRPVALEDKAPVAEVEKAPIAKEAASPQADDSSKPGSAPDCEKELRRTADLLRFFANRIHDGEDTQSIVADRRQQEKKISAVCEQ |
Ga0207561_101552 | Ga0207561_1015521 | F002315 | MKHVAVLSMAMLTVAFAADKKTYRYNCKGGAFTVTAAVEASGRWSKAEPVVLQIDSEPPQTLIADPDVPDADSFTNKDYEFYALKTFITLTRKSHGVVVKTY |
Ga0207561_101828 | Ga0207561_1018282 | F021340 | TIARSNEARSQFCHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK |
Ga0207561_101857 | Ga0207561_1018571 | F085664 | VHFNVAAFAMAQPNGNVGNFGNTPVGILRHPTWHEWDITLSRRFPVTFMGRKNSGVKLQFQTFNVFNEVQFTNMNASYTFTGANNSVNNSANTGKYTQSGDGLAAGTIAPRIMSLTLRFD |
Ga0207561_102797 | Ga0207561_1027971 | F080668 | MQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF |
Ga0207561_103486 | Ga0207561_1034862 | F017759 | KYGADSQHIELQGKVHGSASLVDARRLARNQRRPKVPAIGPVLFAGEIAYRICAMDNRLIVRLKKSEEPKILTVQNSRGTEFFMANLRQIPSILRGNDSTGFFAKEI |
Ga0207561_103727 | Ga0207561_1037271 | F018027 | MPYRRIFVGTTILLLALVTGGAFAATGGLPLGPSSTMKARDFKGYYDGHKDTFLVTDVSNKAQARALHVNFSREIGKVKSAPAQYFVRGKAVRGQLSVFGSEPGEADYNPLWEEFYVSWNPGVTPVLLVKDDQITALAKSKKLTLKDARIVLNA |
Ga0207561_104111 | Ga0207561_1041112 | F068281 | SYAQKKVRFVAFLVGTSAAPSELIIMTYQSDARRDKRDDKFYISWIIRGGIVLVIVIAALAFTSTGNYPDLDVPQMTRTVPGPAS |
Ga0207561_104156 | Ga0207561_1041561 | F000569 | IKKVWLPGAASCLLFFGFQWVLIWLPFDKNRFQFLAIPYLVLPFVGALAAYGSRRMKGSVLERIVSALFPVFAFVALFAVRIVYGLFFEGEPYTLPHFRAGFSVTLVFIVVGGLLLVLGAWPFCRPHLREQLP |
Ga0207561_104211 | Ga0207561_1042112 | F094081 | MRDDQGKILKKEFVERFDKAKGRLQQSVPEIQHLLKTSNVVEAARKIIDPAQSIFKQFADDIQLKDLFAKAEALVANLTVAKSVSRDTPPTET |
Ga0207561_104415 | Ga0207561_1044151 | F037795 | VGALMVIGVATSVFQLYGVALVVGCMLMVMSLTDGPRGFKMLARDVKRHVVGR |
Ga0207561_104468 | Ga0207561_1044682 | F023931 | MNEAQTLSYTRAQTATRLRIYQVLFAISIIAGLLAGLWCIFDPVGFAQLVFQIDPYPQTWPRIWGATLFGLQLAYIPGVRNPSFYRWPNWASIAIKFLMTIIFLTAGSSFYLLAAWELVW |
Ga0207561_104554 | Ga0207561_1045541 | F050525 | AYLAGVSAIFGIGIVGLMALHSPTGRTPSSPPVAAAEPLAKPAKRPLDDKKTAHRNQTHKKVHVTRKQHEAPSIDAGRNAYGYAEELRRIDPNRFLFFGR |