Basic Information | |
---|---|
IMG/M Taxon OID | 3300026740 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0095510 / Gp0072080 / Ga0207439 |
Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A1w-12 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 20743468 |
Sequencing Scaffolds | 26 |
Novel Protein Genes | 28 |
Associated Families | 27 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea | 6 |
Not Available | 12 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2 |
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Wisconsin, United States |
Latitude (°) | 43.3 |
Longitude (°) | -89.38 |
Altitude (m) | N/A |
Depth (m) | 0 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000569 | Metagenome / Metatranscriptome | 1018 | Y |
F001213 | Metagenome / Metatranscriptome | 746 | Y |
F002616 | Metagenome / Metatranscriptome | 543 | Y |
F004317 | Metagenome / Metatranscriptome | 443 | Y |
F016622 | Metagenome / Metatranscriptome | 246 | Y |
F017892 | Metagenome / Metatranscriptome | 238 | Y |
F019548 | Metagenome / Metatranscriptome | 229 | N |
F022731 | Metagenome / Metatranscriptome | 213 | Y |
F023901 | Metagenome / Metatranscriptome | 208 | N |
F026602 | Metagenome / Metatranscriptome | 197 | N |
F026949 | Metagenome | 196 | Y |
F038480 | Metagenome | 166 | Y |
F040716 | Metagenome | 161 | Y |
F043582 | Metagenome / Metatranscriptome | 156 | Y |
F047460 | Metagenome / Metatranscriptome | 149 | Y |
F057496 | Metagenome / Metatranscriptome | 136 | N |
F057773 | Metagenome / Metatranscriptome | 136 | Y |
F058528 | Metagenome / Metatranscriptome | 135 | N |
F068880 | Metagenome / Metatranscriptome | 124 | Y |
F070698 | Metagenome / Metatranscriptome | 123 | N |
F071393 | Metagenome | 122 | N |
F078970 | Metagenome / Metatranscriptome | 116 | Y |
F083156 | Metagenome | 113 | Y |
F088429 | Metagenome | 109 | N |
F089166 | Metagenome / Metatranscriptome | 109 | Y |
F089980 | Metagenome | 108 | N |
F094081 | Metagenome / Metatranscriptome | 106 | Y |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0207439_100092 | All Organisms → cellular organisms → Archaea | 1653 |
Ga0207439_100213 | Not Available | 1353 |
Ga0207439_100333 | All Organisms → cellular organisms → Archaea | 1199 |
Ga0207439_100542 | All Organisms → cellular organisms → Bacteria | 1038 |
Ga0207439_101042 | Not Available | 838 |
Ga0207439_101341 | All Organisms → cellular organisms → Bacteria | 775 |
Ga0207439_101903 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas | 686 |
Ga0207439_101944 | Not Available | 681 |
Ga0207439_102038 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 667 |
Ga0207439_102099 | All Organisms → cellular organisms → Archaea | 659 |
Ga0207439_102254 | Not Available | 641 |
Ga0207439_102262 | Not Available | 640 |
Ga0207439_102576 | Not Available | 612 |
Ga0207439_102765 | All Organisms → cellular organisms → Archaea | 594 |
Ga0207439_102832 | Not Available | 589 |
Ga0207439_103188 | All Organisms → cellular organisms → Archaea | 564 |
Ga0207439_103296 | Not Available | 558 |
Ga0207439_103344 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 555 |
Ga0207439_103418 | Not Available | 551 |
Ga0207439_103590 | Not Available | 542 |
Ga0207439_103934 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 525 |
Ga0207439_104158 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 514 |
Ga0207439_104254 | Not Available | 510 |
Ga0207439_104316 | All Organisms → cellular organisms → Archaea | 508 |
Ga0207439_104362 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 506 |
Ga0207439_104445 | Not Available | 502 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0207439_100092 | Ga0207439_1000922 | F023901 | MKDNCSQNEIKTSEQSQSCEGGVPLDPKKFGGYEFAINTKGWDAKREPTHNCHDQHKSGLIPTNLENDKAVRLRQTVKDESGKVHQIGEIDYMDGNGFHKVMDIFDSSPNPWMVDRNLYETKSYFWIRNNGSGYITVRDVSLEILS |
Ga0207439_100213 | Ga0207439_1002132 | F019548 | MNPSKKEIYKILERFTSQKGGILIILHNSFSSDSKPPQEQTSVVRDDERKSAIFEINFGTVAGLSMLCECEKALADQVMSLDFLDSGIEGDVIWFGGMLDKSGSEFIGSTYDDGLKSAPVEQSELVHRVNQAIDKCLEYMLNSVELDKKVYVDASDRMSGYVKLTRIGEHIREKHPDLFRDNTKSE |
Ga0207439_100333 | Ga0207439_1003331 | F026602 | MISLTLASMHVFAGRIYAIEQPSKMEGINEEECNKIFKCKIISEDVLKYPDIVNPFAKNEEIAKTLNANDAQIMTEHTCQKLMDVDIVKKKDQKIGEQTPKYLVCLP |
Ga0207439_100542 | Ga0207439_1005421 | F083156 | MSDGGRERASLGVEVWNSSQKWSVQRSAVRSIAWLDVL |
Ga0207439_101042 | Ga0207439_1010421 | F001213 | QQEEVNVIVAGSGVYRIEGEDIPVSVGSFLRFDPGTTRQPIAGPEGMTMIGVGARRGSYEPRGPF |
Ga0207439_101341 | Ga0207439_1013411 | F070698 | QTNIFDRRHHTLMVILNISEIWNNGHNTTFRDSHLEDDNSEILSNGNAPYVIKGNGDCMIIVHADALGASKNFGDGH |
Ga0207439_101511 | Ga0207439_1015112 | F040716 | MTHVRIAANAILVFSFMAVTGFPALAAPAKCNAELRKCNSHCNLVYESGRANRTCRNRCKDNLYVCKARPS |
Ga0207439_101903 | Ga0207439_1019032 | F043582 | PQREKKMPLLFYFPLIIWMGVLEAMQDEMRVAATAKARR |
Ga0207439_101944 | Ga0207439_1019442 | F047460 | MFLLHFVMFGCIYDHFVTALNSAQMGQSGAINAKVRATKSRL |
Ga0207439_102038 | Ga0207439_1020381 | F078970 | ASRVWIDRVQSEISLWSDLATKLSSTKSVPEALDAYTKCVSQRMQMAADDGRRLAEEAQQLTQKFAQSLGNGRPGMTT |
Ga0207439_102099 | Ga0207439_1020991 | F068880 | KYKGVLESADRLTFGHSNYGYRYFLNLSSPSKLLDSSSGLIPKNEFLSKIPEGYDVPFKGMLILTQKHNELYAIIFLNPKEKFDSMLNQIQPTLDSIQLSG |
Ga0207439_102254 | Ga0207439_1022541 | F038480 | NMEGACRVVADRGSSQMKLGRKSGKIVRRDTMKAILVSIVLYFAATAAHSMPISVLNANGLSATIPISDQCGDRCGSSRSYVKDRRSGVGGYSGGYVLVRDPLIQRRPFCPFGSYVACVVSGTYCVDLCH |
Ga0207439_102262 | Ga0207439_1022621 | F057773 | LILEPAVASHETDDGLGKVDIEIVADDIPSDVGGGAAQQAAEKARKILFGPRIADHAFDLAGGNIEGRNQGLSAVAAVLELTPLDLARRHRQSRRDALERLDAGHLVDRDGAMGVIGGGRGFVDRADVCALGIEGGIGLRGQPVADAMRLEVSLFFKKRPTERCEMFGMSPRRMASSAISRWLQWLMGRSLSDGFSHVIATRAQICSGVNVA |
Ga0207439_102576 | Ga0207439_1025762 | F058528 | NDIPDGLSVWPQDDVARAICNALMMATGALVTIREDQHLKDQGTYAIGIGHKLN |
Ga0207439_102765 | Ga0207439_1027652 | F088429 | TPEPQPETNASLNTPEPQPETNASLNTPEPQPETNASEIVTPRSIDLNITVGKDPIARSENQMVTVVALDPTTGKVLDRVFIKLEIKDPVGILVKNYTGTVGNLTRTFKIGENAIGTFIISATASQAGVQSTKSLPFQVQ |
Ga0207439_102832 | Ga0207439_1028321 | F002616 | MQDTPMPNSFPNWTSRIVIAGRIAEYRRPESRHESTMPGDYHLPFGRVEAQAKAARWLWQSYII |
Ga0207439_103188 | Ga0207439_1031881 | F089980 | MDLESATELIVMNWLGIFSLAATIFIAIVTINYRNKQHQIKGLLDAFKILNTREHRTSRRKVYELYIEYEKNKDVGIFDNVPEVVDVRADFDVIGTLVKSRNIDEKLFLIEYGPLAYRCWKYLKNHIEAERKKRNFDPFMMNFESLAGKADNFWLKRGYDLSKTLLYQPEQ |
Ga0207439_103296 | Ga0207439_1032961 | F089166 | ACELGYWAMVDALEAERNAAKKNNAGDHPAFSCEPEAEPSPAPRVQRTADEPAP |
Ga0207439_103344 | Ga0207439_1033441 | F038480 | VLVSIGLCLAATAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACIMSGTYCIDLCH |
Ga0207439_103418 | Ga0207439_1034181 | F004317 | MGKVFRALRPCLSPYRDERQMNLSYMVLLDMWNMFEQKFLSFIGG |
Ga0207439_103590 | Ga0207439_1035901 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFLAIPYLVLPFAGALAAYSSRRMKGSVLERILSALFPAFAFVVLFAVRIVYGLFFEGQPYTLPHFLAGFSVTLVFIVVGGLLLVLGAWPFCRPHLREQ |
Ga0207439_103909 | Ga0207439_1039091 | F094081 | MADDHRKTLKEEFTDRLEKAKGRLQQSFPEIQQSIKTSGAVEAARKIIDPAQSIFKQFADDIQLKDLIAKAEALVANANLTLTKAASKDAAPTEARPIGAPSNEPSAKKAGVKKTPSR |
Ga0207439_103934 | Ga0207439_1039341 | F016622 | VLREVAEYPNCFGPLGPGAERIETDRYTLCLGPGSTWNTVQRQRFALEELDEVLEEVRAHLRDRNRTQTQWEVGSSAPAGLVDALLERGIGFDKDPYAVALVLTSEPPPPREGLVARQIETYAEYLDANAVQWEAFGTPPEQI |
Ga0207439_104158 | Ga0207439_1041581 | F026949 | MKKLFALILSVAALAPFTAQSQRPPGSLAGFLSGQGLVGVKLERRYGNHLFVL |
Ga0207439_104254 | Ga0207439_1042542 | F022731 | MGTEATIDRVKKELLRAFDNTRAELDRIEILAAGLAAFNAPIPGYEPMFRHLPQLNRNAHELAADEPRA |
Ga0207439_104316 | Ga0207439_1043162 | F057496 | MAGVYRNHTKVAEDHSSAYCKDEYVFGGKGEPLRLSHLIAFNVIEDAEHISGILTVEFDYYSDDDSIKYRDMHYSNPRLIKHIEGNPTVMKNID |
Ga0207439_104362 | Ga0207439_1043622 | F017892 | YTKQSTQGPTTVVEGFFEGSLEDAYDEYKKELEAAGFKILFDEIEEHDSEVSWEGEGRSGQVALREECGSDDKIYVHITNRPASE |
Ga0207439_104445 | Ga0207439_1044451 | F071393 | QANELQDAHDEIDRLSQTVSALQEAVTQYQAGAAAAEDEIVLLESEKAALQAQLDGAFEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKVVAAMQGEQRRQRSAFNQQKSMLEIRLQEKEALAATQAATIKQLEGVRDELDKRFRVIEAL |