| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026779 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0072056 | Ga0207471 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A2w-11 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 19861161 |
| Sequencing Scaffolds | 25 |
| Novel Protein Genes | 25 |
| Associated Families | 25 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria | 2 |
| Not Available | 11 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 1 |
| All Organisms → cellular organisms → Archaea | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Methylacidiphilae → Methylacidiphilales → Methylacidiphilaceae → Candidatus Methylacidithermus → Candidatus Methylacidithermus pantelleriae | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.3 | Long. (o) | -89.38 | Alt. (m) | N/A | Depth (m) | 0 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000959 | Metagenome / Metatranscriptome | 821 | Y |
| F001033 | Metagenome / Metatranscriptome | 799 | Y |
| F002508 | Metagenome / Metatranscriptome | 553 | Y |
| F003996 | Metagenome / Metatranscriptome | 458 | Y |
| F016471 | Metagenome / Metatranscriptome | 247 | Y |
| F017145 | Metagenome / Metatranscriptome | 242 | Y |
| F027009 | Metagenome / Metatranscriptome | 196 | Y |
| F028554 | Metagenome / Metatranscriptome | 191 | N |
| F033457 | Metagenome / Metatranscriptome | 177 | Y |
| F034995 | Metagenome | 173 | N |
| F037759 | Metagenome / Metatranscriptome | 167 | N |
| F040398 | Metagenome / Metatranscriptome | 162 | Y |
| F049535 | Metagenome / Metatranscriptome | 146 | N |
| F054841 | Metagenome / Metatranscriptome | 139 | Y |
| F062531 | Metagenome | 130 | N |
| F064703 | Metagenome / Metatranscriptome | 128 | Y |
| F068085 | Metagenome | 125 | Y |
| F080668 | Metagenome | 115 | Y |
| F081321 | Metagenome / Metatranscriptome | 114 | N |
| F087926 | Metagenome / Metatranscriptome | 110 | N |
| F090709 | Metagenome | 108 | Y |
| F092011 | Metagenome / Metatranscriptome | 107 | Y |
| F092487 | Metagenome / Metatranscriptome | 107 | Y |
| F094085 | Metagenome / Metatranscriptome | 106 | Y |
| F094116 | Metagenome | 106 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207471_100027 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2408 | Open in IMG/M |
| Ga0207471_100102 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1774 | Open in IMG/M |
| Ga0207471_100487 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1197 | Open in IMG/M |
| Ga0207471_100625 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1107 | Open in IMG/M |
| Ga0207471_100837 | All Organisms → cellular organisms → Bacteria | 1008 | Open in IMG/M |
| Ga0207471_100839 | Not Available | 1007 | Open in IMG/M |
| Ga0207471_100867 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota | 994 | Open in IMG/M |
| Ga0207471_101014 | Not Available | 938 | Open in IMG/M |
| Ga0207471_101449 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 818 | Open in IMG/M |
| Ga0207471_101496 | All Organisms → cellular organisms → Bacteria | 809 | Open in IMG/M |
| Ga0207471_101511 | All Organisms → cellular organisms → Archaea | 806 | Open in IMG/M |
| Ga0207471_101910 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 736 | Open in IMG/M |
| Ga0207471_102062 | Not Available | 711 | Open in IMG/M |
| Ga0207471_102067 | Not Available | 710 | Open in IMG/M |
| Ga0207471_102080 | All Organisms → cellular organisms → Archaea | 708 | Open in IMG/M |
| Ga0207471_102226 | Not Available | 688 | Open in IMG/M |
| Ga0207471_102233 | Not Available | 688 | Open in IMG/M |
| Ga0207471_103097 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 602 | Open in IMG/M |
| Ga0207471_103249 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Methylacidiphilae → Methylacidiphilales → Methylacidiphilaceae → Candidatus Methylacidithermus → Candidatus Methylacidithermus pantelleriae | 591 | Open in IMG/M |
| Ga0207471_103366 | Not Available | 583 | Open in IMG/M |
| Ga0207471_103828 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 551 | Open in IMG/M |
| Ga0207471_104235 | Not Available | 529 | Open in IMG/M |
| Ga0207471_104252 | Not Available | 528 | Open in IMG/M |
| Ga0207471_104619 | Not Available | 510 | Open in IMG/M |
| Ga0207471_104625 | Not Available | 510 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207471_100027 | Ga0207471_1000275 | F087926 | VNYAQDHKFALYWKGTHAFAKRQRELGDRFVARPVFTRKGSTYVGLVPLDRQKKKA |
| Ga0207471_100102 | Ga0207471_1001021 | F054841 | IHRRSNGTVDTDVYRRKAFLLRRETTNQILRRLGRSVLPQLIGAIAILVSYVLFLPRNPLPQGAGSTFAPAKVSVLPSSLRTPKSNHRSQ |
| Ga0207471_100487 | Ga0207471_1004872 | F001033 | MCGHSLIARLTIAVFVFQMLGVTSVVHAERPDSTAGTSSAGTRKLFINPSSTSVALGKASLIVSPLTHRDGNYVGDYQLTVRPYFFKSEEGSLLLAASDDAVRKLQAGTAFNFTGQAVTHKDGRTHIMLGRATPSSRDRGSVTFSIVTDDARIVFNTSYHFPAPRP |
| Ga0207471_100625 | Ga0207471_1006251 | F037759 | NACPHPYYRAILLRPWAPRTPGVPGRIRVNHLVSAQGQVDNEQGFAMKGTGLLHPGKLAAVIVAGGLLSGAARAQSPELGAPSIGILPPSDILASVSYLGLDPSGEPVRRGAYYVLHAFDRAGIELWVVVDAQFGDVLFMAPALNTSLTPPYVRAARIIQVEPPESGGQQKK |
| Ga0207471_100837 | Ga0207471_1008372 | F080668 | FHALMQAVGVDERVMTLIELFANPKLNAEITRQLAAGQRSVSISGEDAAKTGEF |
| Ga0207471_100839 | Ga0207471_1008391 | F094085 | MRSIPAALFALVFGAAATVPAFGIASVNVFGLKAPEEITGFTLNDSTNFEKIRPGD |
| Ga0207471_100867 | Ga0207471_1008673 | F027009 | YFKKSKQNLQSEHDSMIQDVKQDISSYKKKALSNA |
| Ga0207471_101014 | Ga0207471_1010142 | F081321 | MRAVMCTVLRAAALISLALVLVSPTHAACRGNCEPNVEVARAAMQQIFKQTFLSPYTLVSFERLDGRSGERYGGAFYEMRIRAVLHYDGVKLRCRRPACPELHHYLLEDDARSKKATVAGWLFLQQDGEGDWQPVPLTPGPQ |
| Ga0207471_101449 | Ga0207471_1014491 | F033457 | RNFMSYVLASAFVVLLLAVVTPPGFGVAARPSIEGQRLAPQIVDRTRKSDQLPVPKATGRRLTPPAAPVLVGCDPVFSALSKDKQANYPGRCLA |
| Ga0207471_101496 | Ga0207471_1014962 | F094116 | AFDVAATQIRVADLEERTGLDFGDIKKFDHFAAGGASGTLELPSIEGIVQRAKIVRNGNDIVV |
| Ga0207471_101511 | Ga0207471_1015112 | F016471 | MTEADKFGAVKDATVAIGLANKDTKDVISSFGTGFIIGGEYIVSSAHIFSQCIKYNAQNKDKNKGMEGIYSAFNITTNGDQLELNTYHIIKAIRLPPVKEVTGFTGSVDLDIGIGKLDRHSDNFLHIKEPTQLKLYDEIVICGYPSGR |
| Ga0207471_101910 | Ga0207471_1019102 | F092011 | VPVHYRFGGMDTPNVLFHTAVLVLGVLGILVSLAGIAFWWVSRKMNRLT |
| Ga0207471_102062 | Ga0207471_1020622 | F090709 | IDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK |
| Ga0207471_102067 | Ga0207471_1020672 | F017145 | HAMTAHKSSHRHWMLRDIPRTYVLLTWLAFGAALLIYSNDWHPSGWTALRKEATAPKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGIVNCYEAVSRPEKGQRGGMSSLRMNAIGKAFNRRDE |
| Ga0207471_102080 | Ga0207471_1020801 | F062531 | TMIWFQNSTKSIFAITKAPDDLVFPLFIAGPFMTGYLKYKGVLESADRLTFGHSNYGYRYFLNLSSPSKLLDSSSGLIPKNEFLSKIPKGYDVPFQGKLILTQKHKELYAIIFLNPKEKFDSMLNQIQPTLDSIQLSG |
| Ga0207471_102226 | Ga0207471_1022261 | F092487 | MSKSVTVGTTAFFACILCATPVSLGVSPQGNVSLTHNSASAEVG |
| Ga0207471_102233 | Ga0207471_1022331 | F064703 | KSLTVLIVALLGVFAVKSIIGPGKMNLGESVKYPVPTYDLHVAQPVDMKNFPSDVIPLP |
| Ga0207471_103097 | Ga0207471_1030971 | F000959 | MKPILQINHKQRGPRRDNRARRGHQSFPLTDYNYRPTAEAEASSLAGRPATKAPAFHKLSSKFFGAETSRDYVVELLFFILITGIAAWPVVSMLIAVIRLIRNY |
| Ga0207471_103249 | Ga0207471_1032491 | F049535 | MRRHIADSRASTKSIHESAWSLHRFASLLSTFAVVAGVPRQTSRSIKPRERDLRCLRLFFGPVFNEANEPSSTCRLRFQR |
| Ga0207471_103366 | Ga0207471_1033661 | F002508 | NTQHYARWRNSEVSRNFHRLGLLVAAIILTAGLLLMAKDALGVRFWDLIPADIPILAGGIAIGLTGIGLVSLAAYGIVRAVGWAIDKSV |
| Ga0207471_103828 | Ga0207471_1038281 | F003996 | GKLRVPTHPRRTFRDQFMRMVTGSGDFATRLTATSLNGRVEVAKSARMEEPAEPVVFRGSMGIPQFSPDGKRLLILSGGLWNVYDTIRLIDVSPFYRRQESAAENFQPKSAPTWLAEIASAVSAFDLGGDGSLLTLETVRKRHPESKAGDPYEAVWKRFFPDDRSTQQR |
| Ga0207471_104235 | Ga0207471_1042351 | F040398 | VPNFLRLCVATSSKVLSERRDDLVKFVAAEMDAYRFALANRAETIKVSHEMTHAKPDDKRAEFITDEAIKNKQIDPALSIPLDRLDWMQNLFVKAGVIKQTVPIESIVDKSVNADAAKIAGK |
| Ga0207471_104252 | Ga0207471_1042522 | F068085 | MIALKYILLFTCLGIGLALCVGVISILRSPPDNGPPAWFAAAFGAMFFWGAFALARWRL |
| Ga0207471_104619 | Ga0207471_1046191 | F028554 | LKGQMRKAKQVRNRALSAGEGRRAIIVTAALTGLFLLAAFVTAGSFLSTDPQAMSSVAKVTPLPRTEGGEAASRVASIVMETDKKGRCEERQLDNRTGKMVSANYVNCDARLEPERDSTPSENINRERIRAILGAFKK |
| Ga0207471_104625 | Ga0207471_1046251 | F034995 | MTKILALIAGIFLIGFPAWMLKEMSVPLPEYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFGLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYRPPGSAG |
| ⦗Top⦘ |