| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300002656 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0085736 | Gp0056710 | Ga0005494 |
| Sample Name | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF143 (Metagenome Metatranscriptome, Counting Only) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 6021844 |
| Sequencing Scaffolds | 31 |
| Novel Protein Genes | 36 |
| Associated Families | 32 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 25 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Fungi incertae sedis → Blastocladiomycota → Blastocladiomycota incertae sedis → Blastocladiomycetes → Blastocladiales → Blastocladiaceae → Allomyces → Allomyces macrogynus | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Cytophagia → Cytophagales → Hymenobacteraceae → Hymenobacter → Hymenobacter aerophilus | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfovibrionales → Desulfovibrionaceae → Desulfovibrio | 1 |
| All Organisms → cellular organisms → Eukaryota → Discoba → Heterolobosea → Tetramitia → Eutetramitia → Vahlkampfiidae → Naegleria → Naegleria gruberi | 1 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Nostocales → Nostocaceae → Nostoc → Nostoc punctiforme | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | forest biome → land → forest soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Harvard Forest LTER, Petersham, MA, USA | |||||||
| Coordinates | Lat. (o) | 42.53312 | Long. (o) | -72.189707 | Alt. (m) | N/A | Depth (m) | 0 to .1 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000203 | Metagenome / Metatranscriptome | 1619 | Y |
| F001273 | Metagenome / Metatranscriptome | 733 | Y |
| F001357 | Metagenome / Metatranscriptome | 716 | Y |
| F003383 | Metagenome / Metatranscriptome | 490 | Y |
| F007311 | Metagenome / Metatranscriptome | 353 | Y |
| F008027 | Metagenome / Metatranscriptome | 340 | Y |
| F015068 | Metagenome / Metatranscriptome | 257 | Y |
| F015698 | Metagenome / Metatranscriptome | 252 | Y |
| F020134 | Metagenome / Metatranscriptome | 225 | Y |
| F021676 | Metagenome / Metatranscriptome | 218 | Y |
| F021721 | Metagenome / Metatranscriptome | 217 | Y |
| F021919 | Metagenome / Metatranscriptome | 216 | Y |
| F023505 | Metagenome / Metatranscriptome | 209 | Y |
| F024050 | Metagenome / Metatranscriptome | 207 | Y |
| F026580 | Metagenome / Metatranscriptome | 197 | Y |
| F029356 | Metagenome / Metatranscriptome | 188 | Y |
| F031389 | Metagenome / Metatranscriptome | 182 | Y |
| F032287 | Metagenome / Metatranscriptome | 180 | Y |
| F035144 | Metagenome / Metatranscriptome | 172 | Y |
| F039025 | Metagenome / Metatranscriptome | 164 | Y |
| F048323 | Metagenome / Metatranscriptome | 148 | Y |
| F057922 | Metagenome / Metatranscriptome | 135 | N |
| F062515 | Metagenome / Metatranscriptome | 130 | Y |
| F063428 | Metagenome / Metatranscriptome | 129 | Y |
| F069734 | Metagenome / Metatranscriptome | 123 | N |
| F071421 | Metagenome / Metatranscriptome | 122 | N |
| F078329 | Metagenome / Metatranscriptome | 116 | Y |
| F088037 | Metagenome / Metatranscriptome | 109 | N |
| F091420 | Metagenome / Metatranscriptome | 107 | N |
| F093040 | Metagenome / Metatranscriptome | 106 | N |
| F094972 | Metagenome / Metatranscriptome | 105 | Y |
| F098814 | Metagenome / Metatranscriptome | 103 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0005494J37277_100004 | Not Available | 522 | Open in IMG/M |
| Ga0005494J37277_100013 | Not Available | 882 | Open in IMG/M |
| Ga0005494J37277_100046 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Fungi incertae sedis → Blastocladiomycota → Blastocladiomycota incertae sedis → Blastocladiomycetes → Blastocladiales → Blastocladiaceae → Allomyces → Allomyces macrogynus | 824 | Open in IMG/M |
| Ga0005494J37277_100056 | Not Available | 834 | Open in IMG/M |
| Ga0005494J37277_100132 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Cytophagia → Cytophagales → Hymenobacteraceae → Hymenobacter → Hymenobacter aerophilus | 621 | Open in IMG/M |
| Ga0005494J37277_100195 | Not Available | 762 | Open in IMG/M |
| Ga0005494J37277_100241 | Not Available | 748 | Open in IMG/M |
| Ga0005494J37277_100286 | Not Available | 963 | Open in IMG/M |
| Ga0005494J37277_100338 | Not Available | 560 | Open in IMG/M |
| Ga0005494J37277_100367 | Not Available | 565 | Open in IMG/M |
| Ga0005494J37277_100419 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfovibrionales → Desulfovibrionaceae → Desulfovibrio | 591 | Open in IMG/M |
| Ga0005494J37277_100509 | Not Available | 774 | Open in IMG/M |
| Ga0005494J37277_100517 | Not Available | 508 | Open in IMG/M |
| Ga0005494J37277_100550 | Not Available | 597 | Open in IMG/M |
| Ga0005494J37277_100739 | Not Available | 788 | Open in IMG/M |
| Ga0005494J37277_100850 | Not Available | 763 | Open in IMG/M |
| Ga0005494J37277_100981 | Not Available | 511 | Open in IMG/M |
| Ga0005494J37277_101051 | Not Available | 763 | Open in IMG/M |
| Ga0005494J37277_101585 | Not Available | 838 | Open in IMG/M |
| Ga0005494J37277_101668 | All Organisms → cellular organisms → Eukaryota → Discoba → Heterolobosea → Tetramitia → Eutetramitia → Vahlkampfiidae → Naegleria → Naegleria gruberi | 523 | Open in IMG/M |
| Ga0005494J37277_102107 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Lacunisphaera → Lacunisphaera limnophila | 579 | Open in IMG/M |
| Ga0005494J37277_102473 | Not Available | 580 | Open in IMG/M |
| Ga0005494J37277_102756 | Not Available | 589 | Open in IMG/M |
| Ga0005494J37277_102766 | Not Available | 587 | Open in IMG/M |
| Ga0005494J37277_103485 | Not Available | 588 | Open in IMG/M |
| Ga0005494J37277_104527 | Not Available | 501 | Open in IMG/M |
| Ga0005494J37277_105335 | Not Available | 570 | Open in IMG/M |
| Ga0005494J37277_105525 | Not Available | 780 | Open in IMG/M |
| Ga0005494J37277_105634 | Not Available | 582 | Open in IMG/M |
| Ga0005494J37277_105944 | Not Available | 566 | Open in IMG/M |
| Ga0005494J37277_111372 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Nostocales → Nostocaceae → Nostoc → Nostoc punctiforme | 511 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0005494J37277_100004 | Ga0005494J37277_1000041 | F057922 | VPSDPANSPFSDWHAQSEHSTQGSGDVVLLLPVAAFIRPRISAPEPICLFYLLEARVSKQTFARPHRLLPFESHRSEVNAPALSLRRNSELFFQSVRP*TPILACALTPSGYVRCPKPVTVFQAQSSQTSIQLPLPFRTFILPDRSAQSVARSEKLAFVSGPFSLRSPQASINF |
| Ga0005494J37277_100013 | Ga0005494J37277_1000131 | F069734 | MSPWLNRTLHPRLTPRMNLRVHSGHGKLGMASGALLVPIRRFTIGKPTTNSPTETLTYAFYHA |
| Ga0005494J37277_100013 | Ga0005494J37277_1000132 | F003383 | MRSRVAPLPRSSRSARVASPGCPFPAPFLLSRRPNPRVAPRFRAFGCAGDSSFESPRTRMPLALLVSARLRVAPVALAFSCPACDVGLGSPLVLHLRLYRRWIIESPRCSHHSAVPTYQSSSCPKSQPFGIADDSLFELPRTLNPPVPIDGYPSYLGSRTIRFALVRSPSCPGHLPSATAIDQFPGCPKSWVSHRSPIPLASSFPESWILG* |
| Ga0005494J37277_100046 | Ga0005494J37277_1000461 | F007311 | CWPPHIAVAPVMDYSMLDALRSPLRPDCSVLRFLRPVPASDLTRRPTLSAPRLLRIAPHSPPCARAVHGSPRATDCSVVQHLELLPRLRIAPPAMPAFRAVLGLLLAHRLLPHGVPGLLRVRRLARHAGLRIAPYSDALRPWPAHGFLRARS* |
| Ga0005494J37277_100056 | Ga0005494J37277_1000561 | F063428 | GTKRTVAVERLRLFVSKIIPGDRGKVGLGWLARLLLNWPKGRTAEAKVTGSSGAFARAKCE* |
| Ga0005494J37277_100132 | Ga0005494J37277_1001321 | F021676 | MPDGPATRPKTPLAVENGVGKPAAPEKGAPNAGSGKREWRTPIP |
| Ga0005494J37277_100184 | Ga0005494J37277_1001841 | F039025 | PLAAPAFNLRLASAANFPARP*ANPPARIGVVSPGSVGGKYPAFALYYALPIDWLLTFQLALVSGLQLGLRLLPTHTWRRPSARLVWQLPILIGCCCNSQLAPSAAATPSSRWRLPPVATPVANCRLASAVLLRLCRFRLARLAPCLLTSGWAFDAPLVSTEPCIAG*AVDEYSVSTGSCTLRICQ |
| Ga0005494J37277_100195 | Ga0005494J37277_1001952 | F003383 | ARVASPGCPFPAPFLLSRRPNPQVAPRFRTFGSAGDGPFELPRTSHAFSVAGFGKVPSCPGRYAFSCSACDEGRGSPLVLHLRLDRRWVIELPRCSHHSAVPTYRFSGCPKFRPFGIADDSLSGLPRTLNPPAPIDGYPSYPGSRIPRFASVESPGCPGHFPFGYGN*PISRLP* |
| Ga0005494J37277_100241 | Ga0005494J37277_1002412 | F032287 | VVPQGAAWDLRRNPQDTERPAERQAARFRQEKIWRGASMSKGCDETAGDTSGANPDPESPSKKRGQAARKGSRERGSEHEETRTPTRRDTEDG* |
| Ga0005494J37277_100286 | Ga0005494J37277_1002861 | F024050 | MDVRVAPNFAFLRRCRFQPGSESPRCLLPSSSAPDVGLGFPLVLHLRLYRRWIIESPRFSHLSAVPSCQSSSCPEPQPFGIADDSPPRLPQTSNPPAPIDGYPSYLGSRTIRFALVEAPSYPGSSPSATAIDQFPGCPKSWVSHRSPIHHASSCPESWFLG* |
| Ga0005494J37277_100338 | Ga0005494J37277_1003381 | F031389 | LTRPRRRIERSAVAAGEQKAASRSELLPEWEKADAHQSFDPWLNKIGCGEQRAYVTPFPDAKFRP |
| Ga0005494J37277_100367 | Ga0005494J37277_1003671 | F015068 | DVADLER*SAAETLEIETGGGEREEIGRSRKAGRKTANPAERFDPEGGKSPKGEWRLEERSSSKP*KLQRARSNEAAETGANRTG* |
| Ga0005494J37277_100419 | Ga0005494J37277_1004191 | F021919 | FQVTRSRSQTPRRHATAQSFASRIAGRHPFGSPPRSFKRNGSIPRGSPASFPSWNRHLEAAFHSPETTARFQATISRSKLPTYSFDTLPCIRPARSDPDSPTRPGSPRRAQDHYRNPVARLLPGTSNPSSDLHSPSGPFGPLRIKAFNPISGRKAHLLSAPDCPSLPNIESILLVRCQITAPGSLSVQWLAVPQTS |
| Ga0005494J37277_100509 | Ga0005494J37277_1005091 | F029356 | MTTLILNPRGKDGVRPPARVAGRGEATTRRLDRLPGESPGTDVRKTNAEAEATRLPERSLGT |
| Ga0005494J37277_100517 | Ga0005494J37277_1005172 | F062515 | MLKTTPLDCETKGSLREKRRDPWLWANASLEAMADPELVVKTRERE |
| Ga0005494J37277_100522 | Ga0005494J37277_1005222 | F000203 | GMGVRHALFPDAGIWRTLAASFPTPFSTASGVTGLMTGPSSALRSLNFE* |
| Ga0005494J37277_100550 | Ga0005494J37277_1005501 | F026580 | NRSELDEVTQAGFEPARIAVTQRVNGCYCASARSEGSGGFRKEVRQKKKDAERRDKWSVRTGRDTRPRQGP* |
| Ga0005494J37277_100636 | Ga0005494J37277_1006361 | F098814 | SK*PDFQRCSPPACTASNMHSESSSVLPASSPTAGFPGCRIKTAKRAAY*F*TGLSARNGLSLARDDLRSRGSHHEVKVPGLLLRFRISRLFRPVRLLLRYQFRLAPVSAASTLQTRCGLTARFCRLRFQLPLPFGNIASLGITASTGLAASQPAFRNCPISVRSPPPYSIARSGCGSSFPARYCFVGLLFLKP |
| Ga0005494J37277_100739 | Ga0005494J37277_1007392 | F024050 | GPSAVPAMVGSSYPELRMPSAVLVLARFRVSPVAPASSCSARDEGLELPLVPHLRLHRQWIVESPRLSHLSAVPTGQSSSRPESRPFGIADDSSPRLPQTPNPPVPADRYPSYLGSRTIRFALVESPGCPGHSPLATAIDQFPGCPKSRVFRRSPILLNSSRPEPWFLG* |
| Ga0005494J37277_100850 | Ga0005494J37277_1008502 | F029356 | MTTLILNPRGKDGGRPPARVAGRGEAKTRKLGRLPGESPGTEVRKISAEAEATRLPER |
| Ga0005494J37277_100981 | Ga0005494J37277_1009811 | F021721 | DGSDNRLPLAIATFQPATGQSNKRALNELTILPEPESRYGLSLAHNDAYATIARSMFLACTFVSLSKIFANPFDTRLSRSVRFRGRSGAMSTPDTRFPRRSPTLPIFPRSPLPFGSSYENPLDRSVQPVPDTGSSPCLTPDRPLLPAASSFDSAADQCSKLAVPCSAIVP |
| Ga0005494J37277_101051 | Ga0005494J37277_1010511 | F071421 | LPPLSFLRDGPSQFHSGLTATMICDMRLSGGGNLSPVTPKSGYRFRPTSRNLAKVVAIGQPPTDPIAACWENQQDFS* |
| Ga0005494J37277_101585 | Ga0005494J37277_1015851 | F093040 | MRLAVAEGVKHALGVHHPKTGTDRKGPGLPTNPDLSPVTRDYPRRSRVDGQSPREASATLNLR |
| Ga0005494J37277_101668 | Ga0005494J37277_1016681 | F008027 | PVLQQHPKMSIVYYSVERPIQILVANYGGRDVTNIARDQYHKHGVFTATNDALGGDPLPNVPKFLHVIYSFSGVLGSVSIPEHQQFSPATPPTTVLGALYGSADVTEKVRGLGHEFQVENGHFGDPNPGIFKNFSVVYSKNGQIKSISAKEHDRVHLH* |
| Ga0005494J37277_102107 | Ga0005494J37277_1021071 | F020134 | MTPSRGAGDLAYPMVAHRKGRAAPAATFLSPVARDYPSRALGGVFSPP |
| Ga0005494J37277_102473 | Ga0005494J37277_1024731 | F015698 | SK*PDSQLRSPPACTALSFVFRSAVRFCLSAPLVAFSSPPPSSTLPGAPQSLSATGPVARDGLSLPSNGLRLR*VRSRINVPGLPLRSPHGHSQARSAFRSTARCGSPRYRQLPSFSPLPARFRARSASPPTAFPIRIFTSLWINVLPDSLPIGPPSDYARSPLAPRSRLSITSRWKRINA*GPLRFRRLAV |
| Ga0005494J37277_102756 | Ga0005494J37277_1027561 | F071421 | AQQAIQLPPLSFPRDGPPQFHSGLTATMVCGMRLSGGGNLSPITLESGYRFGPTPRNLARVVAIGQPPTDPIAARWENQRDFS* |
| Ga0005494J37277_102766 | Ga0005494J37277_1027661 | F094972 | MNPLPDSVPAFLADTEPPLPFRDFYIPLQIAAFDSAFRSKAHLYELPDSPSLPVSFMLLTISLRIIVPGPLRLTKFDCSVN |
| Ga0005494J37277_103485 | Ga0005494J37277_1034851 | F088037 | VLQSSTCIAVLPGADISKGLSLASRDFLFPGCLEKVNAPGCFLQRPAEISLKPVPPTASPLNPVCPGLGGILAMSPLPDFVSALLAATVSPLPSRDFYIPLRIAAFDAACHLKAYLLELPDFPSLPAGLPLLTSGRRIIVPGPLLLTRFVCSV |
| Ga0005494J37277_104527 | Ga0005494J37277_1045271 | F048323 | FDSLTGITKSESGESSEELKITGKPELHKNESSKEMKKRVN* |
| Ga0005494J37277_105335 | Ga0005494J37277_1053351 | F001273 | MRIDLYTKTILTIIALLLAIIVMKPLFQPQPALAEGKYAGVQFSYSGGNHAFFDARTGDVWEYGEDGHFRQHHKVHEFGKDHDH* |
| Ga0005494J37277_105525 | Ga0005494J37277_1055251 | F035144 | MFTLAAAPCTPGTLASYIALGSTGCTVGNDTFFNFQLINDNASGGATMVTAADINVQGMGPAGTMGASSQNSFLPQDIGVDFDTALWAVTAGQSQDDDISFDVSVGTGAVDITDAGVDQISNTVPNGTASVTEKGCSGLVFPCASTWGVDTNDSTFVSDTIFSATGTLSVEKDIALVGGTGSAGLSNVADVFSTSEVPEPRALSFLLGLGLVAGFVFRKKFQGENA* |
| Ga0005494J37277_105634 | Ga0005494J37277_1056341 | F091420 | EQSSRVERSETGTMIRRLITKSEWQAGSEGKSGGSHGELFSGTLN* |
| Ga0005494J37277_105634 | Ga0005494J37277_1056342 | F078329 | MPAPNRTEQQASLKALKRFSRKQKGGELRGRRLAVLIAEAPTGHAGGGRGCEADHLE |
| Ga0005494J37277_105944 | Ga0005494J37277_1059441 | F023505 | LDVGGGRKKAASFPEIIPGDWAKVGSGWLAQPLEDRFARSGNRRHNSPVPLSAFARRQ* |
| Ga0005494J37277_111372 | Ga0005494J37277_1113721 | F001357 | MKKILLLAVLALALPMAVFAGTSVDYTNSGGTLSGSSAGLSLSGSVLIAANGLNGGGLITGSNLGSVSFTTGALLSGNLQMGGTFGAGGSFTVTGNGTDGIPNGVLFTGTFSSPVSWTLVTLANGTHNYTLIGTLTGTTGGSSVVTQGVT |
| ⦗Top⦘ |