| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300002651 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0085736 / Gp0056637 / Ga0005458 |
| Sample Name | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF107 (Metagenome Metatranscriptome, Counting Only) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size (bp) | 5501876 |
| Sequencing Scaffolds | 20 |
| Novel Protein Genes | 21 |
| Associated Families | 20 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 18 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia mallei | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (LTER) Site In Petersham, MA, For Long-Term Soil Warming Studies |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (LTER) Site In Petersham, MA, For Long-Term Soil Warming Studies |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | forest biome → land → forest soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |

| Location Information | |
|---|---|
| Location | Harvard Forest LTER, Petersham, MA, USA |
| Latitude (°) | 42.532967 |
| Longitude (°) | -72.180244 |
| Altitude (m) | N/A |
| Depth (m) | 0 to 0.1 |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F004183 | Metagenome / Metatranscriptome | 449 | Y |
| F007311 | Metagenome / Metatranscriptome | 353 | Y |
| F016972 | Metagenome / Metatranscriptome | 243 | Y |
| F018180 | Metagenome / Metatranscriptome | 236 | Y |
| F018933 | Metagenome / Metatranscriptome | 232 | N |
| F021676 | Metagenome / Metatranscriptome | 218 | Y |
| F021919 | Metagenome / Metatranscriptome | 216 | Y |
| F022827 | Metagenome / Metatranscriptome | 212 | Y |
| F030100 | Metagenome / Metatranscriptome | 186 | Y |
| F030305 | Metagenome / Metatranscriptome | 185 | N |
| F035644 | Metagenome / Metatranscriptome | 171 | Y |
| F038498 | Metagenome / Metatranscriptome | 165 | Y |
| F041210 | Metagenome / Metatranscriptome | 160 | Y |
| F048323 | Metagenome / Metatranscriptome | 148 | Y |
| F058583 | Metagenome / Metatranscriptome | 134 | N |
| F059015 | Metagenome / Metatranscriptome | 134 | Y |
| F076700 | Metagenome / Metatranscriptome | 117 | N |
| F080934 | Metagenome / Metatranscriptome | 114 | Y |
| F089618 | Metagenome / Metatranscriptome | 108 | N |
| F093895 | Metagenome / Metatranscriptome | 106 | N |

| Scaffold | Taxonomy | Length (bp) |
|---|---|---|
| Ga0005458J37252_100036 | Not Available | 812 |
| Ga0005458J37252_100060 | Not Available | 585 |
| Ga0005458J37252_100208 | Not Available | 598 |
| Ga0005458J37252_100605 | Not Available | 668 |
| Ga0005458J37252_100746 | Not Available | 907 |
| Ga0005458J37252_100774 | Not Available | 810 |
| Ga0005458J37252_101011 | Not Available | 579 |
| Ga0005458J37252_101991 | Not Available | 555 |
| Ga0005458J37252_102241 | Not Available | 512 |
| Ga0005458J37252_102306 | Not Available | 825 |
| Ga0005458J37252_102759 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 566 |
| Ga0005458J37252_102923 | Not Available | 562 |
| Ga0005458J37252_102989 | Not Available | 569 |
| Ga0005458J37252_104278 | Not Available | 1369 |
| Ga0005458J37252_104290 | Not Available | 806 |
| Ga0005458J37252_104559 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia mallei | 805 |
| Ga0005458J37252_105085 | Not Available | 731 |
| Ga0005458J37252_105615 | Not Available | 664 |
| Ga0005458J37252_105847 | Not Available | 651 |
| Ga0005458J37252_106141 | Not Available | 1353 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0005458J37252_100036 | Ga0005458J37252_1000361 | F076700 | AGVSILITRSTAGVASSSYPSGRSPAFAGAGSSGCAGTVALTCVNALHSGSTGGQPPGSDRFRVLRLDRLQTSDSHRLFRASDRPVVTRSTCVEPSFLRLGRQPTSDSHRCRPLARLAASSGLRLMLPLPLEWLRFQFAPAASPFAFTGREPLGLRLVAPSPAEPLMHSLFPPNLASPAKPSMSIPLPLALASSGIFQLNNFRLASAFALLVRPAIPLRLAPQVSPSVRAGGHPPVLTGCPLRQPCWRSTSGLRRLSLLPAFRQPTPMPF |
| Ga0005458J37252_100060 | Ga0005458J37252_1000601 | F059015 | SRVHGTIQRPVCMRTFALRPDVRDKTAHRSLPPLFSMRQVIALSRLHHTSLSD*HLAGAGSSIRPFARSQRRFRHHCEVNVPGLHLRFHIGNLRESVRLQALSLRSVSRPNRGAVNAQNPLSAPTSNAPDFSPISTPLQVLLRKPSGSKRSTGSISGSPSYQTFDCLLLPATSSFDSATDQRLKLASFGLTYRS |
| Ga0005458J37252_100208 | Ga0005458J37252_1002081 | F004183 | MERLRGANEDRANSEGGPNGCEAYQRVDPKGAKTPEGERRQAAQPAEQAGAECNGLEAWMQPDAGANQQLAAESKSRTSRKTGRQVSEVAGQEL* |
| Ga0005458J37252_100605 | Ga0005458J37252_1006052 | F021676 | MPDGPATRPETPLAVENGVGKPAAPEKGAPNAGSGKRDWRTPIP |
| Ga0005458J37252_100746 | Ga0005458J37252_1007461 | F030305 | GVFRMPLQFIGSCIVPFGFLVPVPSFLFPALSDLARRCSSGRPVPRVSDRTGDEAPSCPGSSVFSAVPADGSSSRPDSRILQLGSPQIARLPRLSTSCLAVDERPGCPVRSIIWLYRRRRFRVAPNLTSFGGTVSNSPSRPGSSLLQPLPLMVLRVAPGAPSSGFAGGDSSGCPEALIPRLCRLVAFRVSPDLPPSDICRFRFSGLPQIGFLGGSMMNPRFARTLHPRSIQLTSLQVAPKFPLPAAPRMNLQTHSGLAFLPTLRCSLNLYPLFARRRTGLLRTTINQFRIACRAGLQSVSLQ |
| Ga0005458J37252_100774 | Ga0005458J37252_1007742 | F089618 | GLVVQPVLRPSATPAANLQLSLPLPPLAAPVSNIRLASAALPPARPRANPPARIGVFSPGSTGGKHPACTVCYALPIDWLLTFQLALASGLQLGLRLLPTHIWCRPSARQVLRLPVLTGFRRNLRLAPAAAAAPTRAGGCPLLPHRPRTSDSHRLLFCRLCRSRLTQLALRALTSGWAFDAPLASTEPCIAG* |
| Ga0005458J37252_101011 | Ga0005458J37252_1010111 | F058583 | PRVHGTIERTTGMLGVRSAANLSLNLRIAARRRCFSTCAGSMPRSIVKLKSLRRSWKLETAFHSLATTFAHHCEVNVPDLLLRFRTENPAEPVRSRTPPLRSVFEAEPGRNRRPLPVALRRSPALLRFHSISTPLEAHLQSPPDQSVRPSSSRGSLPDETLACPLLPSARSFRILPRITAFDALCSARLIVP |
| Ga0005458J37252_101991 | Ga0005458J37252_1019911 | F038498 | *ANHRHARRSIRGESVFEFPDHRSPSPFFNMRRVNAATCEKLKSLRRNRRLETAFHSLTTTFSRHCEVKVPDLLLRLRAENLTEPVRSRTPSLHSVFETELGRILRPLPVVLRRSPALLRFHSISTPLEAYLQSPPDQSVQPSSSRGSLPDRTSAYPSLPLALSFRIWPRITVPDALCPARLIVP |
| Ga0005458J37252_102241 | Ga0005458J37252_1022411 | F018180 | SGLLSLRLHFAHMPAVFFSTSPSFWKVNEACLLSDPSQIGRPFVTPFSAFSVRLVPVRNQPLSSAPCWRTVATIYPLGNCDSLKPETLLSYLTRLGPVSHRASSLSPFFAFTDSARTLPEELVNSASAVLPFGFPLEESCQPAWPDFSPVTWDYPGDLRRLSFVRLHLSR |
| Ga0005458J37252_102306 | Ga0005458J37252_1023061 | F007311 | VPSCWPPHFAVAPVMDCSMLDAWHSLLHADCSALHFVCPVQTTDLTRCSTLSALRLLRIAPRSTPCARAGHGSLRATDCSVVQHLELCPACGLLRHRDLAFHAALGLLLVRRFPLSVAHGLLRARRLTLHAGLRIAPYADILRPCAVYGLLHFRH* |
| Ga0005458J37252_102759 | Ga0005458J37252_1027591 | F021919 | SRSQTPRRHATAQSFASRIAGRHPFGSPPRFFKRNGSIPRGSPASFPSWNRHLEAAFHSPETTARFQATISRSKLPTYSFDTLPCIRPARSDPDSPTRPGSPRRAQDHYRNPVA*LLPGTSNPSSDLHSPSGPFGPLRIKAFNPISGRKVHLPSAPDCPSLPNIESILLVRCQITAPGPLSVQWLAVP |
| Ga0005458J37252_102923 | Ga0005458J37252_1029231 | F041210 | MLSGTSQCGFPMEPVARDGLSLARNGCHLSAASIPGSTVLACHFAASQLASSPGTPLTPSPPLVCPSVGGFFAFRPDASPPAGTFGCFGCLHSPPGLLHPSGSKRSTVSAASRLAFRTRPISSRSPLTVLLLDFDRGSTFQVRYVSGGLLF |
| Ga0005458J37252_102989 | Ga0005458J37252_1029891 | F080934 | YPTTQLPPGMHGRSCAICVVVRFCFLCSPHRDFARFGSALGSAPVGLTANRNLHLGTAFRSPNKTACFQAPLPRSMLLTYPFGSPLSLLRTRSIHPLVHAVRLAPDGANSTRQTRCPVPSERPRPFFRSPLPFGAFGTPPDQSADLNTSREVHQIVTPDFLRSPLPAVLKEPATDQRSRLATSRLAYCL |
| Ga0005458J37252_103089 | Ga0005458J37252_1030891 | F048323 | YFDSLTGITESESGKSSEALKITGRPELHEKMKIRKNKKKEC* |
| Ga0005458J37252_104278 | Ga0005458J37252_1042781 | F035644 | METPASPKEQALSEYVHSDYLTDSERDAEADIPVPDRLLPKREYRRLVEALRLSQKIGLLVWLNRQGLLSLGGRERLLYLQSKCSFEALEAGLRFARRLTEEAKLQSDFQHQMRELNRRPQSKTFRQSESRRIGVGYRDKGMLPEQSLRARRMAWEESFLPTESIPEVLLSVLQKYLPACLTEDGEWVDLSVFPGTFGSEDNPELKTLLHPL* |
| Ga0005458J37252_104290 | Ga0005458J37252_1042901 | F093895 | PELLVPTFSTSHRSGITPCSTLASCAIHGVLRGRHRMPVLAADCSAINPLALRPAHELLRARHLAFRFVPGLLLARRFPPCVELGFLRVQPSRSVSSSDFSALVPSRFATLTDCSVHVAWLLVSRPERSVLSTSTRSYPPITRCATLCASHRSRITSCPTLDLPSPAQIAPRL* |
| Ga0005458J37252_104559 | Ga0005458J37252_1045591 | F018933 | RVAGLSLRCSRRSRIAPRSTLDSHCLSQIAPRLALIDRCCSPIARRSTLDAPFRSQIAPRSTLRARGGLGSLLVRHLEPLPRLRIAPYSTLSFPLRSQIAPRSILPAPCRSRIAPRSAIDAQCRSTDISVRLHSAPCAVHGSLRAQHFMLRTGH*LLCVRRLHFARFADCSALLTLCLVLHTDCSALNT* |
| Ga0005458J37252_105085 | Ga0005458J37252_1050851 | F022827 | TDHSALPSPDSLQVGDQVSSSSFPSPGHPCGRPAPVFSTGIQDQGCHPRYLQRFRRSLSTSPFPARSSPARTSQPAFRPAARCPEGCSLVPRRALRPHPGVEVSLAFSLDPAPHGFRFGIRAIPAVPIRVVSFKLWVLPQQLESACASYPACSPCVSIRLARRHFLLDLPRTRRSFSTLAGFQLRSYGPTVSPPSSLPAHFYMCRSSAGSSRFSFRMVALRLPPGCSEPIAFAESFFLLSGSL |
| Ga0005458J37252_105615 | Ga0005458J37252_1056151 | F016972 | VRLNPLTDAIPVVNLYLTIYEFDKISFGAKTSRYGPYGLGYRRATITLTK* |
| Ga0005458J37252_105847 | Ga0005458J37252_1058471 | F030100 | PKKADARKWTEASGIIPGDWGKVRPGWLAKPLPKQTARLRVGGRIHQFLWRRSRAVSKREKGTERRGMIRSYSEPGQVAQAGFELE* |
| Ga0005458J37252_106141 | Ga0005458J37252_1061411 | F035644 | MGSPESPEDWALSEYVHSDYFSESERDAEADIPVPDRLLPKRDYRRLVGALRLSQKIGLLIWLNRMDLLSLGGKERLLYLQSKASYEALEAGLRFARRLSEEKKLQSDFMHSMRELNRRPQSKHFRQTSERRRIGVGYRDKGMLPEQSSRARKAAWEESFIPTESIPKGLLEILKRYLPSCLTEDEEWVDLSVFPGTFGSEGDSEMTKLLHPL* |