Basic Information | |
---|---|
IMG/M Taxon OID | 3300002651 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0085736 | Gp0056637 | Ga0005458 |
Sample Name | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF107 (Metagenome Metatranscriptome, Counting Only) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 5501876 |
Sequencing Scaffolds | 20 |
Novel Protein Genes | 21 |
Associated Families | 20 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 18 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia mallei | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies |
Type | Environmental |
Taxonomy | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | forest biome → land → forest soil |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
Location Information | |
---|---|
Location | Harvard Forest LTER, Petersham, MA, USA |
Latitude (°) | 42.532967 |
Longitude (°) | -72.180244 |
Altitude (m) | N/A |
Depth (m) | 0 to 0.1 |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F004183 | Metagenome / Metatranscriptome | 449 | Y |
F007311 | Metagenome / Metatranscriptome | 353 | Y |
F016972 | Metagenome / Metatranscriptome | 243 | Y |
F018180 | Metagenome / Metatranscriptome | 236 | Y |
F018933 | Metagenome / Metatranscriptome | 232 | N |
F021676 | Metagenome / Metatranscriptome | 218 | Y |
F021919 | Metagenome / Metatranscriptome | 216 | Y |
F022827 | Metagenome / Metatranscriptome | 212 | Y |
F030100 | Metagenome / Metatranscriptome | 186 | Y |
F030305 | Metagenome / Metatranscriptome | 185 | N |
F035644 | Metagenome / Metatranscriptome | 171 | Y |
F038498 | Metagenome / Metatranscriptome | 165 | Y |
F041210 | Metagenome / Metatranscriptome | 160 | Y |
F048323 | Metagenome / Metatranscriptome | 148 | Y |
F058583 | Metagenome / Metatranscriptome | 134 | N |
F059015 | Metagenome / Metatranscriptome | 134 | Y |
F076700 | Metagenome / Metatranscriptome | 117 | N |
F080934 | Metagenome / Metatranscriptome | 114 | Y |
F089618 | Metagenome / Metatranscriptome | 108 | N |
F093895 | Metagenome / Metatranscriptome | 106 | N |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0005458J37252_100036 | Not Available | 812 |
Ga0005458J37252_100060 | Not Available | 585 |
Ga0005458J37252_100208 | Not Available | 598 |
Ga0005458J37252_100605 | Not Available | 668 |
Ga0005458J37252_100746 | Not Available | 907 |
Ga0005458J37252_100774 | Not Available | 810 |
Ga0005458J37252_101011 | Not Available | 579 |
Ga0005458J37252_101991 | Not Available | 555 |
Ga0005458J37252_102241 | Not Available | 512 |
Ga0005458J37252_102306 | Not Available | 825 |
Ga0005458J37252_102759 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 566 |
Ga0005458J37252_102923 | Not Available | 562 |
Ga0005458J37252_102989 | Not Available | 569 |
Ga0005458J37252_104278 | Not Available | 1369 |
Ga0005458J37252_104290 | Not Available | 806 |
Ga0005458J37252_104559 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia mallei | 805 |
Ga0005458J37252_105085 | Not Available | 731 |
Ga0005458J37252_105615 | Not Available | 664 |
Ga0005458J37252_105847 | Not Available | 651 |
Ga0005458J37252_106141 | Not Available | 1353 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0005458J37252_100036 | Ga0005458J37252_1000361 | F076700 | AGVSILITRSTAGVASSSYPSGRSPAFAGAGSSGCAGTVALTCVNALHSGSTGGQPPGSDRFRVLRLDRLQTSDSHRLFRASDRPVVTRSTCVEPSFLRLGRQPTSDSHRCRPLARLAASSGLRLMLPLPLEWLRFQFAPAASPFAFTGREPLGLRLVAPSPAEPLMHSLFPPNLASPAKPSMSIPLPLALASSGIFQLNNFRLASAFALLVRPAIPLRLAPQVSPSVRAGGHPPVLTGCPLRQPCWRSTSGLRRLSLLPAFRQPTPMPF |
Ga0005458J37252_100060 | Ga0005458J37252_1000601 | F059015 | SRVHGTIQRPVCMRTFALRPDVRDKTAHRSLPPLFSMRQVIALSRLHHTSLSD*HLAGAGSSIRPFARSQRRFRHHCEVNVPGLHLRFHIGNLRESVRLQALSLRSVSRPNRGAVNAQNPLSAPTSNAPDFSPISTPLQVLLRKPSGSKRSTGSISGSPSYQTFDCLLLPATSSFDSATDQRLKLASFGLTYRS |
Ga0005458J37252_100208 | Ga0005458J37252_1002081 | F004183 | MERLRGANEDRANSEGGPNGCEAYQRVDPKGAKTPEGERRQAAQPAEQAGAECNGLEAWMQPDAGANQQLAAESKSRTSRKTGRQVSEVAGQEL* |
Ga0005458J37252_100605 | Ga0005458J37252_1006052 | F021676 | MPDGPATRPETPLAVENGVGKPAAPEKGAPNAGSGKRDWRTPIP |
Ga0005458J37252_100746 | Ga0005458J37252_1007461 | F030305 | GVFRMPLQFIGSCIVPFGFLVPVPSFLFPALSDLARRCSSGRPVPRVSDRTGDEAPSCPGSSVFSAVPADGSSSRPDSRILQLGSPQIARLPRLSTSCLAVDERPGCPVRSIIWLYRRRRFRVAPNLTSFGGTVSNSPSRPGSSLLQPLPLMVLRVAPGAPSSGFAGGDSSGCPEALIPRLCRLVAFRVSPDLPPSDICRFRFSGLPQIGFLGGSMMNPRFARTLHPRSIQLTSLQVAPKFPLPAAPRMNLQTHSGLAFLPTLRCSLNLYPLFARRRTGLLRTTINQFRIACRAGLQSVSLQ |
Ga0005458J37252_100774 | Ga0005458J37252_1007742 | F089618 | GLVVQPVLRPSATPAANLQLSLPLPPLAAPVSNIRLASAALPPARPRANPPARIGVFSPGSTGGKHPACTVCYALPIDWLLTFQLALASGLQLGLRLLPTHIWCRPSARQVLRLPVLTGFRRNLRLAPAAAAAPTRAGGCPLLPHRPRTSDSHRLLFCRLCRSRLTQLALRALTSGWAFDAPLASTEPCIAG* |
Ga0005458J37252_101011 | Ga0005458J37252_1010111 | F058583 | PRVHGTIERTTGMLGVRSAANLSLNLRIAARRRCFSTCAGSMPRSIVKLKSLRRSWKLETAFHSLATTFAHHCEVNVPDLLLRFRTENPAEPVRSRTPPLRSVFEAEPGRNRRPLPVALRRSPALLRFHSISTPLEAHLQSPPDQSVRPSSSRGSLPDETLACPLLPSARSFRILPRITAFDALCSARLIVP |
Ga0005458J37252_101991 | Ga0005458J37252_1019911 | F038498 | *ANHRHARRSIRGESVFEFPDHRSPSPFFNMRRVNAATCEKLKSLRRNRRLETAFHSLTTTFSRHCEVKVPDLLLRLRAENLTEPVRSRTPSLHSVFETELGRILRPLPVVLRRSPALLRFHSISTPLEAYLQSPPDQSVQPSSSRGSLPDRTSAYPSLPLALSFRIWPRITVPDALCPARLIVP |
Ga0005458J37252_102241 | Ga0005458J37252_1022411 | F018180 | SGLLSLRLHFAHMPAVFFSTSPSFWKVNEACLLSDPSQIGRPFVTPFSAFSVRLVPVRNQPLSSAPCWRTVATIYPLGNCDSLKPETLLSYLTRLGPVSHRASSLSPFFAFTDSARTLPEELVNSASAVLPFGFPLEESCQPAWPDFSPVTWDYPGDLRRLSFVRLHLSR |
Ga0005458J37252_102306 | Ga0005458J37252_1023061 | F007311 | VPSCWPPHFAVAPVMDCSMLDAWHSLLHADCSALHFVCPVQTTDLTRCSTLSALRLLRIAPRSTPCARAGHGSLRATDCSVVQHLELCPACGLLRHRDLAFHAALGLLLVRRFPLSVAHGLLRARRLTLHAGLRIAPYADILRPCAVYGLLHFRH* |
Ga0005458J37252_102759 | Ga0005458J37252_1027591 | F021919 | SRSQTPRRHATAQSFASRIAGRHPFGSPPRFFKRNGSIPRGSPASFPSWNRHLEAAFHSPETTARFQATISRSKLPTYSFDTLPCIRPARSDPDSPTRPGSPRRAQDHYRNPVA*LLPGTSNPSSDLHSPSGPFGPLRIKAFNPISGRKVHLPSAPDCPSLPNIESILLVRCQITAPGPLSVQWLAVP |
Ga0005458J37252_102923 | Ga0005458J37252_1029231 | F041210 | MLSGTSQCGFPMEPVARDGLSLARNGCHLSAASIPGSTVLACHFAASQLASSPGTPLTPSPPLVCPSVGGFFAFRPDASPPAGTFGCFGCLHSPPGLLHPSGSKRSTVSAASRLAFRTRPISSRSPLTVLLLDFDRGSTFQVRYVSGGLLF |
Ga0005458J37252_102989 | Ga0005458J37252_1029891 | F080934 | YPTTQLPPGMHGRSCAICVVVRFCFLCSPHRDFARFGSALGSAPVGLTANRNLHLGTAFRSPNKTACFQAPLPRSMLLTYPFGSPLSLLRTRSIHPLVHAVRLAPDGANSTRQTRCPVPSERPRPFFRSPLPFGAFGTPPDQSADLNTSREVHQIVTPDFLRSPLPAVLKEPATDQRSRLATSRLAYCL |
Ga0005458J37252_103089 | Ga0005458J37252_1030891 | F048323 | YFDSLTGITESESGKSSEALKITGRPELHEKMKIRKNKKKEC* |
Ga0005458J37252_104278 | Ga0005458J37252_1042781 | F035644 | METPASPKEQALSEYVHSDYLTDSERDAEADIPVPDRLLPKREYRRLVEALRLSQKIGLLVWLNRQGLLSLGGRERLLYLQSKCSFEALEAGLRFARRLTEEAKLQSDFQHQMRELNRRPQSKTFRQSESRRIGVGYRDKGMLPEQSLRARRMAWEESFLPTESIPEVLLSVLQKYLPACLTEDGEWVDLSVFPGTFGSEDNPELKTLLHPL* |
Ga0005458J37252_104290 | Ga0005458J37252_1042901 | F093895 | PELLVPTFSTSHRSGITPCSTLASCAIHGVLRGRHRMPVLAADCSAINPLALRPAHELLRARHLAFRFVPGLLLARRFPPCVELGFLRVQPSRSVSSSDFSALVPSRFATLTDCSVHVAWLLVSRPERSVLSTSTRSYPPITRCATLCASHRSRITSCPTLDLPSPAQIAPRL* |
Ga0005458J37252_104559 | Ga0005458J37252_1045591 | F018933 | RVAGLSLRCSRRSRIAPRSTLDSHCLSQIAPRLALIDRCCSPIARRSTLDAPFRSQIAPRSTLRARGGLGSLLVRHLEPLPRLRIAPYSTLSFPLRSQIAPRSILPAPCRSRIAPRSAIDAQCRSTDISVRLHSAPCAVHGSLRAQHFMLRTGH*LLCVRRLHFARFADCSALLTLCLVLHTDCSALNT* |
Ga0005458J37252_105085 | Ga0005458J37252_1050851 | F022827 | TDHSALPSPDSLQVGDQVSSSSFPSPGHPCGRPAPVFSTGIQDQGCHPRYLQRFRRSLSTSPFPARSSPARTSQPAFRPAARCPEGCSLVPRRALRPHPGVEVSLAFSLDPAPHGFRFGIRAIPAVPIRVVSFKLWVLPQQLESACASYPACSPCVSIRLARRHFLLDLPRTRRSFSTLAGFQLRSYGPTVSPPSSLPAHFYMCRSSAGSSRFSFRMVALRLPPGCSEPIAFAESFFLLSGSL |
Ga0005458J37252_105615 | Ga0005458J37252_1056151 | F016972 | VRLNPLTDAIPVVNLYLTIYEFDKISFGAKTSRYGPYGLGYRRATITLTK* |
Ga0005458J37252_105847 | Ga0005458J37252_1058471 | F030100 | PKKADARKWTEASGIIPGDWGKVRPGWLAKPLPKQTARLRVGGRIHQFLWRRSRAVSKREKGTERRGMIRSYSEPGQVAQAGFELE* |
Ga0005458J37252_106141 | Ga0005458J37252_1061411 | F035644 | MGSPESPEDWALSEYVHSDYFSESERDAEADIPVPDRLLPKRDYRRLVGALRLSQKIGLLIWLNRMDLLSLGGKERLLYLQSKASYEALEAGLRFARRLSEEKKLQSDFMHSMRELNRRPQSKHFRQTSERRRIGVGYRDKGMLPEQSSRARKAAWEESFIPTESIPKGLLEILKRYLPSCLTEDEEWVDLSVFPGTFGSEGDSEMTKLLHPL* |