| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026804 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0075432 | Gp0054328 | Ga0207737 |
| Sample Name | Tropical forest soil microbial communities from Luquillo Experimental Forest, Puerto Rico - Sample 2 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 26809480 |
| Sequencing Scaffolds | 33 |
| Novel Protein Genes | 35 |
| Associated Families | 32 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria | 6 |
| Not Available | 15 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2 |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → unclassified Spartobacteria → Spartobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 2 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina → Desulfosarcina cetonica | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Tropical Forest Soil Microbial Communities From Luquillo Experimental Forest, Puerto Rico |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil → Tropical Forest Soil Microbial Communities From Luquillo Experimental Forest, Puerto Rico |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | forest biome → tropical forest → forest soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Luquillo Experimental Forest Soil, Puerto Rico | |||||||
| Coordinates | Lat. (o) | 18.0 | Long. (o) | -65.0 | Alt. (m) | N/A | Depth (m) | .1 | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001022 | Metagenome / Metatranscriptome | 804 | Y |
| F001962 | Metagenome / Metatranscriptome | 611 | Y |
| F003090 | Metagenome / Metatranscriptome | 508 | Y |
| F005929 | Metagenome / Metatranscriptome | 386 | Y |
| F006756 | Metagenome / Metatranscriptome | 365 | Y |
| F008136 | Metagenome | 338 | Y |
| F008160 | Metagenome / Metatranscriptome | 338 | Y |
| F011271 | Metagenome | 292 | Y |
| F011941 | Metagenome / Metatranscriptome | 285 | Y |
| F012657 | Metagenome | 278 | Y |
| F013380 | Metagenome | 272 | Y |
| F014508 | Metagenome | 262 | Y |
| F019273 | Metagenome | 230 | Y |
| F024236 | Metagenome | 206 | Y |
| F025163 | Metagenome / Metatranscriptome | 203 | Y |
| F029216 | Metagenome / Metatranscriptome | 189 | Y |
| F031453 | Metagenome / Metatranscriptome | 182 | Y |
| F034030 | Metagenome / Metatranscriptome | 175 | Y |
| F044096 | Metagenome / Metatranscriptome | 155 | Y |
| F057789 | Metagenome | 135 | N |
| F065953 | Metagenome | 127 | N |
| F067850 | Metagenome / Metatranscriptome | 125 | Y |
| F069634 | Metagenome | 123 | Y |
| F074315 | Metagenome | 119 | N |
| F075781 | Metagenome / Metatranscriptome | 118 | Y |
| F078037 | Metagenome | 116 | N |
| F083637 | Metagenome / Metatranscriptome | 112 | N |
| F091102 | Metagenome | 107 | N |
| F091204 | Metagenome | 107 | N |
| F096399 | Metagenome / Metatranscriptome | 104 | N |
| F097844 | Metagenome / Metatranscriptome | 104 | N |
| F104209 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207737_100274 | All Organisms → cellular organisms → Bacteria | 2744 | Open in IMG/M |
| Ga0207737_100459 | Not Available | 2366 | Open in IMG/M |
| Ga0207737_100489 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → Solibacteraceae → Candidatus Solibacter → Candidatus Solibacter usitatus | 2328 | Open in IMG/M |
| Ga0207737_100742 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2025 | Open in IMG/M |
| Ga0207737_101238 | Not Available | 1691 | Open in IMG/M |
| Ga0207737_101687 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1501 | Open in IMG/M |
| Ga0207737_102516 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → unclassified Spartobacteria → Spartobacteria bacterium | 1286 | Open in IMG/M |
| Ga0207737_102657 | All Organisms → cellular organisms → Bacteria | 1253 | Open in IMG/M |
| Ga0207737_102970 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → Pseudolabrys taiwanensis | 1189 | Open in IMG/M |
| Ga0207737_103496 | All Organisms → cellular organisms → Bacteria | 1102 | Open in IMG/M |
| Ga0207737_103530 | Not Available | 1099 | Open in IMG/M |
| Ga0207737_103677 | All Organisms → cellular organisms → Bacteria | 1074 | Open in IMG/M |
| Ga0207737_104203 | Not Available | 1015 | Open in IMG/M |
| Ga0207737_104223 | Not Available | 1013 | Open in IMG/M |
| Ga0207737_104846 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 950 | Open in IMG/M |
| Ga0207737_105860 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 865 | Open in IMG/M |
| Ga0207737_105979 | Not Available | 856 | Open in IMG/M |
| Ga0207737_106246 | Not Available | 836 | Open in IMG/M |
| Ga0207737_106630 | Not Available | 810 | Open in IMG/M |
| Ga0207737_110054 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 651 | Open in IMG/M |
| Ga0207737_110214 | Not Available | 645 | Open in IMG/M |
| Ga0207737_110735 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 629 | Open in IMG/M |
| Ga0207737_111092 | Not Available | 618 | Open in IMG/M |
| Ga0207737_111588 | Not Available | 603 | Open in IMG/M |
| Ga0207737_111955 | Not Available | 594 | Open in IMG/M |
| Ga0207737_112536 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 578 | Open in IMG/M |
| Ga0207737_113515 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 555 | Open in IMG/M |
| Ga0207737_114422 | Not Available | 535 | Open in IMG/M |
| Ga0207737_114988 | Not Available | 524 | Open in IMG/M |
| Ga0207737_115313 | All Organisms → cellular organisms → Bacteria | 517 | Open in IMG/M |
| Ga0207737_115316 | All Organisms → cellular organisms → Bacteria | 517 | Open in IMG/M |
| Ga0207737_115510 | Not Available | 514 | Open in IMG/M |
| Ga0207737_115924 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina → Desulfosarcina cetonica | 507 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207737_100274 | Ga0207737_1002747 | F057789 | MSETALQFTGTDLRGLLDVLLEALKVIRRTGRRPTITVNGKLYASHDLRKVAALLPDEELQPHHQALVNVMFAKKQRGYRKTAYNLADRVNSWRTEEQKLAARQAAKTSALAVAEPAPFRNRGFSERTIKALLDCSIDQPERLLFMQSADLKKIPGVGKASFDEIMRYRAKFIRSSVMKKNP |
| Ga0207737_100459 | Ga0207737_1004593 | F029216 | MYALIIVIGMLSTGASGSAVIPVGVTSQIVGKFKNLDECKAAASQPHAGGPISDFSLSTSWGANWYCTYTGAN |
| Ga0207737_100489 | Ga0207737_1004893 | F008136 | MREAVDTSGMARRRSTEEIEQELEQYRTSGLTQIEYCRQTGMVLSTLGRYLRRSGVEPERLIRVNLESAVAESAACFALVLGNGRRIESGWRFGEAELASLIRVVEGA |
| Ga0207737_100742 | Ga0207737_1007422 | F001022 | MKILEIVPRDRTRLYGALVAKEAAIRRSGRGTYSRVGRRSLGSARWKHKMYKGSVQLAHDPSEIVTAKVRAATPEDERKLLSSFLGFVDRHCGDYVDTITIQYRQIR |
| Ga0207737_101238 | Ga0207737_1012382 | F083637 | VDLLFGRGVLMKRTLVVICAVMLALGLAGCGWAGKTPIIGKGKAPAPVVTKG |
| Ga0207737_101687 | Ga0207737_1016872 | F012657 | RLLSASPVMIAASLHQMAEKDDPAVAPETLRLPEAVHAQFREKLFLYREANVLLALVDRVNPSSDDRDPLFEPVFWEYERIVFWELADPVIRATRRQSVIAALRDLNLRTDLGNGHDFALSWSRNWFAGIGHNEMYPARLERLSRFWSHEYSAVQKVLKAAVRTSRI |
| Ga0207737_102516 | Ga0207737_1025162 | F001962 | MILKSETYHFHRLDLTRQAGFIVTIYDEDGLRLASTPPMPTPTQAFEEARKVVDNKVEAPIKRGRPPPDL |
| Ga0207737_102516 | Ga0207737_1025165 | F019273 | MIHKVKTNKKYPFKQMQPGERFKLKDDDIRSAQKMAWYYRTRCKRPINVVIAKGDDGYHCQRID |
| Ga0207737_102657 | Ga0207737_1026571 | F104209 | KGHRLARHNGHADERTHAHSTLSAETTHVQVFYRFHPLYSSTLQILRRPKRGDGAVCVSDPMGRRLKIPMWMLLPNSAEMKIAEQAYLSKEALLSLVLLVSTPREIENRVHANLLQAVVDTCKGGQRATTTTPGAGDRKSGGHGADRRRDTNRTDRSHGPHSGGGLSNGRRKSR |
| Ga0207737_102970 | Ga0207737_1029702 | F078037 | SAALAQGPSTEKLKADAQEVVKIISSDSAKAQAYCETIKLGDQMDQAEQNNDSDKAEGLSKKMDELNQKLGPEYLKLGQDLQDIDPNSPDGLELGRTLAALDKLCK |
| Ga0207737_103496 | Ga0207737_1034964 | F074315 | METTDFTEQWSNMQKMFLASSEMSASFRENARQFWENQGKVLDNM |
| Ga0207737_103530 | Ga0207737_1035301 | F091204 | MSATVIEFPGARESGSKPAHDVQSAQVSNEVVAPLTDFEIAAIENLGTIFQAGGTEAFNFAAKRQLFILASLIKRILGEDGLNELLTAAEWV |
| Ga0207737_103677 | Ga0207737_1036771 | F083637 | VDSLFKQGHLMKKMLVVICSLVLALGLAGCGWAGKAPIIGKGKAPA |
| Ga0207737_104203 | Ga0207737_1042032 | F008160 | MYALVTVIAILSPATGSVTPVGVTSQTVGSFRTLDQCKAAAMQPHGEGAISDLSLTRGVYRYCAFAGETLRNNSRR |
| Ga0207737_104223 | Ga0207737_1042232 | F011271 | MGKNGGRRGDRRARIDFQLEPKERQALKLTEIREALVAAGYDTTAKQAAVLGVCRSTAWVLLNRDKRAGPSAKVIKRILSSPQVPERARRKVEQYVEQKVRGLYGHCESATRSFGNQFQH |
| Ga0207737_104846 | Ga0207737_1048461 | F044096 | VRSAILIFLAILVSATTPARAQGTWLETRMTRAICSSEATPVANTDRLARRL |
| Ga0207737_105860 | Ga0207737_1058602 | F097844 | MINPLDLLNKFISAIIAYGPKNKTKKARMARKAARRKRFAAMKRH |
| Ga0207737_105979 | Ga0207737_1059792 | F034030 | MRRREFILLGAYVIAGCVAAVTSADLALAQAKNSTSMEDRLSAKIRCQDFQKNSDGKWTSSSKAKIGKIDFSNHTFGVDEVDIGGADLATFLNRKCAAH |
| Ga0207737_106246 | Ga0207737_1062462 | F014508 | MFTWALVIFIGLWLILADIGPVRRAKLMGNPMLIHIIVIGSGLWIHGGSAEGAMAAVGSGVCSAIFVRYQRRMYGYIRRGQWYPGIFRHDDPRQGLKT |
| Ga0207737_106630 | Ga0207737_1066301 | F096399 | RSPEWSPAFPDMNTLNVDDEYWTIREVCELVKDDDGVLPDEIVDELKNAMHGVRRLLELLGRSRTYATGSQCLLELLDNRIKTFRTKR |
| Ga0207737_106630 | Ga0207737_1066302 | F024236 | MHLVLVFTVAVGLYVLGMIVLALAAILGSVRGNRELKKLEPEPRPGDY |
| Ga0207737_110054 | Ga0207737_1100542 | F065953 | MRITKIVPALVVFVYASPALAQQKVFEWQRGTEESVRLDPAN |
| Ga0207737_110214 | Ga0207737_1102141 | F011271 | GTLSTGPERRNLSTCEVGHTTFAPTAVTFLRSHLAAATKNMRSNRTRSVGRRSRIDFQLEPKERQALKLAEIREALVAAGYNTTAKQAAVLGIGRSTAWWLLNHNKRAGPSAKVIKRILLSPQIPKRVRRKVEQYVEEKVRGVYGHSEQRTQWFSEQFQRSPNS |
| Ga0207737_110735 | Ga0207737_1107351 | F097844 | LDLLNKFISAIIAYGPKNKTKKARMARKAARRKKLAAIKRH |
| Ga0207737_111092 | Ga0207737_1110921 | F069634 | VKVGIHSPNKPPVVVSLLLAILALIGYSVDSSFAFFIAMFAYIVGALGVLVEI |
| Ga0207737_111588 | Ga0207737_1115881 | F005929 | MRENMSLLSFWGLVFSLIAVTLFVVIYSYQKSKYANRRIVEDLRQRSVDVYLAAKTPEAESDSKRFREASNEIERLRAETRVLFWLIIAMNTAVIIIFILAYQYF |
| Ga0207737_111955 | Ga0207737_1119552 | F003090 | ARNAMEKKTVTDYKGYRIEVCPVGKGWRASIFSPGSIRPWPNSPANLEKSSAEELVAEAKRLIDARLGPQRL |
| Ga0207737_112536 | Ga0207737_1125361 | F091102 | DLDTVQIIQPGRFTVVSTEIDNPEVMQFRLNVLEHLRTHCAHAEGKYPAPPELFTLGRPDIAVGDIEVAHVSGSKFVRWFYPYRLLSPTLENYEILFCDGESNYFEMRTLIANGSRHKDVYDCRRGLYGMMHDENDPTSALLIVVPEGSYLFDYYVAVCRALTHEEPYLPQKH |
| Ga0207737_113515 | Ga0207737_1135152 | F031453 | IERAEWRPTMAGCKVIIHQHLDQTLTLMIAGHRVGHYSAQGKLLTPLTKKQVKAVEKTLRGKVQKQTFPLNLQIPHTTRDSHFPTASTTADF |
| Ga0207737_114422 | Ga0207737_1144221 | F075781 | KPVGQTRPTEERFLLRVDGQTKRSFSSKEAAATAGAAVKKAYPIVMVTIVDTEDGTSEIIKP |
| Ga0207737_114988 | Ga0207737_1149881 | F067850 | VVQLQLLDLVISRQCGAVPMDAATRAALIDLMARVLVVVFHE |
| Ga0207737_115313 | Ga0207737_1153132 | F011941 | MAEPRQNRGPFQIQVSYLFDRLLEPKLAQAYELLVPCREHPVGVKEFDDEDGGNLRKSV |
| Ga0207737_115316 | Ga0207737_1153162 | F025163 | MQLTLAFLEPSPSARPSPSQKLDAETCAEALNILGRIIAQACETTQHTEATDE |
| Ga0207737_115510 | Ga0207737_1155101 | F013380 | MSEFAIGQKIVCVCDDWKNAFFGSVRETGERYPVKDGVYTVIGHDWLLLADRPGVMIAEVSNDCIWAEQNFRPIEPRKTDISVFQKFLVNPKEKIDA |
| Ga0207737_115924 | Ga0207737_1159241 | F006756 | AERFEAELAEFRPAARRSRAQDRLFAFLDGLCSKAALAAYLRDIADSDRSLSRQITELLELIRQYGPDAVAGAIEKAATARAFGADYVANILRQQQCPRREQPPLRLRDPRLNELVTDPLSLLAYDAFILQPEKESDDTPGTETPRSESDGHEPPSGDDPL |
| ⦗Top⦘ |