Basic Information | |
---|---|
IMG/M Taxon OID | 3300022547 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111485 | Gp0147102 | Ga0212126 |
Sample Name | Wilbur_combined assembly |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 456199054 |
Sequencing Scaffolds | 26 |
Novel Protein Genes | 26 |
Associated Families | 21 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Archaea | 10 |
Not Available | 7 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 3 |
All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 1 |
All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Bacterial And Archaeal Communities From Various Locations To Study Microbial Dark Matter (Phase II) |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Sediment → Hot Spring Sediment → Bacterial And Archaeal Communities From Various Locations To Study Microbial Dark Matter (Phase II) |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Water (non-saline) |
Location Information | |
---|---|
Location | USA: California |
Latitude (°) | 39.0314 |
Longitude (°) | -122.4323 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F011196 | Metagenome / Metatranscriptome | 294 | Y |
F014681 | Metagenome / Metatranscriptome | 261 | Y |
F021710 | Metagenome | 217 | Y |
F024682 | Metagenome / Metatranscriptome | 205 | Y |
F025653 | Metagenome / Metatranscriptome | 200 | Y |
F028081 | Metagenome / Metatranscriptome | 192 | Y |
F030484 | Metagenome / Metatranscriptome | 185 | Y |
F035541 | Metagenome / Metatranscriptome | 172 | Y |
F036103 | Metagenome / Metatranscriptome | 170 | Y |
F037495 | Metagenome | 168 | Y |
F038920 | Metagenome / Metatranscriptome | 165 | Y |
F055311 | Metagenome / Metatranscriptome | 139 | Y |
F058721 | Metagenome / Metatranscriptome | 134 | Y |
F060598 | Metagenome / Metatranscriptome | 132 | Y |
F073072 | Metagenome | 120 | Y |
F073077 | Metagenome | 120 | Y |
F091298 | Metagenome / Metatranscriptome | 107 | Y |
F096272 | Metagenome | 105 | Y |
F098703 | Metagenome | 103 | Y |
F100933 | Metagenome / Metatranscriptome | 102 | Y |
F101433 | Metagenome / Metatranscriptome | 102 | Y |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0212126_1000844 | All Organisms → cellular organisms → Archaea | 26196 |
Ga0212126_1001087 | Not Available | 22254 |
Ga0212126_1002468 | All Organisms → cellular organisms → Archaea | 12724 |
Ga0212126_1003131 | All Organisms → cellular organisms → Archaea | 10769 |
Ga0212126_1003244 | All Organisms → cellular organisms → Archaea | 10506 |
Ga0212126_1003752 | All Organisms → cellular organisms → Archaea | 9367 |
Ga0212126_1003887 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 9102 |
Ga0212126_1005115 | All Organisms → cellular organisms → Archaea | 7483 |
Ga0212126_1005947 | All Organisms → cellular organisms → Archaea | 6647 |
Ga0212126_1007780 | All Organisms → cellular organisms → Bacteria | 5423 |
Ga0212126_1012260 | All Organisms → cellular organisms → Archaea | 3845 |
Ga0212126_1018498 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2835 |
Ga0212126_1022460 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 2456 |
Ga0212126_1031973 | All Organisms → cellular organisms → Archaea | 1900 |
Ga0212126_1035440 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium | 1761 |
Ga0212126_1036597 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1721 |
Ga0212126_1036746 | Not Available | 1717 |
Ga0212126_1063230 | All Organisms → cellular organisms → Archaea | 1154 |
Ga0212126_1064312 | Not Available | 1140 |
Ga0212126_1068451 | All Organisms → cellular organisms → Bacteria | 1089 |
Ga0212126_1072091 | Not Available | 1049 |
Ga0212126_1079004 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 981 |
Ga0212126_1112192 | Not Available | 763 |
Ga0212126_1129830 | Not Available | 687 |
Ga0212126_1139463 | Not Available | 654 |
Ga0212126_1185957 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 536 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0212126_1000844 | Ga0212126_100084422 | F024682 | LSAEDELIAKLKQEIGKTVPSMFAAMAENMLESNRDVIINWLKENKDLVKQVIES |
Ga0212126_1001087 | Ga0212126_10010876 | F025653 | MSDPIKYFETKLKAMNLAELQAYKKRLDERIQKMIMDTAPNEQIAPLILYRGILEHEIETRTKQR |
Ga0212126_1002468 | Ga0212126_10024688 | F098703 | MVKKVHIIINLLSEASKKSDSQIEEKIRNEAKIPLCSNIEDVSVEDTEESYNNLKKHGISSNVARNLLDLYTE |
Ga0212126_1003131 | Ga0212126_10031318 | F036103 | LKNWNNIIRILGIGTCSVGAFLMAFGEPIIGADHAGIATVADITGIGLIGTGNTSSFAGKKKEEQ |
Ga0212126_1003244 | Ga0212126_10032448 | F058721 | MLRATLPCADPQCKDEMRLVFQNERFLGYRCLLKPNSHNFRYDIEHKRWEKIIIKTKPIIGYKESPYDIALDEEVAAETI |
Ga0212126_1003752 | Ga0212126_10037521 | F028081 | MDQEEALQRLQKTRIENEQAYLKAKAFLDGFRARGQLSQNDSEFLFLLEFVIKGFKNHGNDIIKAFENQVRFNEAFNNMQAKVNDLEEEIKQLRITLDKMYHDR |
Ga0212126_1003887 | Ga0212126_100388710 | F055311 | MTEDILDYWIRLIKTIFPENAWITSRFFKNDCLIDIDWKLEDDPENPNKRSKKIEIIIKAATLENYLDKNKKERELSDMMLKEFISEQYNRFSSDYEISTSQYVPKEKWLITSDVLNCRPSVDTPPNL |
Ga0212126_1005115 | Ga0212126_10051152 | F038920 | LLTEIVTDEQLIDLYTTAGYLVAVDYPKEEVKLHTVDCMLADPISSVGVKPSKARANKTGEFWFSESREEANSKAEEIAKKRGYTYTVCPICNR |
Ga0212126_1005947 | Ga0212126_100594713 | F096272 | MGEYFLSWMEDLKKLHCKPEVSIAEINDFILQHPKWAASIIRNAIGFEMYREFCLCCKNFDRCCEKLGAIKRETRLGCICNEFLNQEYSEANRPRLRAYFEEVAVLLNI |
Ga0212126_1007780 | Ga0212126_10077803 | F101433 | MKRRVIFAAVVLVGLAAVAVGIARGDFETIHRFAAQI |
Ga0212126_1012260 | Ga0212126_10122604 | F073077 | QWSLVEESGFVNRTLIIGSMPQVNADCVRWMQPFPDMEQYDLLIIDLASFPKDYPPTLFTNIGLLKRTARLFIRDNKEIFCIMEKPLKILFKQIPLNYSWIPFPQKLTVNPMILGRTVVVTDERFSEYMKNVDKWENELFWTDTTNCSFAPIAVNKSENAIAATITINDRGKIHFLPRTTRISRAKSIKLLMNLSNREEGQNEPSWLGSVEIPDSKRPQDQWNSSVPAEEYRKLFSVNHKNVIKAVQIMLEDMGIQTLQNAEFGLLGLKENIVVKVASAKGKIEAQNPKVNQLARFIEHQRRNEKIVFIANTYSGLPPNERANREHLDLSVKLFFETSNVVFLTTLSLYNLWKKVITFQISVKEASFLLHNEKGEI |
Ga0212126_1018498 | Ga0212126_10184984 | F021710 | MRRDKPHSTLTEFRLCKEFGWTPKALARQPAKTVEAFVVIMNEMDRQTEEEMRKAKREVKQGVR |
Ga0212126_1022460 | Ga0212126_10224606 | F014681 | MRSLKECIHGAVDLGVVLMVIVAFAGLMVIAYIIWTVRGQLTGPSAAANETLDAITGGFDDAVGLILVAITIFVLAIAISALLMLRGRS |
Ga0212126_1031973 | Ga0212126_10319734 | F021710 | MKRGKPHPALMEFRLCKEFGWTPMSLARQPAKTVESFVVIMNEMDRQTEEEMRKAKQEARHVR |
Ga0212126_1035440 | Ga0212126_10354403 | F011196 | MAADGLDPKREEFLRLARAAFERMFGSDGQNGLVTFTEREERACEITDELARWLMAEHLAQDSAGEAGVQRDCPLCGGPVQYASAEQAEQEVRELMTRRGKIEYRRAAVRCPRCRKIFFPAG |
Ga0212126_1036597 | Ga0212126_10365973 | F073072 | MKKKAIITIILIEDPEAYKSKTNSDIEKEILEEIGPIPYAASVEKVTVIDFQKETKTRPT |
Ga0212126_1036746 | Ga0212126_10367461 | F035541 | VKRLALIITLVAGLFLVGATAVFAATTDLNRIPITDTLEVGTMEWDLTWRYSDDFERGRKLSSRLFAALFDNFEFGMSWGISRRVHELGYNPYVRGAGPVEFSMKYKILDEYDGGFPVSLAVGAEGITGNYQRTGMDPTYYGVIGFHDVHIGGWWDWYVGIAHNPTGFDDDDNSIFGGFKYWINEDWQFNADYWGRNDNSDYVISGGVNYDWCNHLGFQGWVERDSITEDNVFVLQMIARADMRDLTAQVSDPE |
Ga0212126_1063230 | Ga0212126_10632301 | F098703 | MVKKVHIIIDLLPEASKTSNSQIEEKIRNDAKIPWCSNIEQVSVEDTEASYMKLKKHGISSKVARNLVDLYTE |
Ga0212126_1064312 | Ga0212126_10643123 | F030484 | MVDKKEEAIDEAISKMSQIKKAAGDFRENVAGLVKDVNIESTDWRFNVESHNEGVTIDIAIKLLITKKEEDPETSN |
Ga0212126_1068451 | Ga0212126_10684511 | F060598 | VKGKIGNTGRILKAEELVLAYDLDARLWLAVRVYQGTKKLSKGLVEIVRELLHHRGQLKGLLRLFFDKGGYCGQIFRTLVDCPDVRFYTPAVRYSTNVKQWEQLKEADFDSEPFVFDKHADLPAKERPVYRLADTEMTINVREGRKVVGTVTLRAVVLHDPQGEKLAERWPVVMLTDDRQINARTLLNEYGDHWGQEFGHRIGRHDLYLDILPPGYVLKTWRDDQGELHREVEYDQTAFFLSAWLRCLVFNLMTRFAQAMGGEYTKMWAGTLLRKFIRRPATLYLVGKELHIVFDPFPGQEELQPLLDQLNAKRTALPWLNNLVVQFSIAQDEPLYPLTEPEKRNRLFGDG |
Ga0212126_1072091 | Ga0212126_10720912 | F100933 | MEIKLEGNKLIIEAIVSSGVPSKSEKTLVVASTNGFVEVPGTNLKVSLNVVKPRR |
Ga0212126_1079004 | Ga0212126_10790042 | F021710 | MRRGKPHPALMEFRLCKEFGWTPTSLARQPAKTVESFVVIMNEMDRQTEEEMRKTKRETRYRVH |
Ga0212126_1112192 | Ga0212126_11121921 | F037495 | RTMNNTDDSASHEEQLATRPMDYSDLMAMTRPLVMTEDVLRSLRGDDVFRVGFGTSAEPADRRAEFALRIGEILCPGGECDVARESISASGKTGSNLKLKIHLGMS |
Ga0212126_1129830 | Ga0212126_11298302 | F038920 | LLTEILTNEQLINLYTTPGYLVAVDYPKKEVTLHTVDCMLADPISSVGVKPSKARENKTGEFWFSESREEANSKAEEIAKKRGYTYTICPICNR |
Ga0212126_1139463 | Ga0212126_11394631 | F025653 | MSDPIKYFETKMKAMNLAELQAYKKRLDESITQKIAATAPNEQIAPLILYRGILEHEIETRTKPR |
Ga0212126_1185957 | Ga0212126_11859571 | F091298 | LWVASPGQALEHLSVGEPIQNMQITDLLDLDPLVVSRWLCGEDMRGIYQPIVVRHSALDGLDLEGRTFYEIVELVDCRIATAHFKYAYFYSSLLVEDCVFGGDFEGRGLQGDGRMVFHNTIFAGWADFSGISVRGRADLVDVSFPGGTNLLRILSNGSRALLGHEINFSRCRFRPVDI |