Basic Information | |
---|---|
IMG/M Taxon OID | 3300025833 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0111384 / Gp0110937 / Ga0210009 |
Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG12_big_fil_rev_8/21/14_0.65 (SPAdes) |
Sequencing Status | Permanent Draft |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Published? | Y |
Use Policy | Open |

Dataset Contents | |
---|---|
Total Genome Size (bp) | 550407140 |
Sequencing Scaffolds | 22 |
Novel Protein Genes | 24 |
Associated Families | 21 |

Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Nitrospinae → Nitrospinia → Nitrospinales → Nitrospinaceae → Nitrospina | 1 |
Not Available | 11 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → Ignavibacteriales → unclassified Ignavibacteriales → Ignavibacteriales bacterium CG18_big_fil_WC_8_21_14_2_50_31_20 | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 1 |
All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 1 |
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 2 |
All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 3 |

Ecosystem Assignment (GOLD) | |
---|---|
Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
Type | Environmental |
Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |

Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |

Location Information | |
---|---|
Location | USA: Utah: Grand County |
Latitude (°) | 38.9383 |
Longitude (°) | -110.1342 |
Altitude (m) | N/A |
Depth (m) | N/A |

Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F001003 | Metagenome / Metatranscriptome | 808 | Y |
F014357 | Metagenome / Metatranscriptome | 263 | Y |
F021923 | Metagenome / Metatranscriptome | 216 | N |
F037060 | Metagenome / Metatranscriptome | 168 | Y |
F037574 | Metagenome / Metatranscriptome | 167 | N |
F042607 | Metagenome | 158 | Y |
F044301 | Metagenome | 154 | N |
F047487 | Metagenome / Metatranscriptome | 149 | N |
F054576 | Metagenome / Metatranscriptome | 139 | N |
F057887 | Metagenome / Metatranscriptome | 135 | N |
F057888 | Metagenome / Metatranscriptome | 135 | N |
F059646 | Metagenome | 133 | Y |
F060595 | Metagenome / Metatranscriptome | 132 | N |
F071917 | Metagenome | 121 | Y |
F071960 | Metagenome | 121 | N |
F075685 | Metagenome | 118 | Y |
F083231 | Metagenome / Metatranscriptome | 113 | Y |
F091328 | Metagenome | 107 | N |
F096615 | Metagenome / Metatranscriptome | 104 | N |
F096627 | Metagenome | 104 | N |
F102531 | Metagenome | 101 | N |

Scaffold | Taxonomy | Length (bp) | IMG/M Link |
---|---|---|---|
Ga0210009_1000049 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Nitrospinae → Nitrospinia → Nitrospinales → Nitrospinaceae → Nitrospina | 142528 | Open in IMG/M |
Ga0210009_1033753 | Not Available | 2129 | Open in IMG/M |
Ga0210009_1044701 | All Organisms → cellular organisms → Bacteria | 1787 | Open in IMG/M |
Ga0210009_1077263 | All Organisms → Viruses → Predicted Viral | 1267 | Open in IMG/M |
Ga0210009_1091107 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Ignavibacteriae → Ignavibacteria → Ignavibacteriales → unclassified Ignavibacteriales → Ignavibacteriales bacterium CG18_big_fil_WC_8_21_14_2_50_31_20 | 1139 | Open in IMG/M |
Ga0210009_1109829 | Not Available | 1010 | Open in IMG/M |
Ga0210009_1145907 | Not Available | 842 | Open in IMG/M |
Ga0210009_1158008 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → unclassified Nitrosomonadales → Gallionellales bacterium RBG_16_57_15 | 800 | Open in IMG/M |
Ga0210009_1181115 | All Organisms → cellular organisms → Archaea → unclassified Archaea → archaeon | 732 | Open in IMG/M |
Ga0210009_1191267 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 706 | Open in IMG/M |
Ga0210009_1200403 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 685 | Open in IMG/M |
Ga0210009_1203699 | Not Available | 678 | Open in IMG/M |
Ga0210009_1206313 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 673 | Open in IMG/M |
Ga0210009_1224537 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 637 | Open in IMG/M |
Ga0210009_1246046 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Huberarchaea → Candidatus Huberarchaeum → Candidatus Huberarchaeum crystalense | 600 | Open in IMG/M |
Ga0210009_1258172 | Not Available | 582 | Open in IMG/M |
Ga0210009_1259470 | Not Available | 580 | Open in IMG/M |
Ga0210009_1279996 | Not Available | 552 | Open in IMG/M |
Ga0210009_1290374 | Not Available | 539 | Open in IMG/M |
Ga0210009_1297629 | Not Available | 530 | Open in IMG/M |
Ga0210009_1316912 | Not Available | 509 | Open in IMG/M |
Ga0210009_1320494 | Not Available | 505 | Open in IMG/M |

Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0210009_1000049 | Ga0210009_1000049140 | F083231 | MASKDGKIFDTIVYIGLGVNGLTAVYLLLMYFEVI |
Ga0210009_1033753 | Ga0210009_10337532 | F060595 | KQIKKYYTAFLGFISQKKFNQQFFFNTKTKKALLHEISFLEWTH |
Ga0210009_1044701 | Ga0210009_10447014 | F091328 | LRLVVILFIFLYDEGVGRKSVTILVLACLLVGGVAGGVFLYLKHQRTVLSNVAVEKSGDLPMELSVDFDPTVNSPKFMIESVDRKVKTFDLKSVFPPTFEGKRLTSRITCQEIKIVGPGDSVGEEVVYDVLMERMEGVSKEMMIFSGLCSDNTCAEIHQSCRLYLAKVAP |
Ga0210009_1077263 | Ga0210009_10772631 | F096615 | RERELNNYLDSADRLDNNDERIKELARDKFNALPNFYSPKGEKYRYTFMDDAMGSLKQEDFIAIARLLRDGEHLQAGKLLEASLMRVLVAEAEEEIEDEEND |
Ga0210009_1091107 | Ga0210009_10911071 | F060595 | KQIKKYYTAFLGFISQKKFNQQFFFNTKTKKALLHEISFLEWTHL |
Ga0210009_1109829 | Ga0210009_11098291 | F075685 | MSSTTTRTLYDPNTHPVAAAYVDANAAERHAARETWLASDEERGYRFAVYRTVGLRHCDSVPRRRESPHLCRPSELRHRVDLWARAIASALESQTATYWLICHAYPVHPRTGRSDHGLLAEYILAVHPDVPSCIGYLAHDWK |
Ga0210009_1145907 | Ga0210009_11459072 | F054576 | MEFQLGLPEIIAFVLANIGGGWALLRLSFAQFELRLDDRFELLDNAMADVKRIELEIVRADTRNAQTYVTQTSHDKVLERIFNVLSSMEQKLDGKANAADCEEKILRHMERK |
Ga0210009_1158008 | Ga0210009_11580082 | F054576 | EFQLGLLELIAFAVANIGGGWALLRISFTQFELRIKDQFELLDKAVNDVKRIELEIVRADTRNAQTYVTQANHDKVLERIFNVLSSMEQKLDGKANAADCEAKLLRHMERK |
Ga0210009_1181115 | Ga0210009_11811151 | F021923 | MNLTDFFNNYEFIKKLQNHLTVDKENFLNDFPNNNIEKSLNQLLLSPSNAYLSKKIIPQTFADMTWKFKLLEGQQTLTSPSSREFTLTYESQTSRSIFYYETPISGSCYVECDFVDVEPGTTNCIFLGNANNITSKTYLAIQTFPLSSFQGNKIGDGWVAIMEFDNSNIHNPNVLTWSYDIEPTGVFRLSKNTSTLKLHHNRTYSLGAKTTFIGEDLHAGF |
Ga0210009_1191267 | Ga0210009_11912672 | F059646 | VESITVAILAICIFLSFYFTFLSFQTIDDALKKQLVTLAASSLITGVIMLACITISLGIKKAFPRVDSKLRSRKEPSEHEG |
Ga0210009_1200403 | Ga0210009_12004032 | F037574 | MTGIFADEANEYTTITISKDTAKKLKEFFCGKETYNYVITFLLDYHDEKTYRF |
Ga0210009_1203699 | Ga0210009_12036992 | F054576 | MEFQLGLTELIAFVLANIGGGWALLRLSFAQFELRIKDQFKLLDKAVNDVKRIELEIVRADTRNAQTYVTQANHDKVLERIFNVLSSMEQKLDGKANAADCEAKHLRHMER |
Ga0210009_1206313 | Ga0210009_12063131 | F071917 | NTRESDIINYGVKKFDYSSEEMKKVIKRMVIKGKIHYIVHSKLEPPEVYISLKELLPPEIVKTLIEAFIQMKAGEEDVQKILDEAASIAEQIKQKHSRK |
Ga0210009_1212124 | Ga0210009_12121241 | F096627 | FGWQVERLPAALLQWISSAPSLLRLERIPKDGAAQRIHLFTAGDQGLSLEMDDDTARFVIFQTRRLLQESAVRWLALPAGAKKSTTAHDLPQPITFLPTTWKDSQLASRILKEQGMNTKTAKSTLAWAVSLEWVTALSKVKLEGQRNVVANQFVLCGNAKSIWGGRDEQTKVSFVSMAEKTINATIGEML |
Ga0210009_1224537 | Ga0210009_12245371 | F001003 | MEKFKKFKQENKDFLLPCNLSTKEKDIFMYLFRFLRKKIDQGIYPEIYNDEIIYTKPDVKSLCLKEIVLCKKYKKGWVITVNPHNITKNAECGFCGAKFNE |
Ga0210009_1246046 | Ga0210009_12460462 | F057888 | QLGTADVIETGIIQNFLGVPFIVSTNCPQYNSGNPDINETGVNWGTAGNTGILISKRKSGQKCSGGIYWKQKGKIDYIQNVEEMLHKFTLIMAFKCTHLQPTSICLIKTSKV |
Ga0210009_1246105 | Ga0210009_12461051 | F102531 | KLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK |
Ga0210009_1258172 | Ga0210009_12581722 | F071960 | EKGLAHHLVEGIHEACQNQGEAEFPEGFQERTGKTVLELLAMPLGEAIPLLKQGDGSPLSRLLPHRAFEGMLLMPRGADTICATCHEHGACRDWVIGAAEDVCLELDLVEPGEILSYIAAKVIEGDYDHNTSITQIAQEFRADVLITTEVRK |
Ga0210009_1259470 | Ga0210009_12594701 | F057887 | NEYVFDDLGNAYLVSRPWNPTEPIIFLSIDTEKEYEYNTNTYSFVDKPIVYTAGLGVFKLFSKILKKLKNITSTTRVASVLARIKNIRTGKKILYFNRKRGIWTAARDVSFMSLLYFGLSQGGKVATQTVNIAVSLIGLKMFMNFICEEAIQGAGMGLFIASASDMEAETLSIAIKNYKKVYAIAEDVIQFAD |
Ga0210009_1279996 | Ga0210009_12799962 | F042607 | IQLYNKIVASYTNLTNKGEDLFIDLFETDDNGTIIFLKPPSKRQTSFEVFLFLMAVMQQQHIRLMYKQVEDLCAQVNEKLKDK |
Ga0210009_1290374 | Ga0210009_12903741 | F047487 | ARPMTTLYGYIAAIVLIVFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAETDAKHREEMQNAEQNTIIATNSIADWYRAHPAVRVRYANTDCSAVPSTDNNPSVPDDSTASGYVSPYSPESTEQVASRLDQLQKLLRADGVRVE |
Ga0210009_1297629 | Ga0210009_12976291 | F044301 | TFSKGVSAVTKSPAQQAVAQKDAFAKNTIAGKEALIKNLSKVSVEEWKAKTIAGFDKLQAKVVRAVETGKWNAAKTLTAGKNAHAAVANMKKGTLNDSYERYLAAQKAVVAVYA |
Ga0210009_1316912 | Ga0210009_13169121 | F014357 | SGSKFTVKNTAFAGMCIVYDIYDAGDINNTMENGKDSKELTFLMYGQPTADVNTATDTQITKKNNPSEFCDFPFRGGVAAKRKIDLYGVVYSSRGADDDIAPSHYIHTNYLKLMKGRNVLFDSMRKGLPAQGKDVPTGSTFSAENGYDVGGEYSDLYQKDALIFDQPIT |
Ga0210009_1320494 | Ga0210009_13204941 | F037060 | KFSQMVENDYTLTTQPGNSLIIGIVQNTLELHTTVPDQTGTVANKIELTRMDQVIADKEIKVNNEDWYGAEIAVTFQDAVNARVDKMKLAKYFLQEQLTDIPDKMLAKALQDTSVTQRVYGGDATSVATLQTGDILTPKLFANAMLKIEESGYVPYCFLCSPSQANAI |