Basic Information | |
---|---|
Taxon OID | 3300005531 Open in IMG/M |
Scaffold ID | Ga0070738_10000006 Open in IMG/M |
Source Dataset Name | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2 |
Source Dataset Category | Metagenome |
Source Dataset Use Policy | Open |
Sequencing Center | DOE Joint Genome Institute (JGI) |
Sequencing Status | Permanent Draft |
Scaffold Components | |
---|---|
Scaffold Length (bps) | 813319 |
Total Scaffold Genes | 711 (view) |
Total Scaffold Genes with Ribosome Binding Sites (RBS) | 568 (79.89%) |
Novel Protein Genes | 8 (view) |
Novel Protein Genes with Ribosome Binding Sites (RBS) | 7 (87.50%) |
Associated Families | 8 |
Taxonomy | |
---|---|
All Organisms → cellular organisms → Bacteria | (Source: IMG/M) |
Source Dataset Ecosystem |
---|
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire. |
Source Dataset Sampling Location | ||||||||
---|---|---|---|---|---|---|---|---|
Location Name | USA: Pennsylvania, Centralia | |||||||
Coordinates | Lat. (o) | 40.7999 | Long. (o) | -76.3402 | Alt. (m) | Depth (m) | Location on Map | |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F000043 | Metagenome / Metatranscriptome | 3655 | Y |
F003145 | Metagenome / Metatranscriptome | 505 | Y |
F006635 | Metagenome / Metatranscriptome | 368 | Y |
F017542 | Metagenome / Metatranscriptome | 240 | Y |
F018220 | Metagenome / Metatranscriptome | 236 | Y |
F028956 | Metagenome / Metatranscriptome | 190 | Y |
F030503 | Metagenome / Metatranscriptome | 185 | Y |
F031275 | Metagenome / Metatranscriptome | 183 | Y |
Protein ID | Family | RBS | Sequence |
---|---|---|---|
Ga0070738_10000006188 | F006635 | GAGG | MNRLLLPTGLFLLAILAAVAASSPPSTASWHLTSESETLVPDATLRAQIFRALDAAEHAGTDPSISHFKVRAATVVTVDGEDHVVLGGNTEYEVPEAIHGESSLLNHVTSLYGPDATRHAVRFVAFFSEHCGAGASCGDCRDYQHAVTDYQRLLVVCGQSSDHTVRISRFADQLVEEEKFPMASPESVPITAAELERLTRAASEARLGGVTLFSTTRHTGAAGLSYTGKIYRAAGGDDAAFHYRYPIGGLLQQADTERDYYLRAIVVVGEKGQWPVVSYRDRQYGFESSSFNHEAGKLPIALILADGLGHYRATTFEAALPHAFSTAEFMPQAVKDFLASHTPTAK* |
Ga0070738_10000006292 | F031275 | GGCGG | MMAAFILAFSLLTLLMFFVSYCRSLTAASMHESLTEEVLDVTGIREKAYGRDYARVVQLLRLCPDRNDDRSGLQVVGAYYSFLNFLDQTMSKLSPAVHTWAEEERASCAHFAAVALGRRIALNREMMAQQAEL* |
Ga0070738_10000006456 | F003145 | GAGG | MQAREAYLDRVTTEVEEVASRVAQLKGRFAKQKVSVKLDHYWELDHVRSRFAEFKRRVQELEDADDLELATLQEEVDLAWKDLMQAVDALMAALH* |
Ga0070738_10000006481 | F000043 | GGA | LPGLRGLSVSEFRATPTAAPDYEKGVTVSFSTDGDRDAFLRRLEEYFAPRRFTNAADAFDTVKAYVLESSAKEN* |
Ga0070738_10000006547 | F018220 | AGGAGG | VQKIFRCKECQRGMRVVGSTGLGKEIERTVLCPYCKKKNKVKWPRGDTFNVQRVGTR* |
Ga0070738_10000006564 | F028956 | GAGG | MFAVLAFGWLNLAFRAWTRDVAHWLILVSVAVAAVFVFFSVSSFRRARQL* |
Ga0070738_1000000670 | F030503 | GGAGG | MQGLRAPLATLLFVGLSASGTLAGGKPSASLGTILTAENARVGDSKAEVGTTVYSGDHLFTDRQGSVQIRAGAARLLLQSASAAVVNDDGGAPSAKLFLGTATFSTGNAKAFTLYASRAAIRAQSDAPTIGQVTYISEKELLVVSKRGTLTITVDGDTQVIEEGAAYRVVLEPPAMAQGPEGAGSGKHDKGMGGPPLKAGRSYFLVVAVGITALITAFAVSEALESPSRP* |
Ga0070738_1000000672 | F017542 | N/A | MRKTLLLLSLFLLLPSGCKNQPEIDYSSLDQAGMFSTTLNEIKKIKVSNAEIQQLAALKRAGAGDDFCLALLKAARSHNHDFTSGDSAVALSRAGYSDAQILEMAQADKVDELSDDAVMLKLMGLSNSTVQTVMQRREQGLPTLTSEQIGRLKNTGMSEQKILEFINKGLSGPQAESEISKLEAERDHSHTSFVRVHGRRR* |
⦗Top⦘ |