| Basic Information | |
|---|---|
| Taxon OID | 3300021178 |
| Scaffold ID | Ga0210408_10000004 |
| Source Dataset Name | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M |
| Source Dataset Category | Metagenome |
| Source Dataset Use Policy | Open |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Sequencing Status | Permanent Draft |

| Scaffold Components | |
|---|---|
| Scaffold Length (bps) | 411559 |
| Total Scaffold Genes | 393 |
| Total Scaffold Genes with Ribosome Binding Sites (RBS) | 310 (78.88%) |
| Novel Protein Genes | 10 |
| Novel Protein Genes with Ribosome Binding Sites (RBS) | 9 (90.00%) |
| Associated Families | 10 |
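
The RBS percentages listed above follow directly from the gene counts in the same table. A minimal Python sketch that reproduces them is shown here; the function name `rbs_percent` is hypothetical and is not part of IMG/M or any JGI tool.

```python
def rbs_percent(genes_with_rbs: int, total_genes: int) -> float:
    """Share of genes with a predicted RBS, as a percentage rounded to 2 places.

    Hypothetical helper for illustration only; not an IMG/M function.
    """
    if total_genes <= 0:
        raise ValueError("total_genes must be positive")
    return round(100 * genes_with_rbs / total_genes, 2)


# Counts taken from the Scaffold Components table.
print(rbs_percent(310, 393))  # all scaffold genes
print(rbs_percent(9, 10))     # novel protein genes
```
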

| Taxonomy | |
|---|---|
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | (Source: IMG/M) |

| Source Dataset Ecosystem |
|---|
| Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil → Forest Soil Microbial Communities From Barre Woods Harvard Forest LTER Site, Petersham, Massachusetts, United States |

| Source Dataset Sampling Location | |
|---|---|
| Location Name | USA: Massachusetts |
| Latitude (°) | 42.481016 |
| Longitude (°) | -72.178343 |
| Altitude (m) | |
| Depth (m) | |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F001574 | Metagenome / Metatranscriptome | 669 | Y |
| F015364 | Metagenome / Metatranscriptome | 255 | Y |
| F018971 | Metagenome / Metatranscriptome | 232 | Y |
| F020232 | Metagenome / Metatranscriptome | 225 | Y |
| F022039 | Metagenome / Metatranscriptome | 216 | Y |
| F027575 | Metagenome / Metatranscriptome | 194 | Y |
| F045388 | Metagenome / Metatranscriptome | 153 | N |
| F077529 | Metagenome / Metatranscriptome | 117 | Y |
| F077545 | Metagenome / Metatranscriptome | 117 | Y |
| F105573 | Metagenome / Metatranscriptome | 100 | Y |

| Protein ID | Family | RBS | Sequence |
|---|---|---|---|
| Ga0210408_10000004124 | F077545 | GAGG | MTSLHTKLVLSALGIALLSGPAVARQLPDQSNVVVNGQIVGADPDAQIRTQLLREWDFRYGE |
| Ga0210408_10000004129 | F015364 | AGGAG | MPSRKKLPDQPPYLKRRTFEQGRAEVDAAHVATCRLYCDALDLWRRCAEPLCRRHRHCLGEPTGCFVRGLHRVSARRRLEAQKVVIAGGPRRIAPATHIEWTVRRSALKQIVSWGLG |
| Ga0210408_10000004131 | F001574 | GAGG | MATLKLGTLAFADWRAGGLFRPAAEKFAGAWLACLLVMARGNVFAAFSPEHLYLATVCGTVGAIVTVALLLQMDRTTNSVGRQATISAVVTLIGDVFAHPSHFPPQWAEPLVTAAVSAGIAVALWYGKRWAGFAY |
| Ga0210408_10000004138 | F027575 | N/A | MTVGRPAKPDRLAALIAWMLVVRVDNLVKDGAKRRRLRRSTILDKIINANRVLSKRSRPLSPPSDIFLIPHKIA |
| Ga0210408_10000004216 | F018971 | AGGA | MSDSGRKFAGAEHVAPPLAPIGINDILRDADVLLALMGQEPLGVVAELANTLTVSAHEAGARDIEEAASNVRRLASGRGPVALTGAMRALTDAIARTERAFAA |
| Ga0210408_10000004265 | F077529 | GGAGG | MRNRWRAGCLAAGMVAVGVAPTVAQAPVQAPRTYECTANAHCSVSCQVDGEKQMQTGAPKTITVTPIAPNNYVVELVEQNGHVQTLYLAGTKVACNLDGLTRKAE |
| Ga0210408_1000000446 | F045388 | GAGG | VRSSRLTDDDLWRVIADNTDAMSVLIEKQFELDADAGTPDPDTRQKQMLFNVQAIDNYNRQYRDCIAEIRRRHPSI |
| Ga0210408_1000000447 | F020232 | GGAGG | MAEPAAEPIPPDFAEAFAYAVLVYEIWTPEEPGRLIQIGSRSYSIIEVCRFVDQFADRLPERVYLKLRSYLRDDPDGKLKADLAADPSYATAVRCLRGMMQRRTEVHKQRGGTEG |
| Ga0210408_1000000448 | F022039 | GGAGG | MKNDPADFDKIDAALMSYELSDEALEAAARVDRGTAITVGYCATASNAWYCMPL |
| Ga0210408_1000000450 | F105573 | GGAG | MKTRHDQQKPAQGRELLESIGREIRTNIDLAEPIPDRLNELIEQLVLRIDEREKEREEV |
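
Every protein ID in the table above begins with the scaffold ID `Ga0210408_10000004`, followed by a run of digits. Assuming (this is an observed pattern in this record, not a documented IMG/M rule) that the trailing digits are the gene's number on the scaffold, a short Python sketch can separate the two parts. The names `SCAFFOLD_ID` and `gene_number` are hypothetical.

```python
SCAFFOLD_ID = "Ga0210408_10000004"  # scaffold ID from the Basic Information table


def gene_number(protein_id: str, scaffold_id: str = SCAFFOLD_ID) -> int:
    """Return the digits that follow the scaffold ID in a protein ID.

    Assumption: protein ID = scaffold ID + gene number, as the IDs in this
    record suggest. Hypothetical helper, not an IMG/M API.
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id!r} does not start with {scaffold_id!r}")
    suffix = protein_id[len(scaffold_id):]
    if not suffix.isdigit():
        raise ValueError(f"unexpected suffix {suffix!r} in {protein_id!r}")
    return int(suffix)


print(gene_number("Ga0210408_10000004124"))
print(gene_number("Ga0210408_1000000446"))
```

Read this way, the ten listed proteins fall into two groups along the scaffold: genes 46 to 50 and genes 124 to 265.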