NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0209166_10053073


Overview

Basic Information
Taxon OID: 3300027857
Scaffold ID: Ga0209166_10053073
Source Dataset Name: Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: DOE Joint Genome Institute (JGI)
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 2357
Total Scaffold Genes: 8
Total Scaffold Genes with Ribosome Binding Sites (RBS): 7 (87.50%)
Novel Protein Genes: 6
Novel Protein Genes with Ribosome Binding Sites (RBS): 5 (83.33%)
Associated Families: 6
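
The RBS percentages listed above follow directly from the gene counts. A minimal sketch of that arithmetic, assuming each percentage is the RBS-bearing count divided by the corresponding gene total (the function and variable names are illustrative and not part of NMPFamsDB):

```python
def percent(part: int, whole: int) -> str:
    """Format part/whole as a percentage with two decimal places."""
    return f"{100 * part / whole:.2f}%"

# Counts taken from the Scaffold Components listing.
total_genes, total_genes_with_rbs = 8, 7
novel_genes, novel_genes_with_rbs = 6, 5

all_genes_rbs = percent(total_genes_with_rbs, total_genes)
novel_genes_rbs = percent(novel_genes_with_rbs, novel_genes)

print(all_genes_rbs)    # 87.50%
print(novel_genes_rbs)  # 83.33%
```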

Taxonomy
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Angelobacter → unclassified Candidatus Angelobacter → Candidatus Angelobacter sp. Gp1-AA117 (Source: UniRef50)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire.

Source Dataset Sampling Location
Location Name: USA: Pennsylvania, Centralia
Coordinates: Lat. (°) 40.7999, Long. (°) -76.3402, Alt. (m) not given, Depth (m) not given

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F011259  Metagenome / Metatranscriptome  293                  Y
F011803  Metagenome / Metatranscriptome  287                  Y
F023966  Metagenome / Metatranscriptome  208                  Y
F044745  Metagenome / Metatranscriptome  154                  Y
F070583  Metagenome / Metatranscriptome  123                  Y
F081702  Metagenome / Metatranscriptome  114                  Y

Sequences

Protein ID           Family   RBS    Sequence
Ga0209166_100530731  F011259  N/A    MSRQDNRVLVRTGARELTIEEVDQVSAAMAHTNVCTAIMATSTVTGPGDGDGCSDTDRDANFI
Ga0209166_100530732  F070583  AGGAG  MNDESRVLIRRGARELDREETKKVSGGLNTQTACTWDPDLGGRDGDGPPEC
Ga0209166_100530733  F081702  AGGAG  MNEQDRVLIRRGARELNSEEVERVGGGYNTLVITFGPNGRDGDGLLGDS
Ga0209166_100530734  F011803  AGGAG  MNDQNRVLIRRGARELTPTESALVNGGFITFSVCTNSPSPDGDQHVGEVGC
Ga0209166_100530736  F044745  AGGAG  MNNRNRVLIRQGARDLNEHEVAVVSGGLTTLTACTLAAAGTLDGDTFHSECGSPDQ
Ga0209166_100530737  F023966  AGGAG  MDHDNRVLTRRGARELTQDEVESVQGALKVAHTLTPCFIDKKQQLLNGDQAIGECGP
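
For downstream use, the rows above can be written out as FASTA records. The following is a rough sketch only: the header layout (`family=` and `rbs=` tags) and the function name `to_fasta` are hypothetical choices, not an NMPFamsDB export format. It also assumes that the RBS column holds the motif `AGGAG` and that each protein sequence begins with `M`.

```python
# Rows transcribed from the Sequences table: (protein ID, family, RBS, sequence).
# Assumption: the RBS motif is "AGGAG" and every sequence starts with "M".
ROWS = [
    ("Ga0209166_100530731", "F011259", "N/A",
     "MSRQDNRVLVRTGARELTIEEVDQVSAAMAHTNVCTAIMATSTVTGPGDGDGCSDTDRDANFI"),
    ("Ga0209166_100530732", "F070583", "AGGAG",
     "MNDESRVLIRRGARELDREETKKVSGGLNTQTACTWDPDLGGRDGDGPPEC"),
    ("Ga0209166_100530733", "F081702", "AGGAG",
     "MNEQDRVLIRRGARELNSEEVERVGGGYNTLVITFGPNGRDGDGLLGDS"),
    ("Ga0209166_100530734", "F011803", "AGGAG",
     "MNDQNRVLIRRGARELTPTESALVNGGFITFSVCTNSPSPDGDQHVGEVGC"),
    ("Ga0209166_100530736", "F044745", "AGGAG",
     "MNNRNRVLIRQGARDLNEHEVAVVSGGLTTLTACTLAAAGTLDGDTFHSECGSPDQ"),
    ("Ga0209166_100530737", "F023966", "AGGAG",
     "MDHDNRVLTRRGARELTQDEVESVQGALKVAHTLTPCFIDKKQQLLNGDQAIGECGP"),
]


def to_fasta(rows, width=60):
    """Return a FASTA string with one record per row (hypothetical header layout)."""
    records = []
    for protein_id, family, rbs, seq in rows:
        header = f">{protein_id} family={family} rbs={rbs}"
        # Wrap the sequence at a fixed line width, as is conventional for FASTA.
        wrapped = [seq[i:i + width] for i in range(0, len(seq), width)]
        records.append("\n".join([header, *wrapped]))
    return "\n".join(records) + "\n"


fasta = to_fasta(ROWS)
print(fasta)
```

Keeping the family and RBS in the header line means the table can be reconstructed from the FASTA file alone, at the cost of a non-standard header.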




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"