NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Scaffold Ga0070730_10000002

Scaffold Ga0070730_10000002


Overview

Basic Information
Taxon OID3300005537 Open in IMG/M
Scaffold IDGa0070730_10000002 Open in IMG/M
Source Dataset NameSurface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1
Source Dataset CategoryMetagenome
Source Dataset Use PolicyOpen
Sequencing CenterDOE Joint Genome Institute (JGI)
Sequencing StatusPermanent Draft

Scaffold Components
Scaffold Length (bps)606246
Total Scaffold Genes576 (view)
Total Scaffold Genes with Ribosome Binding Sites (RBS)453 (78.65%)
Novel Protein Genes9 (view)
Novel Protein Genes with Ribosome Binding Sites (RBS)6 (66.67%)
Associated Families9

Taxonomy
All Organisms → cellular organisms → Bacteria → Proteobacteria(Source: IMG/M)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire.

Source Dataset Sampling Location
Location NameUSA: Pennsylvania, Centralia
CoordinatesLat. (o)40.7999Long. (o)-76.3402Alt. (m)Depth (m)
Location on Map
Zoom:    Powered by OpenStreetMap ©

Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000579Metagenome / Metatranscriptome1011Y
F007480Metagenome / Metatranscriptome350Y
F008454Metagenome / Metatranscriptome333Y
F010670Metagenome / Metatranscriptome300Y
F014995Metagenome / Metatranscriptome258Y
F022700Metagenome / Metatranscriptome213Y
F030498Metagenome / Metatranscriptome185Y
F062774Metagenome / Metatranscriptome130Y
F075734Metagenome / Metatranscriptome118Y

Sequences

Protein IDFamilyRBSSequence
Ga0070730_10000002100F000579N/AMVLQWHALGGTWTACDMPPALVHGIALIRAAGPNICIFGQGGRLRLQVGPHQYALSENSPRISCTRGIASFGFRRRFTVKSSSGDVLFSHSYWTHQGRDFYRWLAEKASDPDWRISCARQWSDGVASGAMRPH*
Ga0070730_10000002124F062774N/AMKAGEKYFALTPKGVEELRGRAAKLDANTRNILSLIEQGFTSADALLQRSKSTRDEMIDMLRLLLGNGFVSTAVSDGTVKAPTPEPTPSVADSISERLRLKQGISPSQARFALSNFCLDQFGTAGKDLADVVDLCEDVAGLQMALDSIRSEVKRVCPDQRPALVACVREINETDYDG*
Ga0070730_10000002128F022700N/AMGHWYYGRHFTLLAAGAVILFFVAQWNLLRDSLIGTFALNGALHALALVSTLRAPEVLSRKAAFIAIAIVLSVMSLYVGIIGLTLFAVLPGSERLYVVLGVCALSGAITYGSLVRLFWLRRLSSRLILSMAASCVLATLLAFLARTHAVWLGSWWLAAVWWFAFSGSLYFFDTHPDVLQRSKYNAANKGAPTWRDA*
Ga0070730_10000002250F075734GAGGMKKYRLGLAALALMVTAAHADDYLSPTEERVRLSLGVVRYSNRTDLQINSSADVPGTPLNAEDEFGLDKVDYEAKVQALVRVGERNRLRFDYFSLDRSGQNTLTQPIVFRDVVLQPGDPLKSDLSIRTFGITYGYSFLHSDRYEVAATIGINDTDISARARVQTQTRHIDQTEDQAGPFPTVGLDATYVLSKRFYFDGRAQYFKVHIDDIDGSLGIYELDALYRLRPNISFALGYTSLRAHLASTQIKQSGLFNFNSSGPEIFLRVAF*
Ga0070730_10000002276F030498AGGAGMATMFGKCRSGSDFWSMRREASLNWLALGLLLAAWNASDNDAAANDAAADAHRRPAALRGYTHVSVRAANRVAADL*
Ga0070730_10000002294F008454GGAGGMSKTYVAGLMSGFLGGMMGAFVLGHLGVPVISPASAAPVQEMISAGRIRLVDATGRTRAEFAMSPDGGPGLFFYDSKGRNRLVLGLYSPAESEYPFVVLNDTHNEAAGIFRLFGGQETPVVVLKNKGADRSILGLNPSSTEPFLVNYSSDRKKTAIFGSF*
Ga0070730_10000002361F014995GGAGGVHHRRPLLYPWLSSTALTALFMAGTSMSWGAPRVDDLVPQAPAAFLPGGMLGIQLGGSWEASKQNPSLHRLTCQSVPDARDFDEVCFFRASADSRVGGAAIHDGFIVRKDDHVVLVGTGIAIKNADDPLAESVVQSFQSQIHSAFQHTGDNVLFVKLPARRLTDDEMAGYSQKAPVLLVQLEPKNNELAILYGYLGPVNVFGSLTSD*
Ga0070730_10000002423F010670GGAGGMNKFVLIVLATFTLMASGLSVAGDKTTDAPAKSSSFVPHPHTSRHVYGTPIQPAVVSHARTSPHKQTSKKRSSKTASRDKR*
Ga0070730_10000002471F007480GGAGGMDNPNSPANPSPRTTLKLKAGVKRALEEPKAKPEPQPQSKGNQKPGAHWSDEYKRRMQADMDALTSR*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.