NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0070731_10000008


Overview

Basic Information
Taxon OID: 3300005538
Scaffold ID: Ga0070731_10000008
Source Dataset Name: Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: DOE Joint Genome Institute (JGI)
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 523476
Total Scaffold Genes: 438
Total Scaffold Genes with Ribosome Binding Sites (RBS): 332 (75.80%)
Novel Protein Genes: 10
Novel Protein Genes with Ribosome Binding Sites (RBS): 6 (60.00%)
Associated Families: 10
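
The RBS percentages listed above follow directly from the gene counts. A minimal Python sketch of that arithmetic is shown here; the helper name `rbs_percentage` is hypothetical and not part of NMPFamsDB or IMG/M.

```python
def rbs_percentage(with_rbs: int, total: int) -> str:
    """Return the share of genes that have an RBS, formatted to two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return f"{100 * with_rbs / total:.2f}%"


# Counts taken from the Scaffold Components entries of this record.
all_genes = rbs_percentage(332, 438)   # all scaffold genes
novel_genes = rbs_percentage(6, 10)    # novel protein genes only

print(all_genes)    # 75.80%
print(novel_genes)  # 60.00%
```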

Taxonomy
All Organisms → cellular organisms → Bacteria (Source: IMG/M)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil → Surface Soil Microbial Communities From Centralia Pennsylvania, Which Are Recovering From An Underground Coalmine Fire.

Source Dataset Sampling Location
Location Name: USA: Pennsylvania, Centralia
Coordinates: Lat. (°) 40.7999, Long. (°) -76.3402, Alt. (m) not reported, Depth (m) not reported

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F000036  Metagenome / Metatranscriptome  4352                 Y
F002492  Metagenome / Metatranscriptome  554                  Y
F002979  Metagenome / Metatranscriptome  516                  Y
F003336  Metagenome / Metatranscriptome  493                  Y
F011872  Metagenome / Metatranscriptome  286                  Y
F012141  Metagenome / Metatranscriptome  283                  Y
F025789  Metagenome / Metatranscriptome  200                  Y
F047844  Metagenome / Metatranscriptome  149                  Y
F087514  Metagenome / Metatranscriptome  110                  Y
F087522  Metagenome                      110                  Y

Sequences

Protein ID  Family  RBS  Sequence
Ga0070731_1000000818  F000036  AGG  MAYKETFWMACDSTEQLRAEYGPFHTRAEAEAEAKKLGFGYLLRYEHILGENEEIEEVRCIFVELPGATPVGVEPAPVTLHTRCATCGEASAHEKGWQAEVWADIHEFEHSRHRVRLFEHARGKGLKEIGDWRG*
Ga0070731_10000008253  F003336  GGAG  MGGTAAYGMYPRSIALPEIVCALNRAGFGNEDICMVLSPAHPVAEAVRDAKVTDIEREDGALGARMIGWISKFGAVVIPTVGFFIRSQAFFRALVTEQNFPALSRGSGTLLDLGFSECDAKRLGHQLSDVGALIYVSCPESSKADRALELLRRMGAREAASLAAIKVAGAAA*
Ga0070731_10000008256  F002492  AGGA  MSNKATSTSSASPAGWHVLYQAALFETDPNKMPLRIAEAETAILVRMKELFTITSDHIEEDLILDDALYGLRALRNCVVQEASAA*
Ga0070731_10000008355  F087522  N/A  MRRSLNILGSAGLAAMLSMAPVVGLHAQSHDSGEHKILLQADLKAGQVLRYELEAAGSFLPIADASGAILTPPRGPCDYALAAIVTLRPQAPDKDGNTPVEARYSETRVTSVGCALFSAVDFQRRLAALQASPVMFRVGPHAETAMERTRDGYFKYWDGGELLRKTTQDLLQTEFSQHPVAAGDSWKPRGQFAYARDHALKDLELSGADLRFRSMVQVDGKRCAWVTSQYVFSPIDLPAAGTASGGVIVPGAGNNAVAAVLHISLLLDPTSHHVAWLHRSQTIDNKLTLASPYETSDPRDDPAEPDDPQDSENPMPDLSSMRSDHPGRYPFMTFHFQEEARARLLPDQRSVEWLAALKRFEETPGPVSGAVPVAATKTLLPNAMIQAAKPLAVNRTTRVVVDSDSLMATPAGFTRYEKGLCRDVWFCATVSVALPGDVQVSEDTPLRTVYLARQEGLLLSVAVGPALDRRSVGLTEDEELRKEANYYLANYVWLAAKPGIGTNFSSATLDGYPGLITDFSATQRDLADIRGVLGLMLTPWGKVVPVSCSSDHASSADLQAICEKVVTSVSLQR*
Ga0070731_10000008356  F047844  GGA  MKLSFSVRVGLAFVVWIGCLGTLAWAQDSLGDPSANPGDSLGDLARQTRAQHTPSGEGTSTKAQELVNEMQQEQEAADNAPVGFKNYDAGDYRLYVPFPFSLEGRENDGAVLLGSRLGITNTEVMAGTPIPIPPNSSDNDLTNVARQLAGLHGASAYCSAIKEGTHKAFRCSWQTSPRLLGHEVWGSMEFVVSSDRLIPVMCVSPDDVHQTCVVYDTYGHNTCSDREYQLYGWDHRKAQAAADASNRDERTTVQMCEQVIYPSIQLKEDIVVHPATISANKATKVVAVAAVRDQTEGDTGSQGPSLAELARRTRQATHAKAETSLNNSEGSSAPAGFQSFVLQYCQNPQQCSEATVIIPDKTEVVSRTNGQHIFKTALDGEPVFLYAGPADVNAPYRSLTDPDYIRMRDLANANGWSREKADGVSTQELTIEDRYALMTRFRYQRDQKRWWIGERVLVQNRSGEFLVGCTAPQEHFADAEVLCTTLVNSLRLP*
Ga0070731_10000008357  F012141  GGGGG  MRDRYWFWLGARLLAGVALIGWALSYLGTGSGEKEFQKTLEAMKQVHTFRVASTTNQVGVQHNEMLWEVDCPRDIVHHQMHMVMVSSTNSGEMKQDQMMVGNRTYERGSDGSWVPNRSGYQGASAKWYCRNLAEGTDSNLMPQIATMIKRGILQKGDKKTVNGVRCREWLVTMKGGFSGLEHDTVCLGLDDHLPYEMTVDWAHSRTSFSDYNVPIQFDAPEAILQPASTSTGVN*
Ga0070731_10000008372  F011872  N/A  MQTGTMSGDSTTRKDSLKVEKVSLLILTLCVLVLVSLLVFKPF*
Ga0070731_10000008374  F025789  GAGG  MHLWRIRLVLLFGTVLALPPAFAAPFVLFPKVTELASPDGRYIVRNADREGASTDFVGTFHSLWLTESATGHSRKLCDYLGVAAVAWSRNDFLIVTQYSRRTSQALVFSATGSPDPVMLDKPTLTYLVPIELRPALRGNDHVFVEASRVEDDTLYLTVWGYGQHDANGFRWHCQYALREGAISCTEERATH*
Ga0070731_1000000841  F002979  N/A  VKPRPKSQKRQPGKKAEAASLRMFLVPCSCGRTFAVAEKYDQQGTAWSRYIVCPGCGKRHDPKNRLLQMGFHSEGYWKVDEC*
Ga0070731_10000008424  F087514  N/A  MAEATEGQDLRQDRPDGHSARDWQVLWFFIHLAAVYAIAEFVTPWLAGWTHGTLLPHLQHPTSSGRFEFFFSHIFAFSFIPAFLSGLVNARFKHKAAQFVWLVPAAILVFKFATFPAPSVFQSQFSAAFHQYFGSGFVIPESRDFQDLFSTASNPDMWRGMAQHQFTAPFYAGVAYSAAAWIALKIELSRKVAEGVKKWEQSRVEHQL*
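
Each row in the table above pairs a protein ID, family, RBS motif and amino-acid sequence, so it converts naturally to a FASTA record. The following is a minimal Python sketch of that conversion, not a format defined by NMPFamsDB. The `NovelProtein` type, the `to_fasta` helper and the header layout are hypothetical, and the trailing `*` is assumed to be the conventional stop marker and is stripped.

```python
from typing import NamedTuple


class NovelProtein(NamedTuple):
    protein_id: str
    family: str
    rbs: str       # RBS motif, or "N/A" when none is listed
    sequence: str  # amino acids; a trailing "*" is assumed to mark the stop


def to_fasta(rec: NovelProtein, width: int = 60) -> str:
    """Format one record as FASTA text (header layout is an assumption)."""
    seq = rec.sequence.rstrip("*")  # drop the stop marker
    header = f">{rec.protein_id} family={rec.family} rbs={rec.rbs} length={len(seq)}"
    # Wrap the sequence at a fixed line width, as is customary for FASTA.
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header, *lines])


# Example row copied from the Sequences table of this record.
record = NovelProtein(
    "Ga0070731_10000008372",
    "F011872",
    "N/A",
    "MQTGTMSGDSTTRKDSLKVEKVSLLILTLCVLVLVSLLVFKPF*",
)
print(to_fasta(record))
```

Keeping the family and RBS in the header line means the record can be traced back to this table after export.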




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice