NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0209300_1000214


Overview

Basic Information
Taxon OID: 3300027365
Scaffold ID: Ga0209300_1000214
Source Dataset Name: Subsurface microbial communities from deep shales in Ohio, USA - Utica-3 well 1 S input2 RT (SPAdes)
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: DOE Joint Genome Institute (JGI)
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bps): 35171
Total Scaffold Genes: 45
Total Scaffold Genes with Ribosome Binding Sites (RBS): 37 (82.22%)
Novel Protein Genes: 10
Novel Protein Genes with Ribosome Binding Sites (RBS): 9 (90.00%)
Associated Families: 10
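
The RBS percentages listed above can be reproduced from the gene counts. The following is a minimal sketch, assuming the percentage is simply the share of genes with an RBS rounded to two decimals; the function name `rbs_percentage` is hypothetical and not part of NMPFamsDB.

```python
def rbs_percentage(genes_with_rbs: int, total_genes: int) -> str:
    """Return the share of genes with an RBS as a percentage string.

    Assumption: the page rounds to two decimal places.
    """
    if total_genes <= 0:
        raise ValueError("total_genes must be positive")
    return f"{100 * genes_with_rbs / total_genes:.2f}%"


# Counts taken from the Scaffold Components fields of this record.
print(rbs_percentage(37, 45))  # all scaffold genes
print(rbs_percentage(9, 10))   # novel protein genes only
```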

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage (Source: UniRef50)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface → Subsurface Microbial Communities From Deep Shales In Ohio And West Virginia, USA

Source Dataset Sampling Location
Location Name: Ohio, USA
Coordinates: Lat. (°) 39.849, Long. (°) -81.036, Alt. (m) (none listed), Depth (m) 2500

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F001733  Metagenome / Metatranscriptome  644                  Y
F031872  Metagenome / Metatranscriptome  181                  N
F033435  Metagenome / Metatranscriptome  177                  N
F039641  Metagenome / Metatranscriptome  163                  N
F042901  Metagenome / Metatranscriptome  157                  Y
F049628  Metagenome / Metatranscriptome  146                  N
F050378  Metagenome / Metatranscriptome  145                  N
F059968  Metagenome / Metatranscriptome  133                  N
F065785  Metagenome / Metatranscriptome  127                  Y
F066786  Metagenome / Metatranscriptome  126                  N

Sequences

Protein ID           Family   RBS     Sequence
Ga0209300_100021410  F066786  AGGGGG  MTQERVDLTWRCGHTAHIMVGYSQSDLKYKMAMMTSTLQICAACENKLAIERAWSLTQRLLEPNPIVMTGSEKQIEWARSIRTTKYEVLAHVLDCLHEAYKTRQDEWPAIARAISPVVNDVSIWRSYTQSGAIIDRRNINWTGAFTNALSRAGLYIGGLK
Ga0209300_100021413  F065785  AGGA    MDIKLSCIECNRPNVVPYGRGHRICGICSQRQLKRERRKQTQRRIQTLGGFVLVVMCVWTACAMASDWNTPNSPDHRAHQAMQARD
Ga0209300_100021420  F059968  GGA     MSKATATEEKAELLVRIKELRAAGNSISRTAQIMCMTRGTVQRWIMEERPEREVKKMDPYVSIDEKTAIVVKWAELIASGESRSSAAASVGFPTMMLNRWLMSEPSLRIEFQESIGKKQNNFGGRKSFEQILADVRAGRPVWRDGGRFKIQLVEAALMRYELDGANVWRCKGFATLSGNDVLARDWTVVS
Ga0209300_100021421  F049628  GGA     MKFSEVIEPLMHGKPVTRASWEHDVYVRYSDLYEAFVMHAIPEPKIIQGITIYPEWMLADDWMWGEFHPVKDEIKWTQTTS
Ga0209300_100021431  F039641  N/A     MIPAAYSNALKNAIQAYSYADRVAIWRTVNQSDGIGGISQHWVQVAEIRATISNTGDTEGIVGGMIEQSGTWTLTCSPDIEVRADDRIYVSGNPQALSPYYECIGSDYGHTNAVSQTIALRARTNG
Ga0209300_100021432  F050378  GAGG    MSPEMWVQIGIQAFITIMSIGAAWVALQVRLTRLETQVAHIISTLDGQQQEVRRIEQRLGKLENKVSALEAIIQR
Ga0209300_100021433  F001733  GGAGG   MNSISIKRLVVVVIVAFVAAFTSVFGDGIRTSEAHDISELGAVLALYGSKAVAAGVTAAMSSVLAFLTMPFKGVEANSLKVGK
Ga0209300_100021439  F031872  GGA     MVESLVVDEWIYDTLTADATLQGLLAVDNRAPSYQQGIYLYYAPEKDPISLRQPQVPYIVVRHLDAGQTDTTSVCGGRIVTTSSHQVWCWDTQSGAVSMARIKGIVDRIDTLLNKQSVDTTTPVFFLNRASVSSSIDVSQDGRVDNGISQVYVATITP
Ga0209300_100021441  F033435  AGGA    LSSIFDNIPKLEGRPNHTVDIERFIGAPGSFTFREPKASDLFPRPEVEKMLKIAFPEFPAQMLQILMIMARCYVTQPGDGEINPARRFAQLARDRSDIYLFVVGQFAAAFPINIEEAVDEVPND
Ga0209300_10002145   F042901  GGA     MARKQSDNKDITAGPVRVAEKPEGLLWLLKASEHEILERLHQDGALIYIHPALDGVVSYRIEENPAHDQKVVHVWR
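
In the sequence listing above, each Protein ID appears to be the scaffold ID followed directly by a gene number (for example, `Ga0209300_1000214` plus `10`). The sketch below splits an ID on that basis. This naming pattern is inferred from the entries on this page and is an assumption, not a documented NMPFamsDB or IMG/M rule; the helper name `gene_number` is hypothetical.

```python
SCAFFOLD_ID = "Ga0209300_1000214"  # scaffold ID from the Basic Information fields


def gene_number(protein_id: str, scaffold_id: str = SCAFFOLD_ID) -> int:
    """Extract the gene number from a protein ID.

    Assumption: protein ID = scaffold ID + decimal gene number,
    as the IDs on this page suggest.
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id!r} does not belong to {scaffold_id!r}")
    suffix = protein_id[len(scaffold_id):]
    if not suffix.isdigit():
        raise ValueError(f"no numeric gene suffix in {protein_id!r}")
    return int(suffix)


# A few of the protein IDs listed for this scaffold.
for pid in ("Ga0209300_100021410", "Ga0209300_100021441", "Ga0209300_10002145"):
    print(pid, "-> gene", gene_number(pid))
```

Splitting on the known scaffold ID rather than on a fixed string length keeps the helper working for single-digit and two-digit gene numbers alike.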




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice