NMPFamsDB

A database of Novel Metagenome Protein Families

Scaffold Ga0214921_10000039


Overview

Basic Information
Taxon OID: 3300023174
Scaffold ID: Ga0214921_10000039
Source Dataset Name: Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1505
Source Dataset Category: Metagenome
Source Dataset Use Policy: Open
Sequencing Center: DOE Joint Genome Institute (JGI)
Sequencing Status: Permanent Draft

Scaffold Components
Scaffold Length (bp): 262308
Total Scaffold Genes: 374
Total Scaffold Genes with Ribosome Binding Sites (RBS): 236 (63.10%)
Novel Protein Genes: 13
Novel Protein Genes with Ribosome Binding Sites (RBS): 12 (92.31%)
Associated Families: 13

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae (Source: IMG-VR)

Ecosystem & Geography

Source Dataset Ecosystem
Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater → Freshwater Microbial Communities From Lake Lanier, Atlanta, Georgia, United States

Source Dataset Sampling Location
Location Name: USA: Georgia
Coordinates: Lat. (°): 34.2611; Long. (°): -83.95; Alt. (m): not given; Depth (m): 2

Associated Families

Family   Category                        Number of Sequences  3D Structure?
F000354  Metagenome / Metatranscriptome  1244                 Y
F001483  Metagenome / Metatranscriptome  686                  Y
F005563  Metagenome / Metatranscriptome  396                  Y
F008243  Metagenome / Metatranscriptome  336                  Y
F011670  Metagenome / Metatranscriptome  288                  Y
F012023  Metagenome / Metatranscriptome  284                  Y
F013393  Metagenome / Metatranscriptome  271                  Y
F015470  Metagenome                      254                  N
F035734  Metagenome                      171                  Y
F054849  Metagenome                      139                  N
F058128  Metagenome                      135                  Y
F061653  Metagenome / Metatranscriptome  131                  Y
F103252  Metagenome / Metatranscriptome  101                  Y

Sequences

Protein ID             Family   RBS    Sequence
Ga0214921_10000039104  F012023  AGG    MKQIDYKKLNERIVEQAYETALGEMSKEQLQKLIQDKFERARKHHEALTKLSTFSVN
Ga0214921_10000039111  F015470  AGGAG  MVTKKAPVRKAPVKKVVAKKTPVKKAPVKKTTTAPIVIDVSSTTKSASSMLDKAIDLIKWVDTPFKLFEVILLASVFFFGYFAWDSRTVILNAITQSSHITNLKEVNHLIPVAVGLQKDLDAVSVVVHKASLVTNTRTTMLAFGPKGRDNALDGLVSSLFNKDPIRNAAIIGMLNGEVVCDKLEVTGKTSEWEVKQGATFICRGGIPPEVGDFDGYVSVGFKTEPTELTVVKTRINLATSEMSQ
Ga0214921_10000039112  F054849  GAG    MKWLIVFLLLLISCIPSSSQPCDDCIGGVVATDTNTCLVSNFYKIAYSIHDPGLRHQQLSSWLTTNGNKCDSKQLVIIWNRLAEWGGAADSAELRGKLLYYFSRAEERERKEKS
Ga0214921_10000039113  F058128  AGGA   MDAIRWFPIVQPSYANVDGILADVHARKNEERWAEYNKELVEKIHFRDYLYDLYLKRAHENHIRMEIFNVSTIDYYV
Ga0214921_10000039197  F008243  AGGA   MHTLEQEIQAIVEQCQMGNISEDEHNYLLTEIRDIRAAQECAGNEELFRYVVQACNIAMKLV
Ga0214921_10000039256  F013393  AGGA   MRHVSGFSNSTRIRFIIDGFGMYATINDVFTKTATVSHGAALRYAVEYLAEIRRRSPVLGEGRPVGTVITHEGHQVQINLMAN
Ga0214921_10000039264  F061653  GGAG   MRYGNMPYKYRVVTKVSSKEDAVIDVFGRQILLTTNNYQTARKECIEWAAYDDILVHVINQFGSSKFSCDGASEAYDRFPKTAEAV
Ga0214921_10000039289  F035734  GAG    MTAKPFRQWLNDLWRDNCDEHDGWGQPRMTMKEYFHKYKYWLKREYRHQIKG
Ga0214921_10000039291  F011670  AGGA   MNISRAEQSVVKHNLEQYRIDQNRLEKQRTEDYAKKIEERRVEQIIAERVSRNLRLDLDKGRNIDIEC
Ga0214921_10000039309  F001483  GAGG   VDKLKELLLELLKFIGESPFRLFTVVFLCLFTFLGWVAYTEKDNFMASYRAQQALPKMNGKYEQAVNFILKNTDAELVAVFQVNTLLNTRKLVYLTTRGSGHDTVHDGTNVGLLTKNHSNNEDVIGLMSGKIPCGPYLTPQSYIGFTYKNYGVTYMCRISVPADPGLFIGQISVGWKEQPTEVELAQTVLIVASSLLFDKK
Ga0214921_10000039331  F103252  N/A    MYIEFDILNAMDNFDDLEDAVDLWAKKHNIPYTTKVAKGLKYRLGLNQPEHFTLFFITWADCEYKVRNIK
Ga0214921_1000003964   F000354  GGAG   MTTVNYDNFASFDINECCDHFDSEKQSNWKKINKFIVADGQEFAHIMETEFDFDETGANEYEAFQAGVKYALTKMNIAFEAAAVDLQVCEVDLVESMGFVLVRADDEPEDFVKRVLKKPVLMVDSWV
Ga0214921_1000003971   F005563  GGAG   MIDYYEALKEMHQGNVVKYVGTVNGNVMSENGASFCMCRGCIFLFDDGVIKWNKLGYMVYDPDFRYELTGETVDPRAWKPEKNRDRKEIKSKLGYSRVGLGNI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice