NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007994

3300007994: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 160158126 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007994 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053048 | Ga0113878
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 160158126 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size118347307
Sequencing Scaffolds15
Novel Protein Genes18
Associated Families18

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4161
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria3
All Organisms → cellular organisms → Bacteria → Proteobacteria1
Not Available2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp.1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → unclassified Candidatus Saccharimonas → Candidatus Saccharimonas sp.1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ct89S111
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F033081Metagenome178Y
F043991Metagenome155N
F046433Metagenome151N
F076191Metagenome118N
F077405Metagenome117N
F080166Metagenome115N
F081454Metagenome114N
F081455Metagenome114N
F084362Metagenome112N
F089055Metagenome109Y
F089057Metagenome109N
F090517Metagenome108N
F092229Metagenome107N
F095633Metagenome105N
F098763Metagenome103N
F099453Metagenome103N
F103431Metagenome101N
F103433Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0113878_100024All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales102261Open in IMG/M
Ga0113878_100029All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41699159Open in IMG/M
Ga0113878_100068All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis59576Open in IMG/M
Ga0113878_100966All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria12472Open in IMG/M
Ga0113878_101777All Organisms → cellular organisms → Bacteria → Proteobacteria8305Open in IMG/M
Ga0113878_101916Not Available7852Open in IMG/M
Ga0113878_103015All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp.5656Open in IMG/M
Ga0113878_106847All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium2944Open in IMG/M
Ga0113878_115642All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1450Open in IMG/M
Ga0113878_117281All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1325Open in IMG/M
Ga0113878_118813All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → unclassified Candidatus Saccharimonas → Candidatus Saccharimonas sp.1224Open in IMG/M
Ga0113878_133194All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas719Open in IMG/M
Ga0113878_136998All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ct89S11644Open in IMG/M
Ga0113878_139840Not Available598Open in IMG/M
Ga0113878_140275All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria591Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0113878_100024Ga0113878_10002450F092229MSENYRFDHMPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFRSTIEELDLATVLQKCTTRPYIAILNNIFFRYFNSKLIDDMFKLGESTKVLDLAIEYECEYYTINSAKTNIRRYMQQVYFDKYAADSNIISSHRVLNDPQVNAVKSAEFTYDLFTAARSEKFNPEMVRDIFLKYGLKTNSSRNLYTRMNNNLSLYYYMEDYLTEYMLKGSFTYGSQVYSTIKEFKCLPLMNVLTQLTRPNPSGYVLDSNLELVKG*
Ga0113878_100024Ga0113878_10002470F099453MLRRKDMNRFDIIELAQQTITFVHSAFNGKVNALDPYTRLNFVAGYLDKKTNIARTTPYGCIYVSLEAFADTVEAYKFIDTDQIRNLALEIIIHELTHIDQLIDYRYIKFNNGYREEIERQCVKQSCQWILDNIQFIRSLGLVVIPEVYEERLVGLSNVTYSFKNPAVIAMSKLEHMIGKKFKEFNSTDIEIHYVDRLKNYYKIPVCANRMYQNSQNLNDLGERLLNDKQYTIEYMEYGNSKLVIKITQGA*
Ga0113878_100024Ga0113878_10002473F080166MEVTFNGILKRLGNDVKENFHTEYIVNSGSLTTNVKYRYRMKLSPKGEQTCVYIDWDNYDDLFNVLEESIKICDPENPRTPFKRTYSDKGDLLDIRCNSLQVKYQHLNDRFGNTIDLIPFVLVDEQSGLLTEAIRFRFNNELVYDVPISRLKGFRRFLMTYNPLLHAGAMARYMAMTPLLGSNRQNMMRS*
Ga0113878_100024Ga0113878_10002474F081455MDLLKHLNETKLNLKFQQEREKRNAIEDSKALIRHLDPDYAEYELESLNPVEEIDAQNVEIIDAVDAIQQPLENKDAGIAVNFSQMINQPAIQEEVAKVVTSVPEQGEPKIKVVFPQNEHILGSYVDYDSFNKIKESNADTIIRSVRVLNFKMSDPNAVTAFNNFIMKFNPECDPNKRLKYELIRHQGREKDIVVRLSTVVNNVKYYADIYADLNKIDLDHHLISSAKKR*
Ga0113878_100029Ga0113878_10002923F089057MTNLIPIIAKKYNRKGDTSGSLKSLVSDLNCIDNVDDSLLFLSSIPRETKYTLDEVFDLITSNDIYIKIFGNVLTFINMDLDYHRLLLNAIKSESYKIISIINESIPTPDLFLAKNNYECLSVALDKPFVIFDKILGMVVSQLLHTASSKEERIFGIFMTICIINREINKLASLCTGYLAITRDEVLVKDLMNESAMVAFQYMSTEDINNVVSDINSRTVLSRYLSNM*
Ga0113878_100068Ga0113878_10006848F089055LKVEKMNSTPECVTKTPEIEAREKLAAIFSDAERCDNSKVNPELGKTAIDIENTSRMNSADDGAVYLCNQALGSYGKSLDYINNSPLEAVQAIGNSLQRLREYKTERSCR*
Ga0113878_100966Ga0113878_10096611F076191MKIIAENPAEEALLWRIKALSDELVNQDNRSTSMPVWTILDNNKAGKDYGAVMYFTGKAAEQHIEKNNHHYNNPTTCVRSAHDNRELKDVIHLLILAGGNEIPSNHYGFLRNA*
Ga0113878_101777Ga0113878_1017773F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLAAFLLALVVQKIRALFGGITASVFVIMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYNRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0113878_101916Ga0113878_1019166F084362MNCTFTVRWSDEKNKPRSKTYETEVSAKKAKKWLLEHGGRDVDVAVKINNKPAGSLQDSEKQPEAVAEQKGFWWEK*
Ga0113878_103015Ga0113878_1030152F090517LGLLFLASSCKNKKATPRLEFSSVELRQTVWNGTLEYKNPKRDSYSVYLNFLSDSEVEVSAYDSKDPTASYIVQADCFYTITDRIFTLKTQGDRELHPSMDQNAWYLIRKEPSLLVFQANAGNPDREATLTLRKKL*
Ga0113878_106847Ga0113878_1068471F081454MKVFKLVLLLFITSASLVFGQEKRYFFKHEFQPNSKYLIKYKTDMDGGYKFVGSKEVIDKIGMDGVKMTINSDIESAISTQKKQGNNVPFILEYTKYFYKAEINGETVNRKIPLQGVKLIGDIVNGKKMEVKNVEGNIDENTKKILIESIKQFSAIDTDFPKEGLKIGDSFDVVVPYKQSTQMGDIEMIMNIKYKLLKVEKEEAYFDMLIDFVMGDKNVKNMDLSASGDGKGFLLFDMKNNYFTNQN
Ga0113878_115642Ga0113878_1156422F033081MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL*
Ga0113878_117281Ga0113878_1172811F046433MIELPTSPDALSELSPVAPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPSFMESRCETTEYEAINDFVQFIETTKHYLPDYMEDCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQCRAKNKMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASGDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEKIDELSLPALANIVRPYRNGEDFDGLSRFRQLLEKE*
Ga0113878_118813Ga0113878_1188131F095633MGNYENFTEVGRREGLTEGELRTMGMLAMEATEELKKTTIRKETVLLGSVPFGSWDEFAKAVQEMAAHIYEPIPVKINTKRLIAAAFLDDGGEMSVEEHSVPEEVFIDLPRTRCVVGADRSHKSYEFTCPVLKKFPDGELYPIREAYVISAIDVNGSQEVDFKII*
Ga0113878_133194Ga0113878_1331942F098763MDRRTSLRVLSTALYLTTKLFKAVVRTTISYDSSDEVKGKGRMNAVPTAVDE*
Ga0113878_136998Ga0113878_1369981F043991MSKKNPSVVDYFDLNGDLNEEAYEFEDVKLEDYIDKRSNIKPSWIGKYSQQMHFDLVDGTEVSFYKGRNLIYSDILFSGGVRTILFKCRQKKNLTRFISRVLELSQGKPENIHPDFRA*
Ga0113878_139840Ga0113878_1398402F077405LLVAKQRGVFYGFIVLCQIKFVKKFFDWLKIIQK*
Ga0113878_140275Ga0113878_1402751F103431MIDFDALVVGMLFFIQLFLQGIAWRVAIAHFLHAERGNAAAAAFDGAFGENIADCHAEDDNDKDAESQKKGFHVCIPEG*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.