NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008522

3300008522: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765135172 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008522
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052931 | Ga0111045
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765135172 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 124642862
Sequencing Scaffolds: 19
Novel Protein Genes: 22
Associated Families: 20

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2
Not Available | 3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1
All Organisms → Viruses → Predicted Viral | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Latitude (°): 39.0042816 | Longitude (°): -77.1012173 | Altitude (m): N/A | Depth (m): N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F032313 | Metagenome | 180 | N
F043991 | Metagenome | 155 | N
F046432 | Metagenome | 151 | Y
F054110 | Metagenome | 140 | N
F067846 | Metagenome | 125 | Y
F068942 | Metagenome | 124 | N
F074985 | Metagenome | 119 | N
F077405 | Metagenome | 117 | N
F081455 | Metagenome | 114 | N
F084362 | Metagenome | 112 | N
F085820 | Metagenome | 111 | N
F089057 | Metagenome | 109 | N
F092229 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F095629 | Metagenome | 105 | N
F095631 | Metagenome | 105 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F103436 | Metagenome | 101 | Y
F105379 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111045_100094 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 50311
Ga0111045_100106 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 48215
Ga0111045_100554 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 20037
Ga0111045_101385 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 11167
Ga0111045_102201 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 8025
Ga0111045_102988 | Not Available | 6439
Ga0111045_103128 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 6244
Ga0111045_103925 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 5261
Ga0111045_104147 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5040
Ga0111045_104321 | All Organisms → Viruses → Predicted Viral | 4879
Ga0111045_104751 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 4515
Ga0111045_106655 | Not Available | 3448
Ga0111045_107337 | All Organisms → cellular organisms → Bacteria | 3178
Ga0111045_110609 | All Organisms → cellular organisms → Bacteria | 2300
Ga0111045_112913 | All Organisms → Viruses → Predicted Viral | 1910
Ga0111045_115509 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1591
Ga0111045_129028 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 839
Ga0111045_133661 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 715
Ga0111045_140199 | Not Available | 584

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111045_100094 | Ga0111045_10009416 | F089057 | MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSNDEYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAIFDKVLGMVVGQIKHTASSKEGRALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAIDAFQYMSEEDIHIVVDDINSRTVLSRYLNKI*
Ga0111045_100106 | Ga0111045_10010616 | F081455 | MENVECISNLISKCSDYINRKEKNTNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDASIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPEGDLNKRLRYELIRHQGREKDLVIRLSTVLNGTTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0111045_100106 | Ga0111045_10010620 | F099453 | MNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEVELKCVKQSCQWILDNMQYIRSLGLVVIPEVYQARLANLGNIIYTPKYPIAIAMAKLEYMLGRKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGV*
Ga0111045_100106 | Ga0111045_10010639 | F092229 | MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEGLDLRTVLQKCINRPYNSILNNIFFRYFNSELIDDLFKLGQSPKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLEEYKEEGRFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYVLNSNLELVKG*
Ga0111045_100554 | Ga0111045_1005542 | F067846 | MESLQAQWERKTFNDYDRRCCAEDAYNEAIEREIECIEDDISNGDSDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQIEEDYRKGYILND*
Ga0111045_101385 | Ga0111045_10138513 | F095629 | MTFKERMMRELIICVCLLGCFGVANANNVEQPKEVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEDK*
Ga0111045_102201 | Ga0111045_1022015 | F095631 | MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARKNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMTNNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0111045_102988 | Ga0111045_10298812 | F084362 | MNCTFTVRWSDEKNKPHAKTYATEDDAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDNPSESAAEQKGFWWQE*
Ga0111045_103128 | Ga0111045_1031285 | F092232 | MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDAR
Ga0111045_103925 | Ga0111045_1039255 | F046432 | MWEMTENELNEIISKYQMPEVRYLVEEEGSFGESEFFWVIQNQSTNQKYLLVNTYSHHGVEAEVEYYREEGFDNLEAIPRRIETLENASDADDEISKYLFGMYSIFEIKS*
Ga0111045_104147 | Ga0111045_1041477 | F099452 | MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTSWYKDWFISDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFKKVIEVDSYSLYDTLAEENGVDLFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDKKTFKSTELFYIVKNDICGTIFFTLFDRMNEDTRTRVR*
Ga0111045_104321 | Ga0111045_1043215 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPEKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD*
Ga0111045_104751 | Ga0111045_1047513 | F103436 | MITTSKGGWRYKSDFEIFDSLRDWVMKCDVKYVKRDALDKIDYARSLWCRAEYVAAVHLLDENEVFLKKSDWPYYALGIQILRARKHEFFNE*
Ga0111045_106655 | Ga0111045_1066553 | F032313 | MYRFLILIFALTLMACDNNTPQEKPREQEKHEVPVSKPKPQFDEVGERIWYGQTPAMRLDSTNYGAGLTSVFGMRTSSISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGINVNHVDTVNYVYDEVGNEIVLDTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0111045_106655 | Ga0111045_1066554 | F032313 | MYRFLILIFALMLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAIRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK*
Ga0111045_107337 | Ga0111045_1073371 | F046432 | MWEMTESKLSNIISKYQLPMDNYLVEIDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRKIETLENASDADDEISKYLFGMYSIFEIKP*
Ga0111045_110609 | Ga0111045_1106092 | F085820 | MRSTFYLFAMLFVATTFFSCETGEPAPRATWGEIVNPIEAFMYPRDLKVFAGDNDGRRWLILVIPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKGIRVLRTRADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLADGTEIEKELSEKGKK*
Ga0111045_112913 | Ga0111045_1129133 | F074985 | MMELTDGGWYKTPRIIKGSDFLAHIHDTYASGNAMYVEFKASEGEVRILEYKRLYDVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQQAWKAIHMGSTKRFNLEQFDQLWLDQTFQKLHPVIVNHDGKFWYVMGLKLDVDADGSFWGLYLKRQDSDFMKEIRMPLTQKFIYNPISGSWFLDDPTQEIKDLEEIKQTLRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDSKTGEQKYLLDHIKAMHID*
Ga0111045_115509 | Ga0111045_1155091 | F077405 | CRALPTELFPRLLVVKQRGVFYGFIVLCQIKFVKNFFDWLKIVQK*
Ga0111045_129028 | Ga0111045_1290282 | F043991 | DLNGDLNEEAYEFEDVKLDEYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLNIVCADINFAGGIRTILFKCRQKKNLTRFISRVLEIAQGDPSNVHPDFRA*
Ga0111045_133661 | Ga0111045_1336612 | F105379 | IIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLDKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0111045_140199 | Ga0111045_1401991 | F068942 | RKILSLPTLALCFTLCTALFAGCGENYDGSVTEVHWSNVKNPEYGNAINITLKAEGETFTTVGDHSWISFSNDASTLDTFTRHRFPEMDKDTAYYKDIVIYLTRNESKGTATLKLVAPPNRTQQPKQFKFSVSVTPPGLYIFKVRQPALPAKAQ*
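
The sequence records above can be exported to FASTA for downstream tools. The following is a minimal sketch, not something NMPFamsDB provides: the function name `to_fasta`, the `family=` header tag, and the 60-character line width are illustrative assumptions, and the trailing `*` is assumed to mark a translation stop and is stripped.

```python
def to_fasta(records, width=60):
    """Format (protein_id, family, sequence) tuples as FASTA text.

    Assumption: a trailing '*' marks a translation stop and is not
    part of the protein sequence, so it is removed before writing.
    """
    lines = []
    for protein_id, family, sequence in records:
        seq = sequence.rstrip("*")
        # Header layout is hypothetical: protein ID plus a family tag.
        lines.append(f">{protein_id} family={family}")
        # Wrap the sequence at a fixed width.
        for start in range(0, len(seq), width):
            lines.append(seq[start:start + width])
    return "\n".join(lines) + "\n"


# One record copied from the Sequences table of this sample.
records = [
    ("Ga0111045_1155091", "F077405",
     "CRALPTELFPRLLVVKQRGVFYGFIVLCQIKFVKNFFDWLKIVQK*"),
]

print(to_fasta(records), end="")
```

Keeping the family ID in the header makes it possible to group exported sequences by family later without consulting the table again.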

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice