NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007976

3300007976: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764447348 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007976 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053172 | Ga0114368
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 764447348 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size144596637
Sequencing Scaffolds19
Novel Protein Genes22
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae6
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4733
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium1
Not Available1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F042387Metagenome158N
F043990Metagenome155N
F046433Metagenome151N
F051212Metagenome144N
F061925Metagenome131N
F061926Metagenome131N
F071328Metagenome122N
F077404Metagenome117N
F078842Metagenome116N
F080164Metagenome115N
F080165Metagenome115N
F081454Metagenome114N
F081510Metagenome114N
F085820Metagenome111N
F090516Metagenome108N
F092230Metagenome107N
F097490Metagenome104N
F103432Metagenome101N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0114368_100024All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae177304Open in IMG/M
Ga0114368_100048All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473123169Open in IMG/M
Ga0114368_100149All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales64753Open in IMG/M
Ga0114368_100189All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes55555Open in IMG/M
Ga0114368_100230All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae50516Open in IMG/M
Ga0114368_100334All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae39635Open in IMG/M
Ga0114368_100342All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae39187Open in IMG/M
Ga0114368_100559All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae30082Open in IMG/M
Ga0114368_100599All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47328844Open in IMG/M
Ga0114368_100913All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip222101Open in IMG/M
Ga0114368_100932All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47321773Open in IMG/M
Ga0114368_101220All Organisms → cellular organisms → Bacteria18310Open in IMG/M
Ga0114368_101502All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis15657Open in IMG/M
Ga0114368_101668All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae14390Open in IMG/M
Ga0114368_101853All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis13291Open in IMG/M
Ga0114368_102043All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae12346Open in IMG/M
Ga0114368_103233All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium8350Open in IMG/M
Ga0114368_111232All Organisms → cellular organisms → Bacteria2069Open in IMG/M
Ga0114368_116420Not Available1296Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0114368_100024Ga0114368_100024163F043990MIKKLGIIFTFGVIILGIVVYANHKIESSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDSSFTKLNKLPIKEDLPPNGIPKQFLNITNGYYKYIGDENDDRDFGILIVDTTRKEIYIYNQIL*
Ga0114368_100048Ga0114368_10004856F032313MYRFLILIFAITLMACDNNTPQGKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWITTTEFTCRNGVIVLHRQGINVNHVDTVNYVYDEVGNEIVLEGTGIRWFVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0114368_100048Ga0114368_10004857F032313MYRFLILLFALTLMACDNNTPQEKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEEK*
Ga0114368_100149Ga0114368_1001496F090516MKSNLSLKLFLAFERYFIENDEVISLDKSSEFTDVIVGIGFLPQDMSENTDFKKKTLEKYGFSSSTALADDFRKRVLNIDEPIPENFEKDGIGYVYTVISGYDTFYNRMYMFGIHCFNGDFNVTYYDLENDAETGDYYEEHELYSQAKGYRWLDPESDYYEDVLAWEALNKLATDIYFHLEDKLDVKIDIKPIPEEEKVVPTQEHLAKFLAFCGVEQDVIDENKERLLKALEEYTPDEYEGISEAMAEMMEYSHKIQRAEPVIEIIREYGVCRFSDWKFYAEELEEYILDLADFSDWKWEYPADTYSADLFPYMRKQLSLYHLWLCHLDEGADAYLFLLFSEKDMPEIMKLARILDLPLKAYFK*
Ga0114368_100189Ga0114368_10018934F081510MKFNLNAVKATAKTTWVTTKILGKKYAPVILLGAGLVGYGYSVYEGIKSGKKLEATKAKYEEMEAAGEEFSRVDVVKDIAKDVAIPVAVATASTASIILGFAIQTNRLKAVSAALAMVTEEHARYRLRAKTVLDEETFKKIDAPLETKTVNVDGEDIEVESIVPNEGDFYGMWFKKSHKYASDSPEYNEGVIKEADNVLTEKMMRKGMLTFAEVLDILGFEVPKAALPFGWTDTDGFYIEWDAHEVWNDDKQEYEVQFYVRWKTPRNLYATTNFHDFVPKKTRKELN*
Ga0114368_100230Ga0114368_10023027F103432MKLIHSLFSLSLLLALGGLFCTTACQDDVEPTQRAGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVEMEGKTLFRRHNLPSVSAYSLQVVAVGDTIYRQKESDAQFNADLDALFRQSIGIAPRLFGVRELSVVGIDSRGKVRDLGNYSCPLLQGKRRNVNYRTREGIFHEHYEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWQQPAAGCTKLRFTLTLVDGRSLVAEVPLR*
Ga0114368_100230Ga0114368_10023028F080164MFCALFQIISNSFSIFASKFNYPLSRNSRFVSFMRPTSFVLFILLWMVGLTLSAAPQVTLRERANAFPLITEKDPSEIYAPYAWRLPVVPLRLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRGDLFLNAANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0114368_100334Ga0114368_1003346F051212MKKFFFIFVLYWLHSCNGTEKAMPTSPDTQKTSISEKQNAEKIERIIYSQIGGDTGGKNVHLVITKDSIIYHLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTNKKEYSKTNIQNNKTWDYITKQIIDIKYSQLNNHLNLEK*
Ga0114368_100342Ga0114368_10034235F097490MNIFCKIILPLLCIISCSERKEIEVYNMKIDENKKEVLVEIRNNTENNYYLLSPIVSIMTKHLQYIDGEMIEGQIHHKKLDSIVCSVCIWDDICKEEYYAMRDIVLLPKKSVKKIKYKYDNEEYIEIEIVHIGFPYNGYYNEIGKKMQFMLKKKLDSSNIIKGYEFYNKDIETMTIKM*
Ga0114368_100342Ga0114368_10034238F042387MKRIILFFMAGLFLVSCSRENDKMTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKTNYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPGGHFYLNPNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDEDENTIADYLNFRWVSHNPTLIKKSINLYGSGVDSSEYKFQYEYKNNFPIKTKLNIDNQIVTTMVYEYNK*
Ga0114368_100559Ga0114368_10055925F081454MKTFKLVLLLFITSASLVFGQEKRYFFKYEFQPNSKYLIKYKTDMDGGYKFVGSKEVIDKIGMDGVKMTINSDIESAISTQKKQGNNVPFILEYTKYFYKAEINGETVNRKIPLQGVKLIGDIVNGKKMEVKNVEGNIDENTKKILIESIKQFSAIDTDFPKEGLKIGDSFDMVIPYKQSIPNAGDIEMKMNVKYKLLKVEKEEAYFDMLIDFVMGDKNVKNMDLSASGDGKGFLLFDMKNNYFTSQNIDMTINLKLKTELLTLENTSKAKSVITQQKIK*
Ga0114368_100599Ga0114368_10059911F077404MMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR*
Ga0114368_100913Ga0114368_1009132F105380MSILELDATQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGDRLTLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK*
Ga0114368_100932Ga0114368_1009324F085820MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVVAAREEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSEKGKK*
Ga0114368_101220Ga0114368_10122011F092230MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSEGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEAQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCMLFLLIV*
Ga0114368_101502Ga0114368_1015021F046433MRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRNGLVFLGMSELHFAATALAKRLRHHLEVDNKPVYVDVGNLLSQCRVKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQIKERIAGFEVDNNPGAHKVSVLVMAASSKYIDNGIGADPLWGKATYPVEAYYRLKNDHDDRGMSRVTGIHSSTDRTFGCEVDDIAYLAIDGGILKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLGRE*
Ga0114368_101668Ga0114368_10166817F061925MSWEYSINLDSEEAVSSVVTDLKICELFSSSTTDYIDWKNPKSIDSAPYDARFYTDKKKTIYISINSFSKNIFSALKSILAKQNYHLTDCETDKEVTLEHIFRSVI*
Ga0114368_101853Ga0114368_10185319F078842MIISSIYKIADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEALPEFDKLVIPEDDILRAASECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLGRDFLEYVKSLSSYVFCDNLQKALVRCSDNHKQVILNRSMLNAILGGCVIGGKGWLEMADSARIRQMINSIELDIYEVNS*
Ga0114368_102043Ga0114368_1020437F061926MIKTAKHIKTFLASVLLLIFVMNVSGLFVRLHHQETHQKTHQKTEKIAECSDKVCYHKAHLQTKSDCDCGFLCTLNYFYILPEKPQTEIHVNEYFSYFSSYKIFVSERIILLWQSRAPPVLS*
Ga0114368_103233Ga0114368_1032336F080165MKSIKILLLLLSLTACNNKTKTISTLDLEKTIIDYKDLPSKVKERVFYGEAMKLGEEDEERFQSLQETNNPKKYEYYTKQDPQLAWVHDPYIRNKKTKQEYSIDKDGPMGSRYIIYGDSLYIPNHYNIYEKDSLKYTFTRYILR*
Ga0114368_111232Ga0114368_1112323F071328MSQKHWTFTNIVRYIEEYERNPLLIERMKWKFIPEGECIVEFVELCKHLVLERTINPKTHQTTAIYLRYSQQLLLKKKRAIRRLGIGKKNVSATLRLCGTHYTEYGDDEHRVFFLDTDVNIYFCKHYQLPIYILQRIEFSNKEYRSFILKVLPVKKGEW*
Ga0114368_116420Ga0114368_1164201F080164MVGLTLCAAPQVTLRERASAFPLITEKDESEIIAPYAWRLPVVPLSLDNREIRNFAKYPALPSLSGGKLTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGFGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKLLHYKGQAGQLVLCEYYESHRGDLFLDVANAHPEIFGELRPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.