NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007966

3300007966: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763536994 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007966 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052890 | Ga0105959
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 763536994 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size143607971
Sequencing Scaffolds17
Novel Protein Genes27
Associated Families22

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis4
All Organisms → cellular organisms → Bacteria4
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4733
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00401

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F046432Metagenome151Y
F054110Metagenome140N
F066860Metagenome126N
F072446Metagenome121N
F078842Metagenome116N
F080164Metagenome115N
F080166Metagenome115N
F081455Metagenome114N
F085820Metagenome111N
F089055Metagenome109Y
F089057Metagenome109N
F092229Metagenome107N
F092232Metagenome107N
F094007Metagenome106N
F095629Metagenome105N
F095631Metagenome105N
F095633Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0105959_100004All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales235411Open in IMG/M
Ga0105959_100285All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis42936Open in IMG/M
Ga0105959_100306All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis41427Open in IMG/M
Ga0105959_100421All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis35116Open in IMG/M
Ga0105959_100706All Organisms → cellular organisms → Bacteria25826Open in IMG/M
Ga0105959_100910All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47321736Open in IMG/M
Ga0105959_101254All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47317217Open in IMG/M
Ga0105959_101260All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis17158Open in IMG/M
Ga0105959_101629All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47314374Open in IMG/M
Ga0105959_101997All Organisms → cellular organisms → Bacteria12324Open in IMG/M
Ga0105959_102044All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria12087Open in IMG/M
Ga0105959_105196All Organisms → cellular organisms → Bacteria5167Open in IMG/M
Ga0105959_105480All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis4869Open in IMG/M
Ga0105959_105585All Organisms → cellular organisms → Bacteria4780Open in IMG/M
Ga0105959_106311All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli4201Open in IMG/M
Ga0105959_107679All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00403390Open in IMG/M
Ga0105959_108811All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2913Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0105959_100004Ga0105959_100004103F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAASAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDARDPAAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0105959_100004Ga0105959_100004152F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFEGVIEVDSYSLYDTLAEDNSVELFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDKKTFKSTELFYIVKNNICGTIFFTLFDRMNEDTRTRAR*
Ga0105959_100004Ga0105959_100004193F089057MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVIGQIKHTASSKEGRVLGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRTVLSRYLNKM*
Ga0105959_100004Ga0105959_1000043F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGANGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKTDQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0105959_100004Ga0105959_10000432F081455METTNIQEINMKAAEKLGELFDYVFCGKKPNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDTSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0105959_100004Ga0105959_10000433F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNNIIKRINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGSNRQNMLR*
Ga0105959_100004Ga0105959_10000436F099453MLRRKDMNRFDIIELAQETLIFVYNTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLTNVIYTPKYPIAIVMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA*
Ga0105959_100004Ga0105959_10000455F092229MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFIKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGRFIYGTKEYKILKELRGLPLMVVLTQLTRKNDSGYILNSNLELVKG*
Ga0105959_100004Ga0105959_10000479F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY*
Ga0105959_100004Ga0105959_10000484F095629MMRELIICVCLLGCFGIANANNVEQPKEVKIVHNDNSVALHKKIYQLEKRIERLEELLKKEGK*
Ga0105959_100285Ga0105959_10028515F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKPKQLYKKESEYSSTKQHEVLLFLVKAYKGGD*
Ga0105959_100306Ga0105959_1003067F066860MTTKKQKLQKQQAIDTWIVVALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDKE*
Ga0105959_100421Ga0105959_10042136F095633MGNYENSTEAWRREGLTEGELRIMGALAVEAIEKLRKTTVREETVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVEINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPVLMEHPDGELCPTRKAYVISAIDVNGSQEVDFNIIYGGLN*
Ga0105959_100706Ga0105959_10070611F046432MWEMTENELSEVISKYQMPEGRYLVEQEGSFGESEFFCVIKNQLTNQKYLLMNTYSHHGVEDEVEYYREEGFDNLEAIPRKIETLENASDADDEISKYLFGMYSIFEIKS*
Ga0105959_100910Ga0105959_10091014F085820MKLSSLYLFAMLFLATTFFSCETDEPAPRPTWGEIVNPIEAFMYPRDLKVYADDNDGRRWLILVIPDSTKSSFAPTSKSTPGEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGGRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSKKGKK*
Ga0105959_101254Ga0105959_10125418F080164MVGLALCAAPQVTLRERASAFPLITEKDASEIDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLIVGDTVAVHRDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEQVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYGSHRGDLLLDAANARPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS*
Ga0105959_101260Ga0105959_10126010F089055LKVEKMNSTPECVTKTPEIEAREKLAAIFSDAEQRNNSKVNPELGKTAIDIENTSRINSTDDGAVDLCNQALGSYGKSLDYINKSPLETVQAIGNSLQRLREYKTKEICE*
Ga0105959_101629Ga0105959_1016297F032313MYRLLILLFAITLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK*
Ga0105959_101629Ga0105959_1016298F032313MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEEK*
Ga0105959_101997Ga0105959_1019978F046432MTESKLSNIISKYQLPMDDYSVEVDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRRIETLELASDAEDEISKYLFGMYSIFEIKS*
Ga0105959_102044Ga0105959_10204414F094007MKLETVEVLELARPNRADVIDVVDSDGNVVSLDYLGEDFVPDANSYSDKDFTKRNRIIVEMCDLFGGTRRRAGFAEYYRGRGDYDRARRIERNRGSDISEVGRLAINACEACPLKLDCELYGKLGGAVLNKVIDYKRIRTATSLTKAGKKRPGWNKGCIDNDA*
Ga0105959_105196Ga0105959_1051964F033081MHTDITVVYRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQTILHKIKATRYEVYNIILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNVIALSLIAVVLAIDIAKLLCSLLRTSVKVSACHVRSIDVYLC*
Ga0105959_105480Ga0105959_1054803F089055LKVEKMNSTPECVTKTPEIEAREKLAAIFSDAKRYDGNSGVKPELGKAAIDGKDIKNIIKVNSADNEAVDLCNQALGSYGKSLGRINNSPLETVREIGDLLQSFREDKTKESCR*
Ga0105959_105585Ga0105959_1055851F046432MWEMTENELSEIISKYQMPEGRYSLVEEGSFGESEFFWVIKNQSTNQKYLLMNTYSHHGVEAEAEYYREEGFDNLEAIPRKIETLENASDAEDAIFKYLFGLYSIFEIKS*
Ga0105959_106311Ga0105959_1063116F078842MIISSIYKTADNDGLIAHIYEHLLAQYVLKCLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEVLREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSPVFCDNLRKALVRCSDNHKQVILNRSMLNAILGGCIIGGKGWLEMADSARIRQMINSIELDIYEVNS*
Ga0105959_107679Ga0105959_1076792F072446MRKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRQKRELEALSLQLNRLTLTSIGTSPVLCGVKSIEAVGIAENGNTYDLSAEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTLWPLPDYKYNEREL*
Ga0105959_108811Ga0105959_1088112F046432MWGMTESELSEIISKYQLPMDDYLVEVGGAFGRGEFFWIIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRRIETLELASDAEDEISKYLFGMYSIFEIKS*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.