NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008333

3300008333: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 508703490 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008333 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053264 | Ga0115302
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 508703490 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size123636095
Sequencing Scaffolds14
Novel Protein Genes25
Associated Families23

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4732
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationNational Institutes of Health, USA
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F046433Metagenome151N
F054110Metagenome140N
F067846Metagenome125Y
F073671Metagenome120N
F077405Metagenome117N
F080166Metagenome115N
F081455Metagenome114N
F085820Metagenome111N
F089055Metagenome109Y
F089057Metagenome109N
F092229Metagenome107N
F092230Metagenome107N
F094007Metagenome106N
F095629Metagenome105N
F095631Metagenome105N
F097527Metagenome104N
F099452Metagenome103N
F099453Metagenome103N
F103432Metagenome101N
F103433Metagenome101N
F103435Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0115302_100002All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis329620Open in IMG/M
Ga0115302_100003All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales232039Open in IMG/M
Ga0115302_100067All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47383519Open in IMG/M
Ga0115302_100087All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus69251Open in IMG/M
Ga0115302_100139All Organisms → cellular organisms → Bacteria52455Open in IMG/M
Ga0115302_100214All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis42209Open in IMG/M
Ga0115302_100344All Organisms → cellular organisms → Bacteria32153Open in IMG/M
Ga0115302_100602All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47321575Open in IMG/M
Ga0115302_101121All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae13847Open in IMG/M
Ga0115302_101905All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae9603Open in IMG/M
Ga0115302_104615All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis4775Open in IMG/M
Ga0115302_105299All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae4240Open in IMG/M
Ga0115302_106777All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae3399Open in IMG/M
Ga0115302_113815All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1685Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0115302_100002Ga0115302_100002193F046433MIELPTNPDALSELSPVVPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPSFMESRCETTEYEAINDFVQFIEMTKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRHHLEIDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIIGGDQVRERISGFEVDNDPESHEASVLVMAASGDYLDNGISAYSRYGGTIYPVEACYVLKNSSDAGGVSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGERIDDLSLPALANIVRPYRNGKNFDGLSRFRQLLEKE*
Ga0115302_100002Ga0115302_100002272F033081MHTDITVVYRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKFSILAVGLIIMTIAMIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATATAWTVGMAGVFLMGNFTNYTPSQKFLHKIKATRYEVYNILLLLALLEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAIVLAIDITKLL*
Ga0115302_100003Ga0115302_100003105F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGISNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGANGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYNPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0115302_100003Ga0115302_100003133F103435MKFVFTTEPIYQYYRAYLYPSDKDKLDKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILLNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDVLNIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE*
Ga0115302_100003Ga0115302_100003141F089057MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVVGQIKHTASSKEGRALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRSVLSRYLNKM*
Ga0115302_100003Ga0115302_100003182F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLTDLRKTPWYKKWFTYDQENSRLIDLSKFNFRSLERFEKVEYLKDAEHYDFEGVIEVDSYSLYDTLAEENGLNLFCFAAENILLNHGFFNNTDYQLYDVPEEYIDDQEVCMYMCLLNKDNIDFIDKNTYEDSVLYDIVRDRIFNAIYWSIRDSIEEDTRTRAR*
Ga0115302_100003Ga0115302_10000325F095629MRELIICVCLLGCFGVVNANNVEQPKEVKIVHNDDSIILHKKIYQLEKRIERLEELLKKEGK*
Ga0115302_100003Ga0115302_10000354F092229MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLEEYREEGRFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG*
Ga0115302_100003Ga0115302_10000373F099453MLRRKDMNRFDVIELAQQTLTFVYNTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNMQYIRSLGLVVIPEVYQARLANLTNVIYTPKYPIAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA*
Ga0115302_100003Ga0115302_10000376F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTVIEEAINICDPENKMSPFRRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGSNRQNMLR*
Ga0115302_100003Ga0115302_10000377F081455MEPLGKNSIKLMEKVLDNIILKSKKDIPPAEEIGAETVDIIDSAEEAIQQPLENKDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKDLVIRLSTVINGKTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0115302_100067Ga0115302_10006716F103432MKLIHSLFSLPLLFVLGGLFCTTACQDDVEPTQRTGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDGKTIFRRHTLPSVSAYSLQVVAVGDTIYRQKESDAQFNADLDALFRQSIGIAPRLFGVRELSVAGIDRKGKPRDLGNYSCPLLQGKRRNVNFRTKEGVFHEYFEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLR*
Ga0115302_100087Ga0115302_10008727F073671MNKQQAEHELAELHEKERSLEKALELVREKIRELVNYTDKNKV*
Ga0115302_100087Ga0115302_10008732F067846MESLQAQWERKTFDDYDRRCCAEDAYNEAVEREIECIEDDISNGDSDAICAFSEKMFDDDEFLKAVALGADYEEMRIKILTAMAEDRLEQLEEDYRKGFILND*
Ga0115302_100139Ga0115302_10013914F094007MKSKTVEVLALARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDVNSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAEYHRGRGNYDRARRIERNRGSDISEVGRLAINACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0115302_100214Ga0115302_10021424F089055LKVEKMNSTPECVTKTPEIEAREKLAAIFSDAEQRGDNSKVSPELGKAAIDIENTSKMDSADNGAVDFCNQALGGYGKSLDYINNSPLETVQAIGNSLQLFREDKTKESCK*
Ga0115302_100344Ga0115302_1003443F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGRITASVFVVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKLKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFSGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYNRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0115302_100602Ga0115302_1006023F085820MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVVAAREEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSEKRKK*
Ga0115302_101121Ga0115302_10112112F067846MESLQAQWERKTFNDYDRRCCAEDAYNEAVEREIECIEEDIANNDSDALCAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQLEKDYKNGYILND*
Ga0115302_101121Ga0115302_10112118F073671MNKQQAEHELAELHEKERSLEKALELVREKIRELINYTDKNKGNNNGAKF*
Ga0115302_101905Ga0115302_1019053F032313MNFMSNCNWRAQSVSIESFKIVLMYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWITTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWFVLRLNKNAVEFLQRGRTMWGPYDWYYGRNSGRSEVTLEAK*
Ga0115302_104615Ga0115302_10461510F054110MNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKTYKGGD*
Ga0115302_105299Ga0115302_1052995F077405GRALPTELFPRLLVAKQRGVFYGFISWCQIKFVKKFFD*
Ga0115302_106777Ga0115302_1067772F092230MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSEGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPEGTYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRQLFASAIMVNLRVVELYKILTVCMILIAIAMTIGYFSYLKPETVYGFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCMLFLLIV*
Ga0115302_113815Ga0115302_1138151F097527MIYFKMEKIGNSTYNKEKKTRSENLVFNTIPAAGVEPARPCGHWILSSITPLF

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.