NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008660

3300008660: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 604812005 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008660
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053012 | Ga0111492
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 604812005 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 65121422
Sequencing Scaffolds: 18
Novel Protein Genes: 26
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 4
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 4
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F033081 | Metagenome | 178 | Y
F040149 | Metagenome | 162 | N
F043235 | Metagenome | 156 | N
F047508 | Metagenome | 149 | N
F066860 | Metagenome | 126 | N
F067846 | Metagenome | 125 | Y
F073671 | Metagenome | 120 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F081510 | Metagenome | 114 | N
F089057 | Metagenome | 109 | N
F092229 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F094006 | Metagenome | 106 | Y
F094007 | Metagenome | 106 | N
F095633 | Metagenome | 105 | N
F098763 | Metagenome | 103 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F103431 | Metagenome | 101 | N
F103432 | Metagenome | 101 | N
F103433 | Metagenome | 101 | N
F103435 | Metagenome | 101 | N
F105379 | Metagenome | 100 | N
F105380 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111492_100001 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 239650
Ga0111492_100002 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 239454
Ga0111492_100114 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 36154
Ga0111492_100322 | All Organisms → cellular organisms → Bacteria | 21409
Ga0111492_100374 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 | 19723
Ga0111492_101008 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 10669
Ga0111492_101478 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 7825
Ga0111492_101505 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 7695
Ga0111492_101592 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 7284
Ga0111492_101944 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 5948
Ga0111492_101958 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 5920
Ga0111492_102844 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus | 4027
Ga0111492_104945 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae | 2146
Ga0111492_105120 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2074
Ga0111492_106247 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1682
Ga0111492_107302 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1421
Ga0111492_107714 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1345
Ga0111492_108650 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1195

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111492_100001 | Ga0111492_100001115 | F105379 | MVIHFPLNQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLSKITQLEPSGDNPEIAIHKPVVSVFNWDTEYVKACMNSLREYQIDDNIITRTDEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIVVNQSMLFLPY*
Ga0111492_100001 | Ga0111492_100001137 | F092232 | MNSQSKFIAEYNDRNRPKFNDRFFCKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYAEVQKLLIGEETPSISIKDSDLKLLKVTYYVGCTKDEETFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTATSAKTQSITLKTNSNAVKMLRNFVDLNTTKEESIRLAMFSVYLFDHKVTLFEYYLARFGWYDTLEKFNFQDIIRITDYDIDDPEYYTFAIANSHMKSPFYISAVKSFVDNDRILQSFIASFQRAIMLFATKKTTLDQIYTTQFWIQKLGFNFVSSETSTFTKGNAIIESLENSYDIPTKKRLRLPDEIKADIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINIKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNHGVYEWNSFTNEEEPNVWDENFSKMLNVYREEKGYTSAIMLAEDAGLELTDTRDPDAVAFDAQLLGQTIAMVARTREFETQLRPALINMEDSCSIYFEEA*
Ga0111492_100001 | Ga0111492_100001188 | F099452 | MGKTYEELLQETLSKIYELKDLDNRDRGKALTIFIGERLNRELLLSSRHIFTLYKDIINLDDVSLLTDLRKTDWYEKWFTDDSNNANLIDLSRFNFKTLARFEKEEYLRNVEGYDFEAVTQVDGYSLFDTLIEDKDVELFKLAAENILISHGFFDNTDYNFYDIPDEYMGDKEVCAYMCLLNIENMDFVDKKTLDTTVLYNIVKDRICGSIYFTLFDSLNKDTRTNAR*
Ga0111492_100001 | Ga0111492_100001227 | F089057 | MTNIIPIIAKKYNRKGDTSGSLKSLISDLNCIDNVDDSLLFLSSIPRETKYTLDEVFDLITSDDIYIKIFGNVLTFLNMDLDYHRLLLNAIKSESYKIISIINESIPTPDLFLAKNNYECLSVALDKPFVIFDKILGMVVSQLLHTASSKEERIFGIFMTICIINREINKLASLCTGYLAITRDEVLVKDLMNESAMVAFQYMSTEDINNVVSDINSRTVLSRYLSNM*
Ga0111492_100001 | Ga0111492_10000164 | F081455 | MDLLKHLNETKLNLKFQQEREKRNAIEDSKALIRHLDPDYAEYELESLNPVEEIDAQNVEIIDAVDAIQQPLENKDAGIAVNFSQMINQPAIQEEVAKVVTSVPEQGEPKIKVVFPQNEWILGNYVDYDSFNKIKESNADIIIRSVRVLNFKMSDPNAVAAFNNFIMKFNPECDPNKRLRYELIRHQGREKDIVVRLSTVVNNVKYYADIYADLNKIDLDHHLISSAKKK*
Ga0111492_100001 | Ga0111492_10000165 | F080166 | MEVTFNSILKRLGNDVKENFHTEYIVNSGSLTTNVKYRYRMKLSPKGEQTCVYIDWDNYDDLFNVLEESIKICDPENPRTPFKRTYSDKGDLLDIRCNSLQVKYQHLNDRFGNTIDLIPFVLVDEQSGLLTEAIRFRFNNELVYDVPISRLKGFRRFLMTYNPLLHAGAMARYMAMTPLLGSNRQNMMRS*
Ga0111492_100001 | Ga0111492_10000168 | F099453 | MNRFDIIELAQQTITFVHSAFNGKVNALDPYTRLNFVSGYLDKKTNIARTTPYGCIYVSLEAFADTVEAYRFIDTDQIRNLALEIIIHELTHVDQLIDYRYIKFNNGYREEIERQCVKQSCQWILDNIQFIRSLGLVVIPEVYEERLVGLSDVTYAFKNPAVIAMSKLEHMIGKKFKEFNSNDIEIHYVDRLKNYYKIPVCVNRMYQNSQNLNDLGERLLNDKQYMIEYMEYGNSKLVIKITQGA*
Ga0111492_100001 | Ga0111492_1000018 | F103435 | MKFVFCTEPIYQYYRACVYDADKDKLDKQLLVEYGDYKDIWDLKQQQDALPENIFVAELTSRDYPRNPWNYVSQLINKLTYQYLIDSPDFENIFSEILFNQSEREFYEFYKAIDRFYNGSEIFITVGNDDYSDMVTQMVCSVLRRTYGIRPQIIYDIDDVHSIRDDIDFSPEGAQIAYLQRHTYLALEGKSTVEPLRVWYPFDMNSYTNALE*
Ga0111492_100001 | Ga0111492_10000190 | F092229 | MNKEYRFDHIPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFRSTIEELDLATVLQKVTTRPYIAILNNIFFRYFNSKLIDDMFKLGESTKVLDLAIEYECEYYTINSAKTNIRRYMQQAYFDKYAADSNIISSHRVLNDPQVNAVKSAEFTYDLFTAARSEKFNPEMVRDIFLKYGLKTNSSRNLYTRMNNNLSLYYYMEDYLTEYMLKGSFTYGSQVYSTIKEFKCLPLMNVLTQLTRHNPSGYVLDSNLELVKG*
Ga0111492_100002 | Ga0111492_100002125 | F105380 | MPILELDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRNKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK*
Ga0111492_100114 | Ga0111492_10011426 | F047508 | MLGHRLVEGRVKYPYLRNLGEYLRHGFDTEDVGWVVKRSELCALMEHIYYLWGDTYALSKALCTVYEAVTDGVDLIEGLYEVLFFENVEDNLYATCVVRNVKVALNLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATVEDQDFHKVLYYMVRCELIPSSP*
Ga0111492_100322 | Ga0111492_1003225 | F103433 | MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMGQDCNPSQVKPYVSQYGLQARVIAGLSPNDTSKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFAVMLAFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYNRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0111492_100374 | Ga0111492_10037410 | F103432 | MKLIHSLFSLSLLLALSGLFCTTACQDDAEPTQRAGLISTDSLIHAAKVYDGKAFEHVVSTTATGLRVSEPRRIVPMLPRQLHVEMEGKTIFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAASVDTFSVKDDWLLNTKAEPSLYVPSFRLLVWDQPAEGCTKLRFTLTLVDGRSLVTEVPLY*
Ga0111492_101008 | Ga0111492_10100811 | F067846 | MESLQAQWERKTFNDWDKQCSKEDDYNRAIEMEIESIKEDIANNDSDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILKSLAEERIEQRRKDYEKGFILND*
Ga0111492_101478 | Ga0111492_1014786 | F098763 | MDRRTSLRVLSPALYLATKLFEAVVWTTIADDSSDEVEGKGGMNAVPTAVDE*
Ga0111492_101505 | Ga0111492_10150514 | F081510 | MKLPKINTKAIKAGAKTTYNTAKILGKKYAPIALVTTGLVGYGIAVYQGIKSGKKLEATKAKYEAKDAAGEEYTRMDVVKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTAVTEEHARYRLQCKEVLDEETFKKVDTPMNQVTVEEDGKEAQSFVPKEGLMYGNWFKYSANYASDSPEYNEQWIRESIRVLEEKIARKGLLNFSDMLDQLGFDVPKAALPFGWTDTDGFYIEYDIMEVWNAEEQMHEPQIYVRWKCPRNLYATTNFRDLIPGRKELA*
Ga0111492_101592 | Ga0111492_10159213 | F081510 | MKLPKINTKAIKAGAKTTYNTAKILGKKYAPIALVTTGLVGYGIAVYQGIKSGKKLEATKAKYEAKDAAGEEYTRMDVVKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTAVTEEHARYRLQCKEVLDEETFKKVDTPMDQVTVEEDGKEAQSFVPKEGLMYGNWFKYSANYASDSPEYNEQWIRESIRVLEEKIARKGLLNFSDMLDQLGFDVPKAALPFGWTDTDGFYIEYDIMEVWNAEEQIHEPQIYVRWKCPRNLYATTNFRDLIPGRKELA*
Ga0111492_101944 | Ga0111492_1019443 | F043235 | MDEVVPSDEGHLLIDLCDDDPRSLCSGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDSLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSAH*
Ga0111492_101958 | Ga0111492_1019583 | F040149 | VADEGAKELRWEVLIEEQGIPVLFVEVEAWYDGRVSSSEILRSVGIALEREPRLTSVWSHDSEDAIDYFIYDVLVPEGHALTAVRERETVVAQLLNIHRYVYYP*
Ga0111492_102844 | Ga0111492_10284413 | F073671 | MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYMDKNKGQK*
Ga0111492_104945 | Ga0111492_1049452 | F094006 | MRGALEPADPTGALYESVVSVEAHIGELQAGGIACRGLFAFGGSNLLAVAELGAEAPHAIDMYDIAVCEPLSELFAEELEYAFNFSA*
Ga0111492_105120 | Ga0111492_1051202 | F033081 | MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAIVLAIDITKLL*
Ga0111492_106247 | Ga0111492_1062472 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDGE*
Ga0111492_107302 | Ga0111492_1073024 | F103431 | MIDFDTLVVGMLFFIQLFLQGIAWRVAIAHFLHAERGNAATAAFDGAFGENIADCHAEDDNDKNAESKEEGFHVCIPEG*
Ga0111492_107714 | Ga0111492_1077142 | F094007 | ARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGGIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLNYKKVRTAISLTKAGKKRSGWNKGCIDNNA*
Ga0111492_108650 | Ga0111492_1086502 | F095633 | MGNYENSTEVGRGEGLTEGELRTMGTLAMEATEELKKTTISKEAVLLGSVPFGSWDEFAKAVQEMAAHKMTAYSYEPIPVKINTKRLIAIAFLDDRGEMSVEEHSVPEEVFIDLSRTRCVVDADRSHKSYKFTCPVLKRYPDGELYPIREAYVISAIDVNGSQEVDFKII*
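
Rows from the Sequences table can be converted to FASTA for downstream tools. The following is a minimal sketch, not an NMPFamsDB export: the function name `to_fasta`, the header layout, and the 60-residue line width are illustrative choices, and the trailing `*` is assumed to mark the stop codon and is stripped. The two example rows are copied from the table above.

```python
# Two rows copied from the Sequences table:
# (scaffold ID, protein ID, family, sequence).
ROWS = [
    ("Ga0111492_102844", "Ga0111492_10284413", "F073671",
     "MNKEQAEHELAELHEKERSLEKALELVREKIRELVNYMDKNKGQK*"),
    ("Ga0111492_101478", "Ga0111492_1014786", "F098763",
     "MDRRTSLRVLSPALYLATKLFEAVVWTTIADDSSDEVEGKGGMNAVPTAVDE*"),
]


def to_fasta(rows, width=60):
    """Return FASTA text for the given rows.

    The header layout is a hypothetical convention chosen for this sketch.
    """
    records = []
    for scaffold, protein, family, seq in rows:
        # Assumption: a trailing '*' is the stop-codon marker, not a residue.
        residues = seq.rstrip("*")
        header = f">{protein} scaffold={scaffold} family={family}"
        # Split the sequence into fixed-width lines.
        lines = [residues[i:i + width] for i in range(0, len(residues), width)]
        records.append("\n".join([header] + lines))
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

The protein ID is used as the record identifier because it is unique per row, whereas one scaffold can carry several proteins.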




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice