NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000386

7000000386: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765135172



Overview

Basic Information
IMG/M Taxon OID7000000386 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052931 | Ga0031278
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 765135172
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size89753140
Sequencing Scaffolds19
Novel Protein Genes21
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available5
All Organisms → Viruses → Predicted Viral4
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4163

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F036281Metagenome170N
F046432Metagenome151Y
F054110Metagenome140N
F067846Metagenome125Y
F072446Metagenome121N
F076191Metagenome118N
F078842Metagenome116N
F081455Metagenome114N
F084362Metagenome112N
F085820Metagenome111N
F089057Metagenome109N
F092232Metagenome107N
F095629Metagenome105N
F095631Metagenome105N
F099453Metagenome103N
F103431Metagenome101N
F103435Metagenome101N
F103436Metagenome101Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C2245678Not Available592Open in IMG/M
C2284115Not Available990Open in IMG/M
C2299793All Organisms → Viruses → Predicted Viral1469Open in IMG/M
C2300863All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1521Open in IMG/M
C2301687Not Available1574Open in IMG/M
C2306861Not Available1989Open in IMG/M
C2309189All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella2301Open in IMG/M
C2309245All Organisms → cellular organisms → Bacteria2309Open in IMG/M
C2313345All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae3368Open in IMG/M
C2315115All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus4424Open in IMG/M
SRS018791_WUGC_scaffold_24999All Organisms → Viruses → Predicted Viral4114Open in IMG/M
SRS018791_WUGC_scaffold_35548All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria1273Open in IMG/M
SRS018791_WUGC_scaffold_39542All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4166389Open in IMG/M
SRS018791_WUGC_scaffold_44403All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4165518Open in IMG/M
SRS018791_WUGC_scaffold_5064All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41619400Open in IMG/M
SRS018791_WUGC_scaffold_50995All Organisms → Viruses → Predicted Viral4132Open in IMG/M
SRS018791_WUGC_scaffold_52867Not Available717Open in IMG/M
SRS018791_WUGC_scaffold_53743All Organisms → cellular organisms → Bacteria2101Open in IMG/M
SRS018791_WUGC_scaffold_7994All Organisms → Viruses → Predicted Viral1518Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C2245678C2245678__gene_118870F032313FLILIFALTLMACDNNTPQEKPREQEKHEVPVSKPKPQFDEVGERIWYGQTPAMRLDSTNYGAGLTSVFGMRTSSISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGINVNHVDTVNYVYDEVGNEIVLDTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEV
C2284115C2284115__gene_132794F084362MNCTFTVRWSDEKNKPHAKTYATEDDAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDNPSESAAEQKGFWWQE
C2299793C2299793__gene_139700F036281MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYNAKEYYDYWAAREGKPAPFFYESRQYHVKSFMRVPGSTDLWITAERETGHWYTFRMSDAQKSKFTRHTMTNEKGHQSYDWVLENVEWAADTIRYF
C2300863C2300863__gene_140192F033081MPRDTRPIFVWPKLVVAIEGYFHRRWLVAIAVLLIAVTIVIAKALLLVPGLDNSVVGLLTSIFETFLPARWATGAAWVAGMTGVFLIGDFTDYTPSQKSLHSLRATKWGVYNALLLFALWEEQAFRSGSEKWSWRERVRASVCFGAIHVVNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRNQIVATGAAATVHTLYNVIVLSLTVAAAAVYLAIDIA
C2301687C2301687__gene_140565F085820MRSTFYLFAMLFVATTFFSCETGEPAPRATWGEIVNPIEAFMYPRDLKVFAGDNDGRRWLILVIPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKGIRVLRTRADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLADGTEIEKELSEKGKK
C2306861C2306861__gene_143171F076191MKIIAENPAEEALLWRIKALSDELVNQDNRSTSMPVWTILDNNKAGKDYGAVMYFTGKAAEQHINENDHHYKKPMICVRSAHDNRELKDVIHLLILAGGNEIPNNHYGFLRDAGY
C2309189C2309189__gene_144420F072446MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRQKRELEAISRQLNALTLTSIGTSPVLCGVKSIEAVGVAERGNTYDLSREMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLARIREDELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPNKSVLENYQQYGFEREATYFTTLWPLPDYKYNEREW
C2309245C2309245__gene_144456F046432MWEMTESKLSNIISKYQLPMDNYLVEIDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRKIETLENASDADDEISKYLFGMYSIFEIKP
C2313345C2313345__gene_146959F067846MESLQAQWERKTFNDYDRRCCAEDAYNEAIEREIECIEDDISNGDSDAICAFSEKMFDDDEFLKAVALGTDYEEMRIKILTAMAEDRLEQIEEDYRKGYILND
C2315115C2315115__gene_148331F103436MITTSKGGWRYKSDFEIFDSLRDWVMKCDVKYVKRDALDKIDYARSLWCRAEYVAAVHLLDENEVFLKKSDWPYYALGIQILRARKHEFFNE
SRS018791_WUGC_scaffold_24999SRS018791_WUGC_scaffold_24999__gene_32284F089057MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSNDEYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAIFDKVLGMVVGQIKHTASSKEGRALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAIDAFQYMSEEDIHIVVDDINSRTVLSRYLNKI
SRS018791_WUGC_scaffold_35548SRS018791_WUGC_scaffold_35548__gene_46633F103431MIDLDALVVGMLFFIQLFLQGIAWRVAITHFLHAERGNAAAAAFNGAFGEDIADCHAEDDDDKDAESQKEGFHV
SRS018791_WUGC_scaffold_37921SRS018791_WUGC_scaffold_37921__gene_49977F103435MKFVFTTEPIYQYYRAYLYPSDKDKLDKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDILNIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE
SRS018791_WUGC_scaffold_39542SRS018791_WUGC_scaffold_39542__gene_52275F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDARDPVAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
SRS018791_WUGC_scaffold_44403SRS018791_WUGC_scaffold_44403__gene_58972F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARKNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMTNNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
SRS018791_WUGC_scaffold_5064SRS018791_WUGC_scaffold_5064__gene_6338F099453MLRRKDMNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEVELKCVKQSCQWILDNMQYIRSLGLVVIPEVYQARLANLGNIIYTPKYPIAIAMAKLEYMLGRKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGV
SRS018791_WUGC_scaffold_5064SRS018791_WUGC_scaffold_5064__gene_6342F081455METMKNLLEMENVECISNLISKCSDYINRKEKNTNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDASIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPEGDLNKRLRYELIRHQGREKDLVIRLSTVLNGTTKYYADIYPDLNKIDLDHHLISSAKK
SRS018791_WUGC_scaffold_50995SRS018791_WUGC_scaffold_50995__gene_70401F095629MTFKERMMRELIICVCLLGCFGVANANNVEQPKEVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEDK
SRS018791_WUGC_scaffold_52867SRS018791_WUGC_scaffold_52867__gene_74800F032313FLILIFALMLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAIRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK
SRS018791_WUGC_scaffold_53743SRS018791_WUGC_scaffold_53743__gene_77511F078842MIISSIYKTADNDGLIAHVYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSSEAKTTYDKVLREFDKLVIPEDDILRAASECGIEMNRNIAEVDRSELSKKLHEVQISPWRKQIDMVYRKAHDESSVNTLFRASYIKYSKESDDLFRECVLEYSIDESHIQTPIDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGSLKKDDKIINQLSYDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSMLNAILGGCVIGGKGWLEMADSARIRQMINSIELDIYEVNS
SRS018791_WUGC_scaffold_7994SRS018791_WUGC_scaffold_7994__gene_10223F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPEKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.