NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000550

7000000550: Human tongue dorsum microbial communities from NIH, USA - visit number 3 of subject 158883629



Overview

Basic Information
IMG/M Taxon OID7000000550 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053108 | Ga0031350
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit number 3 of subject 158883629
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size129314517
Sequencing Scaffolds10
Novel Protein Genes11
Associated Families11

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4164
Not Available1
All Organisms → Viruses1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Minimicrobia → Candidatus Minimicrobia vallesae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F033081Metagenome178Y
F046433Metagenome151N
F054110Metagenome140N
F080166Metagenome115N
F085820Metagenome111N
F092229Metagenome107N
F092232Metagenome107N
F095631Metagenome105N
F095633Metagenome105N
F099453Metagenome103N
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3691504All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4169451Open in IMG/M
C3692076All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41619299Open in IMG/M
SRS075404_LANL_scaffold_25661Not Available837Open in IMG/M
SRS075404_LANL_scaffold_50247All Organisms → Viruses6330Open in IMG/M
SRS075404_LANL_scaffold_66156All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1124Open in IMG/M
SRS075404_LANL_scaffold_69137All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4165484Open in IMG/M
SRS075404_LANL_scaffold_69749All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes8246Open in IMG/M
SRS075404_LANL_scaffold_70262All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41611493Open in IMG/M
SRS075404_LANL_scaffold_7521All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1147Open in IMG/M
SRS075404_LANL_scaffold_9994All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Minimicrobia → Candidatus Minimicrobia vallesae917Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3691504C3691504__gene_221119F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRIKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARKNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIDLVHKDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
C3692076C3692076__gene_222886F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDNIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKMPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHVKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPAAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
SRS075404_LANL_scaffold_25661SRS075404_LANL_scaffold_25661__gene_29303F085820MRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKP
SRS075404_LANL_scaffold_50247SRS075404_LANL_scaffold_50247__gene_58771F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKPKQLYKKESEYSSTKQHEVLLFLVKAYKGGD
SRS075404_LANL_scaffold_66156SRS075404_LANL_scaffold_66156__gene_81764F033081MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPTRWATTTAWTVGVAGVFLMGNFTNYTPSQKFLHKIKATRYEVYNTILLLALLEEQAFRSGSEKWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL
SRS075404_LANL_scaffold_69137SRS075404_LANL_scaffold_69137__gene_88443F092229MSENYRFEHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLEEYREEGKFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG
SRS075404_LANL_scaffold_69749SRS075404_LANL_scaffold_69749__gene_90699F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTISYMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY
SRS075404_LANL_scaffold_70262SRS075404_LANL_scaffold_70262__gene_93358F099453MLRRKDMNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLTNLTNVIYTHKYPIAIAMAKLEYMLGRKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA
SRS075404_LANL_scaffold_70262SRS075404_LANL_scaffold_70262__gene_93360F080166VNILANFENYNKVVEQIFELNYYITFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGSNRQNMLK
SRS075404_LANL_scaffold_7521SRS075404_LANL_scaffold_7521__gene_8515F046433MIELSTSPDALSELSPVAPPKLLSQAQDASRGNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIEMIKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRRHLEVNNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASRDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGKNFDGLSRFRQLLEKE
SRS075404_LANL_scaffold_9994SRS075404_LANL_scaffold_9994__gene_11381F095633MRNYENFTKVGRGEGLTEGELRTMGALAVEATEELKKTTIRKEAVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVEINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEHPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.