NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000351

7000000351: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 763536994



Overview

Basic Information
IMG/M Taxon OID7000000351 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052890 | Ga0031290
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 763536994
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size113056216
Sequencing Scaffolds18
Novel Protein Genes23
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip21
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria5
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4165
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
Not Available1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F046432Metagenome151Y
F066860Metagenome126N
F068942Metagenome124N
F072446Metagenome121N
F080166Metagenome115N
F081455Metagenome114N
F085820Metagenome111N
F089057Metagenome109N
F092229Metagenome107N
F092232Metagenome107N
F095629Metagenome105N
F095631Metagenome105N
F095633Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F103432Metagenome101N
F105379Metagenome100N
F105380Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C2212453All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2506Open in IMG/M
C2291859All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium1193Open in IMG/M
C2319335All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales3731Open in IMG/M
SRS047219_WUGC_scaffold_18901All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1090Open in IMG/M
SRS047219_WUGC_scaffold_19780All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41656794Open in IMG/M
SRS047219_WUGC_scaffold_24027All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4169520Open in IMG/M
SRS047219_WUGC_scaffold_33129All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41623873Open in IMG/M
SRS047219_WUGC_scaffold_36275All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes23142Open in IMG/M
SRS047219_WUGC_scaffold_4061All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41626884Open in IMG/M
SRS047219_WUGC_scaffold_41754All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1889Open in IMG/M
SRS047219_WUGC_scaffold_42129All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2801Open in IMG/M
SRS047219_WUGC_scaffold_49315All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41621987Open in IMG/M
SRS047219_WUGC_scaffold_50997All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales21552Open in IMG/M
SRS047219_WUGC_scaffold_52645All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1317Open in IMG/M
SRS047219_WUGC_scaffold_58068All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1250Open in IMG/M
SRS047219_WUGC_scaffold_58477All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae12168Open in IMG/M
SRS047219_WUGC_scaffold_59356Not Available562Open in IMG/M
SRS047219_WUGC_scaffold_61395All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella4342Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C2212453C2212453__gene_135847F105380GRQTEYSINLSELDDQISDGIVYADHEGKMIYKFGAKKILQTAITNGLTITGLASEFKMKHYSFWVPDLYFISYSSFNPNSDLYIAYRSKDAKKICLTNIWSGSGNVDFYSPNGGRLVYNRLCKGRMMDDIGTDDYEAWKRSPVNRASNFVNKFISARGNSDLDFISN
C2291859C2291859__gene_164987F046432MWEMTENELSEIISKYQMPEGRYSLVEEGSFGESEFFWVIKNQSTNQKYLLMNTYSHHGVEAEAEYYREEGFDNLEAIPRKIETLENASDAEDAIFKYLFGLYSIFEIKS
C2319335C2319335__gene_179095F085820MKLSSLYLFAMLFLATTFFSCETDEPAPRPTWGEIVNPIEAFMYPRDLKVYADDNDGRRWLILVIPDSTKSSFAPTSKSTPGEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTDKQIFDCQFKCGGRSIFAKPLGETVEADYLWLPGRDVFGLIAPPNPDHLKQRIVLRLADGTEIEKELSKKGKK
SRS047219_WUGC_scaffold_18901SRS047219_WUGC_scaffold_18901__gene_25256F046432MTESKLSNIISKYQLPMDDYSVEVDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRRIETLELASDAEDEISKYLFGMYSIF
SRS047219_WUGC_scaffold_19780SRS047219_WUGC_scaffold_19780__gene_26462F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFEGVIEVDSYSLYDTLAEDNSVELFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDKKTFKSTELFYIVKNNICGTIFFTLFDRMNEDTRTRAR
SRS047219_WUGC_scaffold_24027SRS047219_WUGC_scaffold_24027__gene_31891F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGANGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMNMKIFVFAKDVFGYNAGLHKTDQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
SRS047219_WUGC_scaffold_33129SRS047219_WUGC_scaffold_33129__gene_43833F099453MNRFDIIELAQETLIFVYNTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLTNVIYTPKYPIAIVMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA
SRS047219_WUGC_scaffold_33129SRS047219_WUGC_scaffold_33129__gene_43852F092229MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFIKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGRFIYGTKEYKILKELRGLPLMVVLTQLTRKNDSGYILNSNLELVKG
SRS047219_WUGC_scaffold_36275SRS047219_WUGC_scaffold_36275__gene_47867F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY
SRS047219_WUGC_scaffold_36275SRS047219_WUGC_scaffold_36275__gene_47872F095629MMRELIICVCLLGCFGIANANNVEQPKEVKIVHNDNSVALHKKIYQLEKRIERLEELLKKEGK
SRS047219_WUGC_scaffold_36275SRS047219_WUGC_scaffold_36275__gene_47887F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAASAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDARDPAAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
SRS047219_WUGC_scaffold_4061SRS047219_WUGC_scaffold_4061__gene_4942F089057MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVIGQIKHTASSKEGRVLGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRTVLSRYLNKM
SRS047219_WUGC_scaffold_41754SRS047219_WUGC_scaffold_41754__gene_55256F033081MFPPDLIVVHRPQKGVMAWLLRRVMPQDPRPVFVWPRLVAAIGNVGYFSRRGFSVLAVGLIIVTIATIKILLFVPGLNQSVVSLLTRGLETFLPTGWATIAAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDALLLLALIEEQAFRSGSEKWSWHGRARASVCFGLIHVTNIWYSFAAGIALSATGFGFLLVYLWYYRKYRNQIVATAAAATVHTLYNVIALSLIVVAAAIILVIDIAKMM
SRS047219_WUGC_scaffold_42129SRS047219_WUGC_scaffold_42129__gene_55755F095633MGNYENSTEAWRREGLTEGELRIMGALAVEAIEKLRKTTVREETVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVEINTKRLIATAFLDDEGEMSVEERFVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPVLMEHPDGELCPTRKAYVISAIDVNGSQEVDFNIIYGGLN
SRS047219_WUGC_scaffold_49315SRS047219_WUGC_scaffold_49315__gene_65489F081455METTNIQEINMKAAEKLGELFDYVFCGKKPNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDTSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK
SRS047219_WUGC_scaffold_49315SRS047219_WUGC_scaffold_49315__gene_65490F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNNIIKRINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTIIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGSNRQNMLR
SRS047219_WUGC_scaffold_50997SRS047219_WUGC_scaffold_50997__gene_67945F103432MKLIHSLFSLSLLLVLGGLLCITACQDDAEPTQRTGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVEMEGKTLFRRHSLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVVGIDSKGKPRDLGNYSCPLLQGKRKNVNFRTTEGFRHEYFEAERIDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLY
SRS047219_WUGC_scaffold_52645SRS047219_WUGC_scaffold_52645__gene_70419F033081MHTDITVVYRPKKGVMAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQTILHKIKATRYEVYNIILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNVIALSLIAVVLAIDIAKLLCSLLRTSVKVSA
SRS047219_WUGC_scaffold_58068SRS047219_WUGC_scaffold_58068__gene_80427F066860MTTKKQKLQKQQAIDTWIVVALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDKE
SRS047219_WUGC_scaffold_58477SRS047219_WUGC_scaffold_58477__gene_81249F032313MACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK
SRS047219_WUGC_scaffold_58477SRS047219_WUGC_scaffold_58477__gene_81250F032313MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEEK
SRS047219_WUGC_scaffold_59356SRS047219_WUGC_scaffold_59356__gene_83145F068942TKSVMIRKILSLPTLALCFTLCTALFAGCGENNLGFVTEVRWSNVKNPKYGDDINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFPEMDKDTAYYKDIVIYLTRNESKGTATLKLVAPPNRTQQPKHFKFSVSVTPPGTYFFNVRQPALPAKAQ
SRS047219_WUGC_scaffold_61395SRS047219_WUGC_scaffold_61395__gene_88627F072446MRKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRQKRELEALSLQLNRLTLTSIGTSPVLCGVKSIEAVGIAENGNTYDLSAEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTLWPLPDYKYNEREL

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.