NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000508

7000000508: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 765013792



Overview

Basic Information
IMG/M Taxon OID7000000508 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053060 | Ga0031277
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 1, subject 765013792
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size140881499
Sequencing Scaffolds20
Novel Protein Genes23
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available6
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria5
All Organisms → Viruses → Predicted Viral1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032313Metagenome180N
F033081Metagenome178Y
F036281Metagenome170N
F043235Metagenome156N
F043991Metagenome155N
F045567Metagenome152N
F046431Metagenome151Y
F051214Metagenome144N
F066860Metagenome126N
F068942Metagenome124N
F071328Metagenome122N
F074985Metagenome119N
F076191Metagenome118N
F080164Metagenome115N
F080165Metagenome115N
F085820Metagenome111N
F089055Metagenome109Y
F095633Metagenome105N
F099454Metagenome103N
F103432Metagenome101N
F105376Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3609730Not Available634Open in IMG/M
C3625295Not Available704Open in IMG/M
C3630491All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes732Open in IMG/M
C3645892All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas835Open in IMG/M
C3658445All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium953Open in IMG/M
C3658817All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes957Open in IMG/M
C3668094All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1085Open in IMG/M
C3685531All Organisms → Viruses → Predicted Viral1481Open in IMG/M
C3690875All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1681Open in IMG/M
C3709167Not Available3983Open in IMG/M
SRS018739_WUGC_scaffold_13867All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter1014Open in IMG/M
SRS018739_WUGC_scaffold_23270All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae1081Open in IMG/M
SRS018739_WUGC_scaffold_32985All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1523Open in IMG/M
SRS018739_WUGC_scaffold_43287All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria30417Open in IMG/M
SRS018739_WUGC_scaffold_45178Not Available2352Open in IMG/M
SRS018739_WUGC_scaffold_48794All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2389Open in IMG/M
SRS018739_WUGC_scaffold_54250All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria652Open in IMG/M
SRS018739_WUGC_scaffold_59731Not Available797Open in IMG/M
SRS018739_WUGC_scaffold_59741Not Available2500Open in IMG/M
SRS018739_WUGC_scaffold_59760All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria817Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3609730C3609730__gene_177996F068942KYPLEATKPILQIQSSVLIDNPHTKSVMIRKILSLPTLALCFTLCTALFAGCGENNLGFVTEVRWSNVKNPKYGDDINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFPEMDKDTAYYKDIVIYLTRNESKGTATLKLVAPPNRTQQPKHFKFSVSVTPPGTYFFNVRQPALPAKAQ
C3625295C3625295__gene_185541F085820LFAMLFLATTFFSCETNEPAPRATWGEIVNPIKAFMYPRDLEVSEDQYDSRRWHILVVPDSTKSSFAPTSKSTPAEVARYKELSRLVGNPTEPVVNECNFRRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLYFYTYKQIFDCHFKCGDRSIFAKPLGEVVEADYLWLPGIDEFGLVTPPNPDHLKQRIVLRLADGTEIEKELSEKGKK
C3630491C3630491__gene_188147F051214MPGKIVAHDTHLRIDTEFIELNDCFEAFRRGVEYREKNDVDDILVICNAPDIIEYQLKNGDSFIVTYDPIHRIIVMRVFLHDEDITIKPIYIYNNREYQIACEFLRQVMHDKIDLKDEWI
C3645892C3645892__gene_196006F099454MKASELLWAVVIALTFVFTSCDRLTDEPTLEDRGYKYFDSTAQRKSFRVVTASGKPYNHKIDWHIIGIRDSKSDTYLTKKVDTLSNGDLKISYDWISFTIREKKSVIDVEVQKNETGEDRSVKFVAQDNHKGLASPSMKVIQQAK
C3658445C3658445__gene_202771F080165VKERVFYGEDMKLGEEDEERFQDFQETNNPKKYEYYTKQNPQLAWVHYPYIRNKKTKQEYSIYEDGPMGGRYIIYGDSLYISNHYNIYEEDSLRYTFTRYILR
C3658817C3658817__gene_202980F043991KNPSVVDYFDLNGDLNEEAYEFEDVKLEEYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLNIVYADINFAGGIRTILFKCRQKKNLTRFISRVLEIAQGDPSNVHPDFRA
C3668094C3668094__gene_208156F066860MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTITIISGFVVVLSLFVAVTFGIAEDKE
C3685531C3685531__gene_219150F074985MELTDGGWYKTPRIIKGTDFLAHIHDTYASGNAMYAEFKASEGEVRILEYQRLYEVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQQAWKAIHMGSTKRFNLEQFDQLWLDQTFKKLHPVIVNHDGKFWHVMGLKLDVDADGSFWGLYLKRQDSDFMKEIRMPLTQKFIYNPISGSWSLDDPTQEIKDLEEIKQTLRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLR
C3685531C3685531__gene_219151F036281MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDRWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYSAKEYYDYWAAREGKPAPFFYESRQYHVKSFMRVPGSTDLWITAERETGHWYTFRMSDDQKSKFTRHTMTNEKGHQSYDWVLENVEWAADTIRYF
C3690875C3690875__gene_223003F043235MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVNTLIQISGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHMEGFHVREALAIIVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSAH
C3709167C3709167__gene_239194F071328MSQKHWTFTHIIRYIEEYERNPLLIERMKWEFILEGEYIVEFVKLCKHLVLERTIDPNKHQAAAIYLRYSSQLLLKKRRAIRRLGIEKKYVSAILRQYGIHYIEYGDNEHRVFFLDRGINLYFSKHDQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW
SRS018739_WUGC_scaffold_13867SRS018739_WUGC_scaffold_13867__gene_14555F105376MDNTDKEVSAKEFGALGADVIHIKESVDRHTVTLERIENIARANVTQAQLKAYIAEHEKESEEKYVKRTEIEGVMNFWSLVTSNLAKLFAIALVGLAIYATNNLIQQNKAVTELQEEVQQTQVRRK
SRS018739_WUGC_scaffold_23270SRS018739_WUGC_scaffold_23270__gene_24876F045567MRQRDGRDTLTEELEGGITPLLYRAEGEARRPWVRMVTEDVVHTSTHRVKDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDSIDTIDTETCGLVVEVRATVDEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEIPSLSSERGCVVERVVR
SRS018739_WUGC_scaffold_32985SRS018739_WUGC_scaffold_32985__gene_36147F089055KVEKMNSTPECVTKTPEIEAREKLAAIFSDAERRGDNSKVSPELGKTAIDIENTSKMDSADNGAVDFCNQALGGYGKSLDYINNSPLETVQAIGNSLQLFREDKTKESCK
SRS018739_WUGC_scaffold_43287SRS018739_WUGC_scaffold_43287__gene_49283F076191MSMKIIAENPAEEALLWRIKALSDELVNQDNRYTSMPVWTILDNNKAGKDYGAVMYFTGKAAEHHINENDHHYENPTTCIRSAHDNRELKDVIHLLILAGDNEIPSNHYGVLRDA
SRS018739_WUGC_scaffold_45178SRS018739_WUGC_scaffold_45178__gene_51836F103432MKLIHSLFSLPLLLVLGGLLCLTACQDEAEPARAAWLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRIVPMLPRQLHVTMEGKTLFRRHTLPSVSAYSLRVVAVGDTIYRQKESDAEFNADLDALFHESIGIAPRLFGVKELSVVGIDRKGKPRDLGNYSCPLLQGKRRNVNYRTREGIFHEHYEAASVDTFSVKSNWLLKTKAEPSLYVPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLR
SRS018739_WUGC_scaffold_45178SRS018739_WUGC_scaffold_45178__gene_51837F080164MVGLALCAAPQVTLRERASAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLVVGDTVAVHRDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDEHVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYGSHRGDLLLNAANARPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS
SRS018739_WUGC_scaffold_48794SRS018739_WUGC_scaffold_48794__gene_56743F046431MKKITKINITIILILSIIISYGSVIISMAAESELTLTPKPETNNIHLKWTGPQNSSYKVYQKKPDSNQFETIGLTDFSNNATDEEVKVLNIYPQENNANPRLYPDQTPEQVRQHFTSLPKVQVTYLDGQTETIPKSALLKVWMEGRNSKRR
SRS018739_WUGC_scaffold_54250SRS018739_WUGC_scaffold_54250__gene_65813F095633MKNYENSTEAWRGEGLTEGELRTMGTLAMRATEELRKTTVRKEAVLLGSVPFNSWDEFAKAVQEMAAHSYEPIPVKINTKRLIATAFLDDGGEMSVEENSVPEEVFIDLSRTRCVVDADRSHKSYEFTCPVLKKFPDGELYPIREAYVISAIDVNGSQEVDFKII
SRS018739_WUGC_scaffold_59731SRS018739_WUGC_scaffold_59731__gene_77117F085820RSTFYLFAVLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGEVVEADYQWLPGRDGFGLITPPNPDHLKQRIVLRLADGTEIEKELSEKGKK
SRS018739_WUGC_scaffold_59741SRS018739_WUGC_scaffold_59741__gene_77146F032313MYRFLILIFALTLMACDNNTPQEKPHEQEKHEVPVSVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVIVLRRQGINVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK
SRS018739_WUGC_scaffold_59741SRS018739_WUGC_scaffold_59741__gene_77147F032313MYRFLILIFALTLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK
SRS018739_WUGC_scaffold_59760SRS018739_WUGC_scaffold_59760__gene_77195F033081MAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVSMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNILLLLALLEEQAFRSGSERWNWRERVRASVCFGLLHVMNIWYSFAAGIALSVTGFGFLLVYLWYYRKYCSQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.