NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008275

3300008275: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 764447348 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008275 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053113 | Ga0114253
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 764447348 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size162144041
Sequencing Scaffolds21
Novel Protein Genes22
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2796
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4731
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp.1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae1
Not Available2

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F030786Metagenome184N
F032313Metagenome180N
F040149Metagenome162N
F041827Metagenome159Y
F042387Metagenome158N
F043235Metagenome156N
F045567Metagenome152N
F047508Metagenome149N
F051213Metagenome144N
F053092Metagenome141N
F055792Metagenome138N
F061925Metagenome131N
F061926Metagenome131N
F066860Metagenome126N
F077404Metagenome117N
F077405Metagenome117N
F078842Metagenome116N
F089056Metagenome109N
F090516Metagenome108N
F092230Metagenome107N
F094006Metagenome106Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0114253_1000307All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27938181Open in IMG/M
Ga0114253_1001016All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27917426Open in IMG/M
Ga0114253_1001028All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27917295Open in IMG/M
Ga0114253_1001080All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis16718Open in IMG/M
Ga0114253_1001335All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27914371Open in IMG/M
Ga0114253_1001683All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales12282Open in IMG/M
Ga0114253_1001816All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47311656Open in IMG/M
Ga0114253_1001980All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27911018Open in IMG/M
Ga0114253_1002468All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae9464Open in IMG/M
Ga0114253_1002497All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2799400Open in IMG/M
Ga0114253_1004310All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae6060Open in IMG/M
Ga0114253_1004472All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes5866Open in IMG/M
Ga0114253_1005054All Organisms → cellular organisms → Bacteria5272Open in IMG/M
Ga0114253_1005537All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae4911Open in IMG/M
Ga0114253_1006420All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae4320Open in IMG/M
Ga0114253_1008449All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae3387Open in IMG/M
Ga0114253_1011748All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae2461Open in IMG/M
Ga0114253_1016768All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp.1727Open in IMG/M
Ga0114253_1020612All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae1407Open in IMG/M
Ga0114253_1024381Not Available1180Open in IMG/M
Ga0114253_1049956Not Available559Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0114253_1000307Ga0114253_100030710F053092MQSAQGLILCLTHALLVLRALILEPTEMEDTMDDHAVQLFGILGAKELRIATHRIKADEHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIAGSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK*
Ga0114253_1001016Ga0114253_100101612F055792VEIAGEALDSTSAVTHRILLLTTQLGESLLASFGAEDGVIAEAMVTRALESNLSIDSTLEEVGPVLIDKGDDGTEASTTWGRYTLETLQKESNILFEGSMLPCEARRVDPRCSVKSLDLEPRIIGEAIEPVALPDVTRLDESISLQGIGGLRDLLMTPDVSETDHLQTSREEGTDLLQLMGIIARKYQLFHTFVS*
Ga0114253_1001016Ga0114253_100101613F040149VADEGAEELRWEVLIEEQGIPVLFVEVEAWYDGRVSSSEILRSVGVALEREPRLAPVWSHNSEDAIHDFIYDTSVPKGHTLTAVRERETVVAQLLNIHRYVYYP*
Ga0114253_1001028Ga0114253_10010287F045567MVTEDVVHTSTHRVEDALLPVDGDILTPRDGTHIVETERVVVVLVSQEDSIDTINTETCGLVVEVRATVDEDTLPTLGDDEGRGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHDKEIPSLSSERGSVVERVVR*
Ga0114253_1001080Ga0114253_10010809F066860MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLAPWALIVSCICLAIISRQVKKRHASKDHLTTIVRVSFIVMSISLFIDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDRE*
Ga0114253_1001335Ga0114253_10013353F094006MWSALEPAGPTGALYESAVCVEAHIGYLQTGGIACRGLFAFEGSNVLAVAELGAEAPHAINMYDIAVCEPLSELFAKELEYAFNFGA*
Ga0114253_1001683Ga0114253_100168313F061926ASVLLLIFVMNVSGLFVRLHHQETHQKTHQKTEKIAECSDKVCYHKAHLQTKSDCDCGFLCTLNYFYILPEKPQTEIHVNEYFSYFSSYKIFVSERIILLWQSRAPPVLS*
Ga0114253_1001816Ga0114253_10018166F077404MAQQIIMTHKLAAAALSLKGPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR*
Ga0114253_1001980Ga0114253_10019808F051213MALTFVLTSCDPFSQNEPTIEGNRYKYFDSSAQRKSFRVVNGSGKQYNHKVDWHIIGIQEENSDTYLTKKVDTLSNGDFRISYDWVTFTVKENKSVIDVEVQKNETGKDRSVFFATSNSYKQAYLPNMIVTQRAK*
Ga0114253_1002468Ga0114253_10024687F042387MKRIILFFMAGVFHVFFCRGNDKMTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKTNYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPGGHFYLNPNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDEDENTIADYLNFRWVSHNPTLIKKSINLYGSGVDSSEYKFQYEYKNNFPIKTKLNIDNQIVTTMVYEYNK*
Ga0114253_1002497Ga0114253_10024973F043235VDEFGTQLTDALLDRPDRSKRELHQWAVNGDDIVQLRHMDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIAPFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDTLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGETAGGCHSMAKDQEVPALYCRSHGFKGGRSMASNVLLPGSAH*
Ga0114253_1004310Ga0114253_10043101F041827MKHFLSALALGCLLLSCNRDNETNETPAPPQNEKLVLLKSASPGGIELNYKNRNEIESLSTDGMYGKSDINYEYDAYGRIIKERRFSHRYDYGETNITYQYDGQGRLTSSHAISTEFYPGTGLTPRCSVEKKHAYTYQGNKVIVKIEMGTDTCSAIPETGKEKTITLLVENGKVVKSLDENNQIIETIEYHNTKNALRNIKGFPALVVEFYIRAFTYKLNWYNNIELVQDLRFIDNVKTRNYPNGNYIMYEYRTENYEITDKD
Ga0114253_1004472Ga0114253_10044725F092230MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSDGDYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCMLFLLIV*
Ga0114253_1005054Ga0114253_10050544F078842MIISSIYKIADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEVKKTYDEALREFDKLVISEDDILRAASECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGSLKKDDKIINQLSYDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSMLNAILGGCVIGGKGWLEMADSARIRQMINSIELDIYEVNS*
Ga0114253_1005537Ga0114253_10055372F032313MACDNNTPQEKPHEQEKHEVPVPKPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEEK*
Ga0114253_1006420Ga0114253_10064204F061925MSWEYSINLDSEEAVSSVVTDLKICELFSSSTTDYIDWKNPKSIDSAPYDARFYTDKKKTIYISINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI*
Ga0114253_1008449Ga0114253_10084495F077405LLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQKQKARLK*
Ga0114253_1011748Ga0114253_10117482F090516MKSNLSLKLFLAFERYFIENDEVISLDKSSEFTDVIVGIGFLPQDMSENTDFKKKTIEKYGFSSSMALADDFRKRVLNIDEPIPENFEKDGIGYVYTVISGYDTFYNRMYMFGIHCFNGDFNVTYYDLDNDAGTGDYYEEHELYSQAKGYRWLDPESDYYEDVLAWEALNKLATDIYFHLEDKLDVKIDIKPIPEEEKVVPTQEHLAKFLAFCGVEQDVIDENKERLLKALEEYTPDEYEGISEAMAEMMEYSHKIQRAEPVIEIIREYGVCRFSDWKFYAEELEEYILDLADFSDWKWEYPADTYSADLFPYMRKQLSLYHLWLCHLDEGADAYLFLLFSEKDMPEIMKLARILDLPLKAYFK*
Ga0114253_1016768Ga0114253_10167683F047508MLGHRLVEGRVKYPYLRDIGEYLRHSFDTEDVGWVVKWSELCALMEHIYYLWGDTYALSKALCTVYEAVTDGVDLIEGLYEVFFFENVEDNLYAACVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATVEDQDFHKVLYYMVRCE
Ga0114253_1020612Ga0114253_10206123F089056IIKKMNVFCKIILPLLCIISCSKRKEADNTMVLEKNHAFSLWNNDSLGCKHERTIEMGEELYNTFKKSNKNDSILLKKYLGTPTRRFKDKEEIVFMYYINSCCDNGLLLEECDVSFISITFTNKNKILFGKGIQ*
Ga0114253_1024381Ga0114253_10243811F030786MKRTKIHNVVFQMLVVMIVTGSLQLLLKNGSAAKGGNAMGAKKISLADITGGDDSETGVLKVKYFDDEGADKIFENNNNRILNTINSQHISYN
Ga0114253_1049956Ga0114253_10499562F053092MQSDQGLILCLTHALLVLGALILEPAEMEDTMDDHTVQLFGILIAKELGIATHRIKADEHVPRDHIPLTLIEGDDIGIVVMIEEVLIGLQDTLITTELVAELADTTVIASSDLTDPVAKDTLSEARLLDVFVSIVSY

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.