NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 7000000250

7000000250: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764588959



Overview

Basic Information
IMG/M Taxon OID: 7000000250
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052768 | Ga0031258
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764588959
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 70069287
Sequencing Scaffolds: 26
Novel Protein Genes: 29
Associated Families: 29

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 4
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 2
All Organisms → Viruses → Predicted Viral | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium 3519-10 | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Capnocytophaga | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816, Long. (°) -77.1012173, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F042387 | Metagenome | 158 | N
F043990 | Metagenome | 155 | N
F047127 | Metagenome | 150 | N
F051212 | Metagenome | 144 | N
F054109 | Metagenome | 140 | N
F058221 | Metagenome | 135 | N
F061925 | Metagenome | 131 | N
F061926 | Metagenome | 131 | N
F066860 | Metagenome | 126 | N
F071327 | Metagenome | 122 | N
F072446 | Metagenome | 121 | N
F077404 | Metagenome | 117 | N
F080164 | Metagenome | 115 | N
F080165 | Metagenome | 115 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F081510 | Metagenome | 114 | N
F089056 | Metagenome | 109 | N
F089057 | Metagenome | 109 | N
F090516 | Metagenome | 108 | N
F092229 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F094007 | Metagenome | 106 | N
F095629 | Metagenome | 105 | N
F097525 | Metagenome | 104 | N
F103431 | Metagenome | 101 | N
F103435 | Metagenome | 101 | N
F105378 | Metagenome | 100 | N
F105379 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
C2483834 | Not Available | 724
C2507209 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1172
C2508507 | Not Available | 1221
C2510972 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1337
C2517361 | All Organisms → Viruses → Predicted Viral | 1818
C2518429 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1944
C2518857 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 2006
C2519403 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2093
C2521492 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2526
C2522388 | All Organisms → Viruses → Predicted Viral | 2790
C2525242 | Not Available | 4858
C2525448 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 5262
C2525636 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium 3519-10 | 5642
C2525960 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 6765
C2526000 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 6861
C2526082 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium 3519-10 | 7227
C2526528 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 11621
C2526546 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → Capnocytophaga | 11795
C2526644 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 14400
C2526682 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae | 15919
SRS016086_WUGC_scaffold_23220 | All Organisms → cellular organisms → Bacteria | 3517
SRS016086_WUGC_scaffold_24580 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 6586
SRS016086_WUGC_scaffold_25924 | Not Available | 590
SRS016086_WUGC_scaffold_26343 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 5114
SRS016086_WUGC_scaffold_26577 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae → Chryseobacterium group → Chryseobacterium → Chryseobacterium gleum | 10744
SRS016086_WUGC_scaffold_8246 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 646

Sequences

Scaffold ID | Protein ID | Family | Sequence
C2483834 | C2483834__gene_94443 | F072446 | RPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTEKEYNDMELGKEAHLVFRATVHGDTINRQKRELEAISRQLNALTLTSIGTSPVLCGVKSIEAVGVAERGNTYDLSREMKLRIRDYFGRVKYRSSGIVTLNCENTESMTAKYVVPLARIREDELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPNKSVLENYQQYGFEREA
C2507209 | C2507209__gene_109008 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPIVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY
C2508369 | C2508369__gene_109819 | F103435 | DKDLMVEYGDYKDYWDLKNQQDALPENIFVAELTSRDYPRNPWNYVSQLISKLTYSYLIDNPEFENIFSEILFNQSEEEFYEFYKAIDRFYNGSEIFIIVSNDEYSDMVTQMMCNVIRRMYGIHPQVIYDIDDILNIRDDIDFSPQGAQLAYLQRSAYYKLEAKRSLEPLQIWYPFDMNTYTNALE
C2508507 | C2508507__gene_109910 | F080164 | MVGLTLCAAPQVTLRERANAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLVVGDTVAVHRDLMDDFAKRCRTTLGLGVRTAPKLFGIKSMHVYGVQKDKSRQAVDEQVTLHLPGFEKAEKPLHYKGQAGQLVLCEYYESHRGDLLLDAANARPEIFGELCPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS
C2510972 | C2510972__gene_111690 | F095629 | LIICLCLLGCFGIANANNIEQPKEVKLVHNDDSVILHKKIYQLEKRIERLEELLKKEGK
C2517361 | C2517361__gene_116936 | F089057 | MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVVGQIKHTASSKEGKALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHVVVDDINSRTVLSRYLNKI
C2518429 | C2518429__gene_117849 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSFARGFITGIGGWVLALLAPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDKE
C2518857 | C2518857__gene_118248 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTVLLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR
C2519403 | C2519403__gene_118753 | F081510 | MKFDLNAVKATAKTTWVTTKILGKKYAPVILLGAGLVGYGYSVYEGIKSGKKLEATKAKYEEMEAAGEEFSRVDVVKDIAKDVAIPVAVATASTASIILGFAIQTNRLKAVSAALAMVTEEHARYRLRAKTVLDEETFKKIDAPLETKTVNVDGEDIEVESIVPNEGDFYGMWFKKSHKYASDSPEYNEGVIKEADNVLTEKMMRKGMLTFAEVLDILGFEVPKAALPFGWTDTDGFYIEWDAHEVWNDDKQEYEVQFYVRWKTPRNLYATTNFHDFVTKKTRKELN
C2521492 | C2521492__gene_120868 | F094007 | MKSKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAINACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA
C2522388 | C2522388__gene_121863 | F081455 | VLFKAKEKHIMEPLGKKSTKLMKEVLDNIILKSKKDFPPVEEIGAETVDIIDAAEEAIQQPLQNTDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFKKIKESNTDKVVRAVRLLNYKMSDQNAAVAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK
C2522388 | C2522388__gene_121864 | F080166 | VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTVIEEAINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGSNRQNMLR
C2525242 | C2525242__gene_125923 | F054109 | MAGRPKTKRGMKVHTAFKIYPDDKARAQAMADKLDMSLSAYINKAVLEKVERDEKSEA
C2525448 | C2525448__gene_126339 | F043990 | MIKKFGIIFIFGVIILGIAVYTNHKIERSAIEREFGVNMSNMNIDEKYRKEEWAPNGDGEKTIILTYDQLDSSFTKLNKLPIKEDLPPNGIPKQFLNIANGYYKYVVDENDDRDFGILIVDTTRKEICIYNQIF
C2525636 | C2525636__gene_126750 | F051212 | MKKFFFIFVLYWLHSCNGTEKTMTTSPDTQKTSISEKQNAEKIERIIYSQTGGDTGGKNVHLVITKDSIIYRLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTNKKEYSKTNIQANKTWDYITKQIIDIKFSQLYKHLNLEK
C2525636 | C2525636__gene_126752 | F047127 | MKKTFAFILLSIISLAKAQLTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERREAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDKPEIAKEVDQMVGDRDIREILEKYNSK
C2525960 | C2525960__gene_127510 | F042387 | MKRIILFFMAGLFLVSCSRENDKMTDETLANSAKMQLPTKVTIAENKKVILKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKTNYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYILNGENITKIHHEISKNVPGGHFYLNPNNMVVIDEEVTYDAKNSPYKNIKGFPVLAVEFCGLDEDENENTIADYLNFRWISHNPTLIQKSINLYGSGADSSEYKFQYEYKNNFPIKTKLNINNQTVITMVYKYNK
C2526000 | C2526000__gene_127589 | F080165 | MKSIKILLLLLSLTACNNKTKTISTLDLEKVVINYKELPAPVKKKVFPPNNGLITFGEENREEYESFQETNNPKKYEYYTKQDPQLAWVHYPYIRNKKTKQEYSIDKDGPMGGRYIIYGDSLYISNHYNIYEKDSLKYTFTRYILR
C2526082 | C2526082__gene_127815 | F061926 | MMIKTAKHIKTFLASVLLLIFVMNVSGLFVQLHHQETHQKIEKIAECSDKVCYHKAHLQTKNDCDCGFLCTLNYFYILPEKPQAEFHVNEYFSYFSSYKIFVSERIILLWQSRAPPVFS
C2526528 | C2526528__gene_129294 | F061925 | MSWEYSINLDSEESVSSVVTDLKICELFSSSTTDYIDWKNPESIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI
C2526546 | C2526546__gene_129381 | F097525 | MQQKKIMFKIQNTYQKIIFSIHGHRERKDNFEDWLKVEVKVKDDLEGKYYTRVSECMLFSEVLDLLEWFEQISADKEKSTEIDFIEPELAFEYQNKKLTVLLCYDIAPVSYGEEPYQLTFSLDDKTLAMIIKELGEAVASFKKV
C2526644 | C2526644__gene_129869 | F090516 | MKSNLSLKLFLAFERYFIENDEVISLDKSSEFTDVIAGIGFLPQDMSENTDFKKKTIEKYGFSSSTALADDFRKRVLNIDEPIPENFEKDGIGYVYTVISGYDTFYNRMYMFGIHCFNGDFNVTYYDLDNDAGTGDYYEEHELYSQAKGYRWLDPESDYYEDVLAWEALNKLATDIYFHLEDKLDVKIDIKPIPEEEKVVPTQEHLAKFLAFCGVEQDVIDENKERLLRALEEYTPDEYEGVSAAMAEMMEYSYKIQRAEPVIEIIREYGVCRFSDWKFYAEELEEYILDLADFSDWKWEYPADTYSADLFPYMRKQLSLYHLWLCHLDEGADAYLFLLFSEKDMPEIMKLARILDLPLKAYFK
C2526682 | C2526682__gene_130071 | F058221 | MRNKMGKIKNFQDLKNQKEELKAEIKEIESVLSFENPRKSFGVITNGVTEKYLGGMMDSSLAQNAFFLADKFLFPSLKIGSAKLLSNALLKRVRPSMKKTLIGLGVAVLTPIVIMQIKKRLDDFQQRETAKSLSKLI
SRS016086_WUGC_scaffold_23220 | SRS016086_WUGC_scaffold_23220__gene_23088 | F089056 | MNIFCKIILPLLCIISCSKRKEADNTMVLEKNHTFSLWNNDSLGCKHERTIEMGEELYNTFKKSNKNDSILLKEYLGIPNRRFKDKEEIVFMYYINSCCDNGQLLEECDVSFIAITFTNKNEIFFRKGIQ
SRS016086_WUGC_scaffold_24580 | SRS016086_WUGC_scaffold_24580__gene_25265 | F092232 | MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIRISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASTKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSRMLNIYREEKGYTSAIMLADDAGLELTDTRDPAAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
SRS016086_WUGC_scaffold_25924 | SRS016086_WUGC_scaffold_25924__gene_27621 | F105378 | EVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNAPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAVSIYNTGSFTGSFQCLIVYPLGSVNE
SRS016086_WUGC_scaffold_26343 | SRS016086_WUGC_scaffold_26343__gene_28555 | F092229 | MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPIVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSSKVLDLAIKYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGKFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG
SRS016086_WUGC_scaffold_26577 | SRS016086_WUGC_scaffold_26577__gene_29202 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMLLLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILACAGMITYVYMFLKSNQILTLKVLIIALAAALLFEYAYPWRIIFG
SRS016086_WUGC_scaffold_8246 | SRS016086_WUGC_scaffold_8246__gene_7425 | F103431 | DALVVGMLFFIQLFLQGIAWRVAITHFLHAERGNAAAAAFDGAFGEDIADCHTEDDNDKNAESKEEGFHVCIPED

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"