NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008362

3300008362: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 160400887 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008362
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053223 | Ga0115107
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 160400887 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 128921695
Sequencing Scaffolds: 16
Novel Protein Genes: 26
Associated Families: 22

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → Neisseria mucosa | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 2
Not Available | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2
All Organisms → Viruses → Predicted Viral | 1
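
The per-lineage counts above can be rolled up into broad groups to see how the scaffolds divide between viral and bacterial assignments. The following is a minimal sketch, not part of NMPFamsDB: the helper names (`top_level_group`, `tally`) are hypothetical, and it assumes that the first rank after "All Organisms" (skipping "cellular organisms") is a sensible grouping key.

```python
from collections import Counter

# (lineage, scaffold count) pairs copied from the Dataset Phylogeny table.
PHYLOGENY = [
    ("All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales", 1),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → Neisseria mucosa", 1),
    ("All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes", 3),
    ("All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis", 1),
    ("All Organisms → cellular organisms → Bacteria", 2),
    ("All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis", 2),
    ("Not Available", 2),
    ("All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae", 1),
    ("All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria", 2),
    ("All Organisms → Viruses → Predicted Viral", 1),
]


def top_level_group(lineage: str) -> str:
    """Return the first informative rank of a lineage string (assumed grouping key)."""
    parts = [p.strip() for p in lineage.split("→")]
    if parts[0] != "All Organisms":
        return parts[0]  # e.g. "Not Available"
    for rank in parts[1:]:
        if rank != "cellular organisms":
            return rank
    return parts[0]


def tally(rows):
    """Sum scaffold counts per top-level group."""
    counts = Counter()
    for lineage, n in rows:
        counts[top_level_group(lineage)] += n
    return counts


totals = tally(PHYLOGENY)
for group, n in totals.most_common():
    print(f"{group}: {n}")
print("Total scaffolds:", sum(totals.values()))
```

The total should agree with the "Sequencing Scaffolds" value reported under Dataset Contents.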

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: National Institutes of Health, USA
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F018385 | Metagenome | 235 | Y
F032313 | Metagenome | 180 | N
F046433 | Metagenome | 151 | N
F054110 | Metagenome | 140 | N
F066860 | Metagenome | 126 | N
F074985 | Metagenome | 119 | N
F078842 | Metagenome | 116 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F081510 | Metagenome | 114 | N
F089057 | Metagenome | 109 | N
F092229 | Metagenome | 107 | N
F092230 | Metagenome | 107 | N
F092232 | Metagenome | 107 | N
F095629 | Metagenome | 105 | N
F095631 | Metagenome | 105 | N
F095633 | Metagenome | 105 | N
F099452 | Metagenome | 103 | N
F099453 | Metagenome | 103 | N
F103431 | Metagenome | 101 | N
F103433 | Metagenome | 101 | N
F105379 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0115107_100006 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 236647
Ga0115107_100009 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria → Neisseria mucosa | 180404
Ga0115107_100107 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 56959
Ga0115107_100223 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 38708
Ga0115107_100279 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis | 34230
Ga0115107_100437 | All Organisms → cellular organisms → Bacteria | 26568
Ga0115107_101220 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 13529
Ga0115107_101875 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 9999
Ga0115107_103649 | All Organisms → cellular organisms → Bacteria | 5699
Ga0115107_104032 | Not Available | 5190
Ga0115107_104138 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 5086
Ga0115107_104348 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 4854
Ga0115107_106958 | Not Available | 3100
Ga0115107_107764 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2768
Ga0115107_110917 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1949
Ga0115107_113793 | All Organisms → Viruses → Predicted Viral | 1539

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0115107_100006Ga0115107_100006115F092232MNTQAKFIADYNDKNRPKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFENIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDAHLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0115107_100006Ga0115107_10000615F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELVLNTISDTFRFDRDIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGVQNFCNYTYIDSSTIRVMFDNTSYLPTANTEVTVNLYTSQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYISDRDTYFGDISIMQNIQSDIGLVHKDDPHDPEKITGVDIKVLAVFYTDDKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIVDRVPVISYDYVNTEERIQDFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIVTYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0115107_100006Ga0115107_100006160F099452MDKTYTELLQETLSKIYKLKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLTDLRKTPWYKDWFTHSHENASLIDLSKFNFRSLERFEKEEYLKDVESYDFEAATTVDSYSLFDTLIEDNSVDLFKLAAENILINHGFFNNTDYNLYDIPDKYMEDTDVCLYMCLLNSDNMDFMDKKTFKSTELFYIVKNNICGTIFFTLFDRMNEDIRTRAR*
Ga0115107_100006Ga0115107_100006199F089057MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIGDANDSLLFITNIPRETKYSIEEVFNIITSNDKYSEILNNVLSSLNIDLDYHKLLLNAINSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILGLFMTICIINKDIDKLASLCTGYLAITKDELAVKDLMNESAMTAFKYMSDEDIHDVVDDINSRMVLSRYLNQM*
Ga0115107_100006Ga0115107_10000640F081455MENFLAKEIGTLIGKHFGFVDNIDLDKDPIITSNNIIDIPPVEEIGMQNVEIIDSAEEAIQQPLANTDSSIAVNFSQMINKPEEVKTELVSTPDNGEAKVNVVFPKNEHILGNYVDYESFNKIKESNTDKIVRAVRLLNYKMADQNAAMKFGQFVSEFNYECDPNKRLRYELIRHQGREKDLVVRLSTVINGTTKYYVDIYPDLNKIDIDHHLISSARK*
Ga0115107_100006Ga0115107_10000641F080166VNILANFENYTKVVEQIFELNYQLTLKMEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYKYRMRLSPRGETVGVIIDWDNYDDLCNIIDEAIDICDPNNKTSPFKRIYSTAGDLLDIKCDSLKVRYLHLDDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGSMARYMAITPLLGNNRQNMMRS*
Ga0115107_100006Ga0115107_10000644F099453MNRFDIIELAQETLIFVYNTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVEAHKFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLTNVVYTPKYPMAIAMGKLEYMLGKKFREFSNNNIEIEYVDRLKTHYTFMVCENRIYINSANLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA*
Ga0115107_100006Ga0115107_10000666F092229MNKEYRFKHIPEVVLRNVKFIRENNIDIGNGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFNSTIEDLDLTTILQKCTTRPYIAILNNIYFRYFNSKLIEDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLTDPQVNAVKSAEFTYDLLMAARSENFNPEMVRDIFLKYGLKTNSSRNLYTRMDNNLSLYYYMEDYLDEYVKNGKVTYGSQEYRTIKEFKYLPLMNVLTQLTSSNPSGYILNHKLELVKENK*
Ga0115107_100006Ga0115107_10000692F105379MVIQFQLSQSDIESLLSISKLLKCDKILYDRNYINSIIGVGPERSYFQTTSYMIDLDPSINNLLVNSLDLKNLSKATDGADITKTNVPVFDWDTVYIKSCMNSLREYQVDSHIIAKDDNFHESNCYSELMAGSASTGACRINVDKYLIDIPKSAMPTLKSDHVEAIVYEVPNRNFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0115107_100006Ga0115107_10000697F095629MEVFKMRELIICACLFGCFGVANAATPIEQPKEVKVVHNDDSVALHKKVYKLEQRVERLEKLLLEKEGK*
Ga0115107_100009Ga0115107_10000916F103431MIDLDALIVGMLFFIQFFLQGIAWRVAIAHFLHAERGNAAAAAFDGAFGEDIADCHAEDDNDKDAESQKEGFHVCIPEG*
Ga0115107_100107Ga0115107_10010736F081510MKLPNMKAIKSAAKHSYTVSKILAKKYAPVALVTTGLVGYGVAVYKGIQSGKKLEATKAKYEAKDEAGEEYTRLDVIKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTMVTEEHARYRLRAKEVLDEETFKKIDAPIETKKVEIDGKEVEVESIVPKEGDFYGRWFKYSRHYASDDPDYNEAWVKEVDNMLTQKINTQSGGGMLTFAEVLDALGFEVPKAALPFGWTDTDGFYLEWDTHEVWNEDKQEHEPQIYVRWQTPRNLYSTTNLRDIIPGRKELA*
Ga0115107_100223Ga0115107_10022348F074985MMELTDGGWYKTPRIIKGKDFLAHIHDTYASGNAMYVEFKASEGEVRILEYRQLYEVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQQAWKAIHMGSTKRFNLEQFDQLWLDQTFQKLHPVIVNHDGKFWHVMGLKLDVDADGSFWGLYLKRQDSDFMKEIRMPLTQKFIYNPISGSWSLDDPTQEIKDLEEIKQTLRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDPTTGEQKYLLDHIKAMYID*
Ga0115107_100279Ga0115107_10027920F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIINRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKTYKGGE*
Ga0115107_100437Ga0115107_10043710F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKKVSIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLSFSPWIAGYARNIYWIEPLLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLAPIIFFELIHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPESYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYMLIGLWADYVVKRTVKYE*
Ga0115107_101220Ga0115107_10122016F046433MIELPTSPDALSELNSVTPPDLTLQARDTSRNNPVTYVVDDGYMGTRTSDPRFRKMRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRNELVFLGMSELHFAATALAKRLRHLLKVDNNPVYVDVGNSLSQCRVKNEMKSSQYILSLVLSKFLDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIIGGDQMRERISVFGAYNNPGAHKVSVLVMAASSNCIDNGIGADSLWGEATYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSTFGCEVDDIAYRAIEGGVLKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLEKG*
Ga0115107_101875Ga0115107_10187513F095633MRNYENSTEVGRREGLTEGELRTMGTLAMEATEELKKTTVRKETVLLGSVPFGSWDEFAKAVQEMAAHSYEPIPVKINTKRLIATAFLDDRGEMSVEENSVPEEVFIDLSRTRCVVDADRSHKSYEFTCPVLKKYPDGELYPIREAHVISAIDVNGSQEVDFKII*
Ga0115107_103649Ga0115107_1036491F078842MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAQDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGSLKKDDKIINQLGCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSVLNAILGGCVIGGKGWLEMASSAQIRRMINSIELDIYEVNS*
Ga0115107_104032Ga0115107_1040323F054110VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYNEEEYKLTFPHKYKKGKTFKPKQLYKKESEYSSTKQHEVLLFLVKTYKGGD*
Ga0115107_104138Ga0115107_1041381F018385MMADLVNSWLPYQELSIEKDRDPVTDDEIIYGNNVKHFTLTIYSPEGRISKYWNARILQDQLGRCRIACPRDGKILCFAWFEWTSYMFSHDGLNELVFMPRTNSRLPSTLWNTKEVK*
Ga0115107_104348Ga0115107_1043483F092230MVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGDYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYNNMAYGEGNVEVSDSTSTGEVQNTEEKEARDKVDDAINSMRHLFASAIMVNLRVVELYKILTVCMILIAIAMTVGYYSYLKPETVYEFYCKLRRKEKYPSDENLVKRIGFLVIILPPCMLFLLIV*
Ga0115107_106958Ga0115107_1069581F032313MYRFLILLFALTLMACDNNTPQEKPHEQEKHEVPVPVSKPQFDEVGERIWYGRTPAMRLDSTDYGAGLTSVFGMRTSSIPKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWITTTEFTCRNGVIVLHRQGIDVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0115107_106958Ga0115107_1069582F032313MYRLLILLFAITLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGVDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK*
Ga0115107_107764Ga0115107_1077643F066860MTTKKQKLQKQQAIDTWIVIALWVSAIWFSFARGFITGIGGWVLALLAPWALIVSCICLAIISRQMKKRHVSKDHLTTIVRASFIVMSISLFICGLAMPDFSDTETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLLVVVTFGIAEDRE*
Ga0115107_110917Ga0115107_1109171F046433VEPYVPDHIKDDARELRNELVFLGMSELHFAATALAKRLRYYLEVDNKPVYIDVGNSLSQCRVKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIIGGDQVKERIAGFEAYNNPGAHKVSVLVMAASSKCIDNGIGADPLWGKATYPVEAYYRLKNDHDDWGMSRVTGIHSSTDRVFGCEVDDIAYCAIDGGILKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLEKG*
Ga0115107_113793Ga0115107_1137932F054110VNYQPTIKKLLTALRMNGRRYVVDTRQSWSKYDKPCKIYIVSRMYNEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGE*
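
Each row in the Sequences listing runs its four fields together (scaffold ID, protein ID, family, sequence) with no delimiter. The sketch below shows one way to split such a row and emit a FASTA record. It is an illustration only, not an NMPFamsDB tool: the names `SequenceRow`, `parse_row`, and `to_fasta` are hypothetical, and the pattern assumes what the rows here suggest, namely that the protein ID is the scaffold ID followed by a gene number, that family IDs are `F` plus six digits, and that the sequence is uppercase letters with an optional trailing `*`.

```python
import re
from typing import NamedTuple, Optional


class SequenceRow(NamedTuple):
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


# Assumed layout: <scaffold><scaffold + gene number><F + 6 digits><residues>[*]
ROW_RE = re.compile(r"^(Ga\d+_\d+)(\1\d+)(F\d{6})([A-Z]+\*?)$")


def parse_row(line: str) -> Optional[SequenceRow]:
    """Split one fused row into its four fields; return None if it does not fit."""
    match = ROW_RE.match(line.strip())
    return SequenceRow(*match.groups()) if match else None


def to_fasta(row: SequenceRow, width: int = 60) -> str:
    """Format a parsed row as a FASTA record, dropping the trailing '*' if present."""
    residues = row.sequence.rstrip("*")
    header = f">{row.protein_id} family={row.family} scaffold={row.scaffold_id}"
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return "\n".join([header, *lines])


# One row from the listing above (family F095629).
EXAMPLE = (
    "Ga0115107_100006" "Ga0115107_10000697" "F095629"
    "MEVFKMRELIICACLFGCFGVANAATPIEQPKEVKVVHNDDSVALHKKVYKLEQRVERLEKLLLEKEGK*"
)

row = parse_row(EXAMPLE)
if row is not None:
    print(to_fasta(row))
```

Matching the protein ID against the scaffold ID with a backreference avoids guessing where the scaffold ID ends, which matters because gene numbers vary in length.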




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice