NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008138

3300008138: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 764892411 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008138 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053176 | Ga0114843
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 764892411 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size131286902
Sequencing Scaffolds20
Novel Protein Genes24
Associated Families17

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4165
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes2
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus2
All Organisms → Viruses → Predicted Viral2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → environmental samples → Prevotella sp. CAG:11241
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
Not Available1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ81

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F054109Metagenome140N
F054110Metagenome140N
F068942Metagenome124N
F073671Metagenome120N
F080164Metagenome115N
F080166Metagenome115N
F081455Metagenome114N
F089057Metagenome109N
F092229Metagenome107N
F092232Metagenome107N
F095629Metagenome105N
F095631Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F103432Metagenome101N
F103436Metagenome101Y
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0114843_100054All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41668283Open in IMG/M
Ga0114843_100165All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae43765Open in IMG/M
Ga0114843_100194All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis41191Open in IMG/M
Ga0114843_100232All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41638042Open in IMG/M
Ga0114843_100243All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis37171Open in IMG/M
Ga0114843_100338All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes31172Open in IMG/M
Ga0114843_100510All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41624923Open in IMG/M
Ga0114843_100947All Organisms → cellular organisms → Bacteria17031Open in IMG/M
Ga0114843_101914All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA41610670Open in IMG/M
Ga0114843_102626All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes8331Open in IMG/M
Ga0114843_106756All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus3650Open in IMG/M
Ga0114843_106770All Organisms → Viruses → Predicted Viral3638Open in IMG/M
Ga0114843_108552All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus2871Open in IMG/M
Ga0114843_111003All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella → environmental samples → Prevotella sp. CAG:11242206Open in IMG/M
Ga0114843_111493All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales2107Open in IMG/M
Ga0114843_112512Not Available1922Open in IMG/M
Ga0114843_114930All Organisms → Viruses → Predicted Viral1584Open in IMG/M
Ga0114843_117241All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA4161350Open in IMG/M
Ga0114843_119276All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ81198Open in IMG/M
Ga0114843_129028All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes781Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0114843_100054Ga0114843_10005410F080166VNILANFENYTKVVEQIFELNYQLTLKMEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGIIIDWDNYDDLCNIIDEAIDICDPNNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGSMARYMAITPLLGNNRQNMMR*
Ga0114843_100054Ga0114843_10005440F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYLGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELVLNTISDTFRFDRDIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYMTYHKTIITNNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGVSNFCNYTYIDSSTIRVMFDNTSYLPTANTEVTVNLYTSQGANGNISYKDSIYFRVKSDKMNYDRLNLLVIPTSDSQYGIDKKSIADLKRLIPKEALARGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNVIKYDGTTNASIAYQASEDELNAARKNEFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFIATKMNWYRHYLSDRDTYFGDISIMQNIQSDIGLVHKDDPHDPEKITGVDIKVLAVFYTDEKYQVPYRWAEAEFVNYDQGTYVMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMANNMHMKIFVFAKDVFGYNAGLHKSDQIFTANFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKVKKQDNGQISYIVDRVPVISYDYVNTEERIQDFINNLEKKRIHILDCLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLSEYIKNDIRKYIEDKSRISDIHIPNIITYITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA*
Ga0114843_100054Ga0114843_1000547F099453MLRRKDMNRFDIIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVAGYLDTKTNIARTTPYGCIYISLEAFADTVELHGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYRDEIELQCVKQSCQWILDNIQYIRSFGLVVIPEVYQARLANLANVAYTPKYPMAIAMGKLEYMLGRKFREFSNNNIEIEYVDRLKTHYTFMVCENRIYINSANLNDLGERLLNDKQYTVEYLEYGNSKLVIKITQGA*
Ga0114843_100165Ga0114843_10016562F073671MEKQQAEHELAELHEKERSLEKALELVREKIRELINYTDKNKEQK*
Ga0114843_100194Ga0114843_10019446F054110VNYQPTIKKLLKALQMNGRRYVVDTRQSWSKYDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKPKQLYKKESEYSSTKQHEVLLFLVKTYKGGD*
Ga0114843_100232Ga0114843_10023230F092232MNTQAKFIADYNDKNRHKFNDKFFTKSDDDIIEDLKDVILSCERNKFYTIKVLGFEVIDDYTEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTAAAAKTQSITLKTNSNAVKMLRNFVDLNTTKEKTLRMAMFSVYLFDHKVTLFEYYLARFGWYETLSKFNFEDIIKISDHDIDDPEYYTFAIANAHMKNPFYISAVKSFVDNDRILQSFIASFAKAISLYATKKTTLDQIYTTEFWVCKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHIKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTPPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDDNFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPEAVAFDAHLLGRTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV*
Ga0114843_100243Ga0114843_10024317F054110VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD*
Ga0114843_100338Ga0114843_10033811F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLGKITQLEPSKENPEIAIHKPVVSVFNWDAEYVKACMNSLREYQVDDNIIARTDEFHNTDDYNELMAGSASTGAYRINVGGYMIDIPKSAIPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFLPY*
Ga0114843_100510Ga0114843_1005103F099452MDKTYTELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSVNIFNLYKDIINLDDVSLLTDLRKTEWYKDWFTNDKRNSDLIDLSRFNFRVLERFEKEEYLRDAEHYDFEGVSEVDSYDLFDTLREDEDIELFKLAAENILINHGFFNNTDYNLYEVPDEYMSNQEVCLYMCLLNTDNLDFMDKKTFDSTLLYNIIKDRICGSVYFTIFDSLNEDTRTRAR*
Ga0114843_100947Ga0114843_1009473F054109MAGRPKSKKGVKVHTAFKIYPDDKARVQVMADKLELSLSAYINKAVLEKVERDEKSED*
Ga0114843_101914Ga0114843_10191414F092229MNKEYRFKHIPEVVLRNVKFIRENNIDIGTGDDVLDCMMEINPVLRQRIYDDYDLAKDVAERRFNSTIEDLDLTTILQKCTTRPYIAILNNIYFRYFNSKLIEDMFKLGESIKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLTDPQVNAVKSAEFTYDLLMAARSENFNPEMVRDIFLKYGLKTNSSRNLYNRMDNNLSLFYYLEDYLEEYVNTGKFTYGSQEYHTIKEFKYLPLMNVLTQLTSSNPSGYILNHKLELVKENK*
Ga0114843_102626Ga0114843_1026261F089057MTNIIPLIAKKYNRKGDTSGTLKSLVDDLVFIEDVDDSLLFITNIPRETKYSIEEVFNIISSNDKYSEVLSNVLSSLNIDLDYHKLLLNAIDSESYKIISLISDNIPTPDLFLSKNNYGCLTTALGKSYTIFDKVLGMVISQLLHTSSKEDKILGLFMTICIINKDIDKLASLCTGYLAITKDEVLVKKLMNESATMAFQYM
Ga0114843_106756Ga0114843_1067562F103436MITTSKDGWCDMSDAEILNSLRDWVLKCDLKYSKREALKKIDSAFALWGGRQYVAAVDLLDENEVYFSKEDWPYYALGIEILKARKYTYFY*
Ga0114843_106770Ga0114843_1067703F092229MNKEYRFNHIPEVVLRNIRFIRENNIDIGTGDDVLECMMDSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSPKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLEEYCEEGKFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG*
Ga0114843_108552Ga0114843_1085523F103436MMATSKYGWGNKSDXXXXSKYSWCNKSDAEILENLKDWVSKCDVSSKREALKKIDSAFALWGARQYVAAVHLLDENEVFLEKSDWPYYALGIEILKARKHEFFNE*
Ga0114843_111003Ga0114843_1110032F080164MRPTSFVLSLLLGVIGLAPCAARQVTLRERALAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKYPLLPSLSGGILTVRVLVVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYESHRGDLFLDVANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS*
Ga0114843_111003Ga0114843_1110033F103432MKLIHSLFSLPLLFVLGGLFCTTACQDDVEPTQRTGLISTDSLIHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVTMDGKTIFRRHTLPSVSAYSLQVVAVGDTIYRQKESDAQFNADLDALFRQSIGMAPRLFGVKELSVGGIDRKGKPRDLGNYSCPLLQGKRRNVNFRTKEGVFHEYFEAASVDTFSVKSNWLLKTKAEPSLYAPSFRLLVWEQPAEGCTKLRFTLTLVDGRSLVAEVPLR*
Ga0114843_111493Ga0114843_1114932F068942MIRKILSLPTLALCFTLCTALFAGCGENNLGFVTEVRWSNVKNPKYGDDINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFPEMDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFKFSVSVTPPAMYMFKVRQPALPAKAQ*
Ga0114843_112512Ga0114843_1125122F099452MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELILSSVNIFNLYKDVINLDDVSLLTDLRKTPWYKDWFIDDKRNSDLIDLSRFNFRSLERFEKAEYLRDAEHYDFEGVIEVDSYSLYDILAEDNGISLFGLAAMNILLNHGFFNNTDYQLYDIPDAYINDQEVCLYMCLLNKDNLDFMDKKTFDDTLLYDIVKDRICGAIYFSIYDSLNEDTRTRAM*
Ga0114843_114930Ga0114843_1149301F081455METTNIQEINMKAAEKLGELFDYVFCGKKPNTEEKDIPPVEEIGAETVDIIDAAEEAIQQPLQNKDTSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFKKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKYLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK*
Ga0114843_114930Ga0114843_1149302F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTIHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLYTVIEEAINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGTNRQNMLK*
Ga0114843_117241Ga0114843_1172411F099453QQTLTFVYNTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCVKQSCQWILDNMQYIRSLGLVVIPEVYQARLTNLTDIIYTPKYPIAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYTVEYLEYGDSKLVIKITQGA*
Ga0114843_119276Ga0114843_1192762F095629MIFKERMMRELIICVCLLGCFSIVNANNVEQPKDVKIVHNDDSVVLHKKIYQLEKRIERLELLLQKEGK*
Ga0114843_129028Ga0114843_1290282F054110VLDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQ

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.