NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000442

7000000442: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 765094712



Overview

Basic Information
IMG/M Taxon OID7000000442 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052992 | Ga0031325
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 765094712
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size102358025
Sequencing Scaffolds5
Novel Protein Genes14
Associated Families14

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1
Not Available1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F046433Metagenome151N
F080164Metagenome115N
F080166Metagenome115N
F081455Metagenome114N
F081510Metagenome114N
F089057Metagenome109N
F092229Metagenome107N
F092230Metagenome107N
F092232Metagenome107N
F095629Metagenome105N
F095631Metagenome105N
F099452Metagenome103N
F099453Metagenome103N
F105379Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C1816074All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1163Open in IMG/M
SRS051791_LANL_scaffold_14453Not Available1087Open in IMG/M
SRS051791_LANL_scaffold_26253All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes56322Open in IMG/M
SRS051791_LANL_scaffold_33739All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales233400Open in IMG/M
SRS051791_LANL_scaffold_44027All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae696Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C1816074C1816074__gene_156930F046433MIELLTSPGALSELSPVAPPKLLSQARDTSRNNPVTYVVDDGYMGTRTSDPCFRKMRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRNELVFLGMSELHFAATALAKRLRHLLKVDNNPVYVDVGNSLSQCRVKNEMKSSQYILSLVLSKFLDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIIGGDQMRERISVFGAYNNPGAHKVSVLVMAASSNCIDNGIGADSLWGEATYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSTFGCEVDDIAYRAIEGGVLKGERIDRLTLPALVNIVRPYRNGEDFDGLSRFRQLLEKG
SRS051791_LANL_scaffold_14453SRS051791_LANL_scaffold_14453__gene_28645F080164MVGLTLCAAPQVTLRERANAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLVVGDTVAVHRDLMDDFAKRCRATLGLGVRTAPKLFGIKGMHVYGVQKDKSRQAVDEYVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYGSHRGDLLLNAANARPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRL
SRS051791_LANL_scaffold_26253SRS051791_LANL_scaffold_26253__gene_51855F081510MKFDLNAVKATAKTTWVTTKILGKKYAPVILLGAGLVGYGYSVYEGIKSGKKLEATKAKYEQMEADGEEFSRVDVVKDIAKDVAIPVAVATASTASIILGFAIQTNRLKAVSAALAMVTEEHARYRLRAKTVLDEETFKKIDAPLETKTVNVDGEDIEVESIVPNEGDFYGMWFKKSHKYASDSPEYNEGVIKEADNVLTEKMMRKGMLTFAEVLDILGFEVPKAALPFGWTDTDGFYIEWDAHEVWNDDKQEYEVQFYVRWKTPRNLYATTNFHDFVPKKTRKELN
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67466F089057MINVIPLIAKKYNRKGDTSGSLKSLIDDLNCIGDTDDVLLFLTSIPRETKYSLAEAFSVIVSNDEYRNIFRTSIVFLNIDLDYHELLLTAIKSESYDIICMINKAIPTPDLFLAKNNYECLTIALDKLYAIFDKVLGMVVGQIKHTASSKEGRALGIFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEEDIHTVVDDINSRTVLSRYLNNM
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67507F099452MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNVFNLYKEIVNLDDVSLLNDLRKTSWYKDWFISDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFKKVIEVDSYSLYDTLSEENGVDLFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDKKTFKSTELFYIVKNNICGTIFFTLFDRMNEDTRTRAR
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67554F092232MNTQAKFIAEYNDKNRPKFNDKFFNKSDDDIIEDLKDVILSCERNKFYTIKVLNFEVIDDYNEVQKLLIGDETPSISIKDSDLKILKVTYHVACTKDEDTFDVLIAIPRVIDGAYIHLNGNDYFPLFQLVDGSTYNNTTASSAKTQSITLKTNSNAVKMLRNFIDLNTTNEETVRAAMFSVYLFDHKVTLFEYYLARFGWYETLDKFNFEDVIKISDHDLNDPEYYTFAIANAHMKTPFYISAVKSFMDNDRILQSFVASFARAISLYATKKTTLDQIYTTEFWICKLGYNFVSSETSVFTKGNAIIESLENSYDIPTKKRLRLPDHVKEDIYSVLKWMACEFSSIRLKNNLDASSKRIRWSEYIAAMYIMLINVKLRRLPEKHDPNMEAYRIKQQLNTQPMALIAELQKSNLKGFRNMVNDRDSFLQLKYTIKGPSGPGESNSKNVARNVRAIDPSHLGIIDLNTSSASDPGVGGMLCPLNYGVYEWNSFTNEEEPNVWDENFSKMLNIYREEKGYTSAIMLADDAGLELTDTRDPTAVAFDADLLGQTIAKVARTRAFEKQLRPALINMEDSCSIYFEEV
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67568F095629MTFKERMMRELIICICLIGCFSIANANNIEQPKDVKIVHNDDSVILHKKIYQLEKRIERLEELLKKEDK
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67573F105379MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYINPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLDKITQLQPSKENPEIAIHRPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINVGRYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67597F092229MNKEYRFNHIPEVVLRNIRFIRDNNIDIGTGDDVLECMMDINPVVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIFFRYFNSELIDGLFKLGQSTKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVKSAEFTYDLLMASRAEEFNPEIVREIFVKYGLKPNSSRNLYNRINDNLNLFYYIEDYLDEYREEGRFIYGTKEYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67615F099453MLRRKDMNRFDVIELAQQTLTFVYDTFNGKVNTLDPYTRLNFVSGYLDTKTNIARTTPYGCIYVSLEAFADTVERQGFIDTDQIRNLALEIIIHELTHVDQLIDYKYIKFNNGYREEVELKCIKQSCQWILDNIQYIRSLGLVVIPEVYQARLANLGNIIYTPKYPIAIAMAKLEYMLGKKFREFSNNNIEIQYIDRLKTHYSFMVCENRSYINSRNLNDLGERLLNDKQYAVEYLEYGNSKLVIKITQGA
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67617F080166VNILANFENYNKVVEQIFELNYYLTFKLEVTFNTTHKKINTEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTVIEEAINICDPENKMSPFKRLYSTTGDLLDIKCNSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAMTPLLGSNRQNMLK
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67618F081455MENFLTKEISALISKHFEFTNNSDLIKDPIISDDNIIDFPPVEEIGAEKVNMSDIIDCVQEALQQPLENKDTSIAVNFSQMINKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAIRLLNYKMSDQNAAAAFAQFVSEFNPECDPNKRLRYELIRHQGREKDLVIRLSTVINGTTKYYADIYPDLNKIDLDHHLISSAKK
SRS051791_LANL_scaffold_33739SRS051791_LANL_scaffold_33739__gene_67647F095631MASDAVSVNKTLRSYQETVLNTVQNDTLLNANIYDIHQYIENIKKRYVDEDEITLSMGIFGYMGDVNSNALQNAVTMAAEYSNEAIPIKAKFEKNVISHALMLGINKIFAEPATMQAMFVFYEDELILNTVSDTFKFDRNIKIMVGDYEFHLPYDLIIKRIELPTGEYIYTGMYDTTQSNPIITRNSNDVDPYLKPTVRSKIDGRNVVMLLVDLRQYEYTTYHKTIITTNPLESKMLQFEFDNQLAGFDVDVKEYDQPTRKLKPVYNGLNTDGINNFCNYTYIDSSTIRVMFDNSSYLPTANTEVTVNLYTCQGSNGNISYKDSIYFRVKSEKINYDRLNLLVIPTSDAQYGIDKRSIADLKKLIPKEALSRGSVTNSTDINNYFNTIDDDDNKLFFFKKMDNPLARLYYAFVLMDSPTNIIPTNTIPIEAIRRDFDNISDSNYILTAGNIIKYDGTTNASVAYQSSEEELNNARRNQFLYMNPFMCIVNKKPLYVSYYMNIMDVNKLLEFTYVNQDSKVQFVANKMNWYRHYLSERDTYVGDISIMQNIQSDIGLVHRDDPYDPEKITGVDVKVLAVFYTDEKYQVPYRWAEAEFVNYDQNTFIMDYKFKLNTDNKIDKNIKLKINNVYEVGNATRLSPGYMTNNMNMKIFVFAKDVFGYNAGLHKADQIFTADFLEGYSLTNEYTVKYGIDFLYNYSDLIESHIKIRKQDNGQISYIIDRVPVISYDYVNTEERIQDFINNLEKKRIHILECLDVLEDSFGIDIKFFNTYGPSKLFYVNDGVPLNRVNLSMTFKVKFLTTTDKYLTEYIKNDIRKYIEDKSRISDIHIPNIITFITQKYAENVTYFEFLDFNGYGPGYQHIYRKDESIVGRIPEFLNINTIGTENNALDINIIIA
SRS051791_LANL_scaffold_44027SRS051791_LANL_scaffold_44027__gene_88086F092230MVDKLKAHFLKVLLPLFIVCVIFVAFFRQIACGSDGDYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYNNMAYGEGNVEVSDSTSTGEVQNTEEKEARDKVDDAINSMRHLFASAIMVNLRVVELYKILTVCMILIAIAITIGYYSYLK

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.