NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 7000000671

7000000671: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 638754422



Overview

Basic Information
IMG/M Taxon OID: 7000000671
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0053247 | Ga0031302
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 638754422
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 134783148
Sequencing Scaffolds: 14
Novel Protein Genes: 14
Associated Families: 13

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 9
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1
All Organisms → Viruses → Predicted Viral | 1
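
The per-lineage counts can be rolled up to a coarser level and checked against the 14 scaffolds reported under Dataset Contents. The following is a minimal sketch, not part of NMPFamsDB: the counts are copied by hand from the table, and the helper name `top_level_group` is hypothetical.

```python
# Scaffold counts copied by hand from the Dataset Phylogeny table.
phylogeny = {
    "Not Available": 9,
    "All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes": 1,
    "All Organisms → cellular organisms → Bacteria": 1,
    "All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8": 1,
    "All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416": 1,
    "All Organisms → Viruses → Predicted Viral": 1,
}


def top_level_group(lineage):
    """Return the rank directly below 'All Organisms' (hypothetical helper).

    Lineages without any '→' separator, such as 'Not Available',
    are returned unchanged.
    """
    parts = [part.strip() for part in lineage.split("→")]
    return parts[1] if len(parts) > 1 else parts[0]


# Sum scaffold counts per top-level group.
totals = {}
for lineage, count in phylogeny.items():
    group = top_level_group(lineage)
    totals[group] = totals.get(group, 0) + count

for group, count in sorted(totals.items()):
    print(f"{group}: {count}")
print("Total scaffolds:", sum(totals.values()))
```

The total should agree with the Sequencing Scaffolds figure; a mismatch would suggest a transcription error in the copied counts.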

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: National Institutes of Health, USA
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F053092 | Metagenome | 141 | N
F054110 | Metagenome | 140 | N
F072445 | Metagenome / Metatranscriptome | 121 | Y
F072446 | Metagenome | 121 | N
F080166 | Metagenome | 115 | N
F081455 | Metagenome | 114 | N
F081510 | Metagenome | 114 | N
F089057 | Metagenome | 109 | N
F095629 | Metagenome | 105 | N
F103430 | Metagenome | 101 | N
F103432 | Metagenome | 101 | N
F105376 | Metagenome | 100 | N
F105378 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
C4584226 | Not Available | 531
C4584290 | Not Available | 531
C4594818 | Not Available | 561
C4663401 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 963
C4688339 | Not Available | 1459
C4689813 | Not Available | 1516
SRS022143_LANL_scaffold_17204 | Not Available | 756
SRS022143_LANL_scaffold_2853 | All Organisms → cellular organisms → Bacteria | 2252
SRS022143_LANL_scaffold_38401 | Not Available | 1567
SRS022143_LANL_scaffold_39473 | Not Available | 3166
SRS022143_LANL_scaffold_44835 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 2502
SRS022143_LANL_scaffold_4513 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 609
SRS022143_LANL_scaffold_5510 | All Organisms → Viruses → Predicted Viral | 1738
SRS022143_LANL_scaffold_84155 | Not Available | 1387

Sequences

Scaffold ID | Protein ID | Family | Sequence
C4584226 | C4584226__gene_182792 | F054110 | QPTIKKLLKALQMNGRRYVVDVRQSWSKFDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD
C4584290 | C4584290__gene_182810 | F053092 | FGILIAKELGIATHRIKADKHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIASSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK
C4594818 | C4594818__gene_186480 | F072445 | KFIDYTSTYRDNNKEFLRGDMWEFKVLSAPKIVYYPGDDIINARLNSVQVGVDTSVTGIEKRMRGGYAIYQQTNQTTSGNLTLSFVDREDQAITYFLDDWRQKISDRETKYSFRKDDVVMDCKLFITNAQRLDVRELTFYNVIIQDAGIDNNGQAEAESDRSDVTLSAKFEHYSLEFKNL
C4663401 | C4663401__gene_212084 | F081510 | RKIMKLPKLPNMQTIKSTAKSAMVTTKILGKKYAPFVLLGVGLVGYGYSVYAGVKSGKKLEATKAKYEAKDAAGEEYTRMEVVKDVAKDVAIPVAVATASTAAIVLGFAIQTNRLKAVSSALAIVTEEHARYRLRAKEVLDEATFKKIDAPLETKTVELDGQEVEVESIVPNEGDFYGQWFKYSSNYVSDDPDYNESYIKEAETYLVNRMMKKGVLTFGEVLDKLGFDVPRAALPFGWTDTDDFYIEWDAHEVFDDVKQEYDLQFYVRWKTPRNLYATTSFKDFVPKKTRKELN
C4688339 | C4688339__gene_222839 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKSKWNNKLNAPIPMQDHLENNQIGYDSNNSKFYIGLNNQNVLFGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADSISIYNTGSFTGSFQCLIVYPLGSVN
C4689813 | C4689813__gene_223553 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKNKWNNKLNVPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTTDAISIYNTGSFTGSFQCLIVYPLGSVN
SRS022143_LANL_scaffold_17204 | SRS022143_LANL_scaffold_17204__gene_17939 | F103432 | MKLIHSLFSLPLLLVLGGFLCLTACQDDAEPTQRTGLISTDSLFHAAEVYDGKAFEHVVSTTATGLRVSEPRRVVPMLPRQLHVEMEGKTIFRRHNLPSVSAYSFQVLAVGDTIYRQKESDAQFNADLDALFHQSIGIAPRLFGVKELSVLGIDSRGKTRDLGNYSYPLLRGVRIYMAFRSREGVFHEHYEAASVDTFSVKSNWLLKTKAEPSL
SRS022143_LANL_scaffold_2853 | SRS022143_LANL_scaffold_2853__gene_2992 | F105376 | MNKNKKEGNMEPEVSAKEFGALEADVRHIKEGVDKHTITLERIENIARANVTQSQLKTYIAEHEKESEEKYVKRTEIEGVMNFWKLVTSNLAKLFAIALVGLAIYATNNLIQQNKTVTELQEEVQQTVRRK
SRS022143_LANL_scaffold_38401 | SRS022143_LANL_scaffold_38401__gene_41371 | F081455 | METTNLQEINERAAKSLEDLMNFVLYGKKPKTEEKDIPPVEEIGSETVDIIDAAEEAIQQPLQNKDTSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSEFNPECNPNKRLRYELIRHQGREKDLVIRLSTVIDGKTKYYADIYPDLNKIDLDHHLISSA
SRS022143_LANL_scaffold_39473 | SRS022143_LANL_scaffold_39473__gene_42719 | F103430 | MEQITIKAFIGSNNKTKKLEVDKIISTVNANHEAFTLDYPVIGCWRGEVEETAVLYLSDERQKVMNTLNKLKEVLGQEAIAYQIENDLQLI
SRS022143_LANL_scaffold_44835 | SRS022143_LANL_scaffold_44835__gene_49108 | F095629 | MRELIICVCLLGCFSVSNANNVEQPKDVKIVHNDDNVILHKKIYQLEKRIERLEELLKKEDK
SRS022143_LANL_scaffold_4513 | SRS022143_LANL_scaffold_4513__gene_4730 | F080166 | MEVTFNNIIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETTGVIIDWDNYDDLCNIIDEAIDICDPNNKTSPFKRMYSTAGDLLDIKCDSLKVRYLHLEDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGSMARYMAITPLLGNNRQNMM
SRS022143_LANL_scaffold_5510 | SRS022143_LANL_scaffold_5510__gene_5786 | F089057 | MINVIPLIAKKYNRKGDTSGSLKSLISDLNCVTDNDDVLLFLSSIPRETKYSLDDAFDIIVSDDTYSNIFRATLVFLNIDLDYHRLLLNAIKSESYTIICMINKAIPTPDLFLAKNNYECLTIALDKSYAVFDKVLGMVISQIRHTASSKEGKALGLFMTLCILNKDIDKLASLCTGYLATCRSEYMVKDLMNKSAMDAFQYMSEE
SRS022143_LANL_scaffold_84155 | SRS022143_LANL_scaffold_84155__gene_98628 | F072446 | SIPGIVPEGTTYRVPVIPRSVEDKTEKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTLWPLPDYKYNEREL
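
For downstream tools, a row from the sequence listing is usually easier to handle as a FASTA record. The following is a minimal sketch of that conversion, not an NMPFamsDB export feature: the function name `to_fasta`, the `family=` header tag, and the 60-residue line width are all assumptions made for illustration. The example record is the F095629 entry from the table.

```python
def to_fasta(protein_id, family, sequence, width=60):
    """Format one protein as a FASTA record (hypothetical helper).

    The 'family=' tag in the header is an illustrative convention,
    not a format defined by NMPFamsDB.
    """
    header = f">{protein_id} family={family}"
    # Split the sequence into fixed-width lines.
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([header] + lines) + "\n"


# Example row copied from the Sequences table (family F095629).
protein_id = "SRS022143_LANL_scaffold_44835__gene_49108"
family = "F095629"
sequence = "MRELIICVCLLGCFSVSNANNVEQPKDVKIVHNDDNVILHKKIYQLEKRIERLEELLKKEDK"

record = to_fasta(protein_id, family, sequence)
print(record, end="")
```

Concatenating the output of `to_fasta` for every row would give a multi-record FASTA file for all 14 novel protein genes in this sample.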




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice