NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007096

3300007096: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 604812005



Overview

Basic Information
IMG/M Taxon OID: 3300007096
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052575 | Ga0102538
Sample Name: Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 604812005
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size (bp): 92640720
Sequencing Scaffolds: 13
Novel Protein Genes: 14
Associated Families: 12

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 6
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella parvula | 1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1
Not Available | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816, Long. (°) -77.1012173, Alt. (m) N/A, Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F033081 | Metagenome | 178 | Y
F046433 | Metagenome | 151 | N
F051210 | Metagenome / Metatranscriptome | 144 | Y
F054110 | Metagenome | 140 | N
F067846 | Metagenome | 125 | Y
F073671 | Metagenome | 120 | N
F077405 | Metagenome | 117 | N
F078842 | Metagenome | 116 | N
F094007 | Metagenome | 106 | N
F095630 | Metagenome | 105 | N
F095633 | Metagenome | 105 | N
F103433 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length (bp)
Ga0102538_100009 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 151742
Ga0102538_100049 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 42841
Ga0102538_100260 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 19129
Ga0102538_100411 | All Organisms → cellular organisms → Bacteria | 15404
Ga0102538_100540 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 13596
Ga0102538_100686 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 12186
Ga0102538_101267 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 9229
Ga0102538_102800 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 5920
Ga0102538_102930 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella parvula | 5731
Ga0102538_105430 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 3623
Ga0102538_108469 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 2478
Ga0102538_119397 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1014
Ga0102538_123288 | Not Available | 814

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0102538_100009 | Ga0102538_10000968 | F051210 | MNNYSQILGNSAMMDALRASSVSAEDARLRGNEYAKMFSRNEEMMDVFGLGGNNTNLLQKTFSGYSETPLLSTQYFNASVASYVSSFAGYMSIERDFDQPNGLFYWFDVLGVTDLRSVLPNLGPDQYQDVQVMGGFELPVTVNAGTAAYSPLVGRKLIPGTVRVKVEDGTGKKYELIDNGQGSFMAVAGVLKTGTVNYLNGKIDFELTTAVPANGSITIVGKEDTTGTPSCTNGASNAHANDKRFIAKMQQIALNTVPDMLVAEYNIAALGAMKKATGSDMATFLFTKLRELYTKTINFKLVSTLEKGYAGNVMDDLDLSNAPASLASKFMDYRSRVDLFDAYLINVESALATKAVKGVTTTAYIAGNQAANQFQKGGVIGKFERNTKMTYISDLLGWYDGVPVLRSTDIQEKAGEGTFYAIHKTQDGQMAPLARGIYMPLTDTPTIGNYNNPTQMASGIYYQEGVRYLAPELVQKVSFKFGF*
Ga0102538_100049 | Ga0102538_10004949 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLGDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDAALRAASECGIEMNRNIAEVDRSELSKKLREVQISPWRKQIDMAYRKAHNESSVNTLFHTSYVKYSKESDDLFREYVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEVSISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSVRIRQMVNSIELDIYEVNS*
Ga0102538_100260 | Ga0102538_10026032 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYNEEEYKLTFPHKYKRGKTFKAKQLYKKESEYSSTKQHEVLLFLVKTYKGGD*
Ga0102538_100411 | Ga0102538_10041112 | F103433 | MIFVMKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFAVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYNRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE
Ga0102538_100540 | Ga0102538_1005401 | F033081 | MYTDITVVHRPKKGVMAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAI
Ga0102538_100686 | Ga0102538_10068613 | F046433 | MIELPTSPDALSELSPVAPPKLLSQAQDASRGNLMVYIKADNYLGTETSDPSFMKSRCETTEYEAINDFVQFIEMIKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRCHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFSGDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVKERIAGFEVDNDPEDHEASVLVMAASGDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIGELSLPALANIVRPYRNGEDFDGLSRFRQLLEKE*
Ga0102538_101267 | Ga0102538_1012676 | F095633 | MGNYKNSAEVWRREGLTEDELRTMGTLAMEATEKLKKTIIRKETVLLGSVPFGSWGEFAKAVQEMAAHSYEPIPVEINTKRLIAKAFLDDRGEMSVEEHSVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEYPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN*
Ga0102538_102800 | Ga0102538_10280012 | F073671 | LGERMNKEQAEHELAELHEKERSLEKALELVREKIRELVNYMDKNKGQK*
Ga0102538_102800 | Ga0102538_1028006 | F067846 | MSIIADWERQEFNKWDKQCSKEDDYNRAVEMEIEAIKENISNLDDDVICAFREKMLDYGEVINAFDDDTFNDDEFIKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILND*
Ga0102538_102930 | Ga0102538_1029307 | F054110 | MNGRRYVVDTRQSWSKFDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKAYKGGE*
Ga0102538_105430 | Ga0102538_1054301 | F033081 | ITVVYRPKKGVMAWLFRRAMPQDPRPVFVWPRLVAAIGNVGYFSRRGFSVLAVGLIIVTIATIKILLFVPGLNQSVVSLLTRGLETFLPTGWATIAAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDTLLLLALIEEQAFRSGSEKWNWRERVRTSVCFGLLHIANIWYSFAAGIALSATGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL*
Ga0102538_108469 | Ga0102538_1084692 | F094007 | MKLKTVEVLELARPSRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYGDEDFTKRNRIIVEMCDLFGGIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLNYKKVRTAISLTKAGKKRSGWNKGCIDNNA*
Ga0102538_119397 | Ga0102538_1193972 | F095630 | DTCFMDVEFYSPEAQDAYNEALRLFDKWDIPRSAALRAATECGIEMNRLVAELAQDELLHELSAVQSSPWRQQSDITYRKADSKSSVNTLFRMPCIKYGVESKNLFPEYVLEYSVDEKYIQSPVDQALAAVVMQAVALNFLVMIREKHTVYDRGDQWSEASKSVGYRMFLGLIKKDDSIVHQLKCEFMEYVQFLSRSPFCDNLQAALVRCSHNYEQVLLDRGALNSILGGCVIGGKGWLKMADNTLIGQILKAIEIDVYDI*
Ga0102538_123288 | Ga0102538_1232882 | F077405 | QGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQK*
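
The rows of the Sequences table can be exported to FASTA for use with alignment or search tools. The following is a minimal sketch, not part of NMPFamsDB itself: the `ProteinRow` type, the `to_fasta` helper, and the header layout are hypothetical choices made for illustration. It assumes that a trailing `*` is the conventional marker for a translated stop codon and strips it. The two example rows are copied from the table.

```python
from typing import Iterable, NamedTuple


class ProteinRow(NamedTuple):
    """One row of the Sequences table (hypothetical container)."""
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


def to_fasta(rows: Iterable[ProteinRow], width: int = 60) -> str:
    """Render rows as FASTA text, wrapping sequences at `width` residues."""
    records = []
    for row in rows:
        # Assumption: a trailing '*' marks a stop codon and is not a residue.
        seq = row.sequence.rstrip("*")
        # Header layout is an illustrative choice, not a database convention.
        header = f">{row.protein_id} family={row.family} scaffold={row.scaffold_id}"
        lines = [seq[i:i + width] for i in range(0, len(seq), width)]
        records.append("\n".join([header, *lines]))
    return "\n".join(records) + "\n"


rows = [
    ProteinRow("Ga0102538_102800", "Ga0102538_10280012", "F073671",
               "LGERMNKEQAEHELAELHEKERSLEKALELVREKIRELVNYMDKNKGQK*"),
    ProteinRow("Ga0102538_123288", "Ga0102538_1232882", "F077405",
               "QGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQK*"),
]
fasta_text = to_fasta(rows)
print(fasta_text, end="")
```

Sequences without a trailing `*` (for example Ga0102538_10041112) pass through unchanged, since `rstrip` only removes the marker when it is present.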


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice