Basic Information | |
---|---|
IMG/M Taxon OID | 3300007096 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052575 | Ga0102538 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 604812005 |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 92640720 |
Sequencing Scaffolds | 13 |
Novel Protein Genes | 14 |
Associated Families | 12 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 6 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 1 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella parvula | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
Not Available | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F033081 | Metagenome | 178 | Y |
F046433 | Metagenome | 151 | N |
F051210 | Metagenome / Metatranscriptome | 144 | Y |
F054110 | Metagenome | 140 | N |
F067846 | Metagenome | 125 | Y |
F073671 | Metagenome | 120 | N |
F077405 | Metagenome | 117 | N |
F078842 | Metagenome | 116 | N |
F094007 | Metagenome | 106 | N |
F095630 | Metagenome | 105 | N |
F095633 | Metagenome | 105 | N |
F103433 | Metagenome | 101 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0102538_100009 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 151742 | Open in IMG/M |
Ga0102538_100049 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 42841 | Open in IMG/M |
Ga0102538_100260 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae | 19129 | Open in IMG/M |
Ga0102538_100411 | All Organisms → cellular organisms → Bacteria | 15404 | Open in IMG/M |
Ga0102538_100540 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 13596 | Open in IMG/M |
Ga0102538_100686 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 12186 | Open in IMG/M |
Ga0102538_101267 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 9229 | Open in IMG/M |
Ga0102538_102800 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 5920 | Open in IMG/M |
Ga0102538_102930 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella parvula | 5731 | Open in IMG/M |
Ga0102538_105430 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 3623 | Open in IMG/M |
Ga0102538_108469 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 2478 | Open in IMG/M |
Ga0102538_119397 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1014 | Open in IMG/M |
Ga0102538_123288 | Not Available | 814 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0102538_100009 | Ga0102538_10000968 | F051210 | MNNYSQILGNSAMMDALRASSVSAEDARLRGNEYAKMFSRNEEMMDVFGLGGNNTNLLQKTFSGYSETPLLSTQYFNASVASYVSSFAGYMSIERDFDQPNGLFYWFDVLGVTDLRSVLPNLGPDQYQDVQVMGGFELPVTVNAGTAAYSPLVGRKLIPGTVRVKVEDGTGKKYELIDNGQGSFMAVAGVLKTGTVNYLNGKIDFELTTAVPANGSITIVGKEDTTGTPSCTNGASNAHANDKRFIAKMQQIALNTVPDMLVAEYNIAALGAMKKATGSDMATFLFTKLRELYTKTINFKLVSTLEKGYAGNVMDDLDLSNAPASLASKFMDYRSRVDLFDAYLINVESALATKAVKGVTTTAYIAGNQAANQFQKGGVIGKFERNTKMTYISDLLGWYDGVPVLRSTDIQEKAGEGTFYAIHKTQDGQMAPLARGIYMPLTDTPTIGNYNNPTQMASGIYYQEGVRYLAPELVQKVSFKFGF* |
Ga0102538_100049 | Ga0102538_10004949 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLGDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDAALRAASECGIEMNRNIAEVDRSELSKKLREVQISPWRKQIDMAYRKAHNESSVNTLFHTSYVKYSKESDDLFREYVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEVSISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSVRIRQMVNSIELDIYEVNS* |
Ga0102538_100260 | Ga0102538_10026032 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKIYIVSRMYNEEEYKLTFPHKYKRGKTFKAKQLYKKESEYSSTKQHEVLLFLVKTYKGGD* |
Ga0102538_100411 | Ga0102538_10041112 | F103433 | MIFVMKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFAVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYNRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE* |
Ga0102538_100540 | Ga0102538_1005401 | F033081 | MYTDITVVHRPKKGVMAWLFRRAMPQDTRPTFVWSRLVAEIENAGYFSRWKFSILAVGLIIMTIATIKMLLFVPGLNQSAVSLLTRGLETFLPTRWATATAWTVGMAGVFLMGDLTNYTPSQKILHKIKATRYEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHIMNIWYNFATGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAI |
Ga0102538_100686 | Ga0102538_10068613 | F046433 | MIELPTSPDALSELSPVAPPKLLSQAQDASRGNLMVYIKADNYLGTETSDPSFMKSRCETTEYEAINDFVQFIEMIKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRCHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFSGDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVKERIAGFEVDNDPEDHEASVLVMAASGDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIGELSLPALANIVRPYRNGEDFDGLSRFRQLLEKE* |
Ga0102538_101267 | Ga0102538_1012676 | F095633 | MGNYKNSAEVWRREGLTEDELRTMGTLAMEATEKLKKTIIRKETVLLGSVPFGSWGEFAKAVQEMAAHSYEPIPVEINTKRLIAKAFLDDRGEMSVEEHSVPEEVFIDLSRTRCDAEEDRNHKSYEFTCPALMEYPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN* |
Ga0102538_102800 | Ga0102538_10280012 | F073671 | LGERMNKEQAEHELAELHEKERSLEKALELVREKIRELVNYMDKNKGQK* |
Ga0102538_102800 | Ga0102538_1028006 | F067846 | MSIIADWERQEFNKWDKQCSKEDDYNRAVEMEIEAIKENISNLDDDVICAFREKMLDYGEVINAFDDDTFNDDEFIKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILND* |
Ga0102538_102930 | Ga0102538_1029307 | F054110 | MNGRRYVVDTRQSWSKFDKPCKIYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKAYKGGE* |
Ga0102538_105430 | Ga0102538_1054301 | F033081 | ITVVYRPKKGVMAWLFRRAMPQDPRPVFVWPRLVAAIGNVGYFSRRGFSVLAVGLIIVTIATIKILLFVPGLNQSVVSLLTRGLETFLPTGWATIAAWVVGTTGVFLIGSFTSSYTPSQRLLYSLEATGCGVYDTLLLLALIEEQAFRSGSEKWNWRERVRTSVCFGLLHIANIWYSFAAGIALSATGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL* |
Ga0102538_108469 | Ga0102538_1084692 | F094007 | MKLKTVEVLELARPSRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYGDEDFTKRNRIIVEMCDLFGGIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAISACEACPLKLDCELYGKLGGAVLSDVLNYKKVRTAISLTKAGKKRSGWNKGCIDNNA* |
Ga0102538_119397 | Ga0102538_1193972 | F095630 | DTCFMDVEFYSPEAQDAYNEALRLFDKWDIPRSAALRAATECGIEMNRLVAELAQDELLHELSAVQSSPWRQQSDITYRKADSKSSVNTLFRMPCIKYGVESKNLFPEYVLEYSVDEKYIQSPVDQALAAVVMQAVALNFLVMIREKHTVYDRGDQWSEASKSVGYRMFLGLIKKDDSIVHQLKCEFMEYVQFLSRSPFCDNLQAALVRCSHNYEQVLLDRGALNSILGGCVIGGKGWLKMADNTLIGQILKAIEIDVYDI* |
Ga0102538_123288 | Ga0102538_1232882 | F077405 | QGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIVQK* |
⦗Top⦘ |