Basic Information | |
---|---|
IMG/M Taxon OID | 3300007123 |
GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0063646 / Gp0052609 / Ga0102684 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 764325968 |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 122383058 |
Sequencing Scaffolds | 17 |
Novel Protein Genes | 17 |
Associated Families | 14 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 1 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1 |
Not Available | 2 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | |
---|---|
Location | USA: Maryland: National Institute of Health |
Latitude (°) | 39.0042816 |
Longitude (°) | -77.1012173 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F032313 | Metagenome | 180 | N |
F046432 | Metagenome | 151 | Y |
F046433 | Metagenome | 151 | N |
F054110 | Metagenome | 140 | N |
F066860 | Metagenome | 126 | N |
F067846 | Metagenome | 125 | Y |
F078842 | Metagenome | 116 | N |
F081510 | Metagenome | 114 | N |
F095629 | Metagenome | 105 | N |
F097527 | Metagenome | 104 | N |
F103430 | Metagenome | 101 | N |
F103433 | Metagenome | 101 | N |
F105376 | Metagenome | 100 | N |
F105378 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length |
---|---|---|
Ga0102684_100005 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 239045 |
Ga0102684_100006 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 231265 |
Ga0102684_100089 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 58787 |
Ga0102684_100102 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 54592 |
Ga0102684_100174 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae | 40647 |
Ga0102684_100303 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 31010 |
Ga0102684_102168 | All Organisms → cellular organisms → Bacteria | 9256 |
Ga0102684_105582 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4377 |
Ga0102684_107436 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 3410 |
Ga0102684_107655 | All Organisms → cellular organisms → Bacteria | 3324 |
Ga0102684_109863 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2580 |
Ga0102684_111724 | Not Available | 2159 |
Ga0102684_116564 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1470 |
Ga0102684_116895 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1439 |
Ga0102684_119059 | Not Available | 1243 |
Ga0102684_119135 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1235 |
Ga0102684_119443 | All Organisms → Viruses → Predicted Viral | 1211 |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0102684_100005 | Ga0102684_100005100 | F095629 | MRELIICACLFGCFGVANAAAPVDQPKEVKVVHNDDSVALHKKVYKLEQRIERLEKLLAEKEGK* |
Ga0102684_100006 | Ga0102684_100006215 | F105378 | MKVNVYVDKLKKWVPISSDEILDRNKNLSDVKDKDAAITNLGLYDKFISKEALQSGFLPDVFTPENIQTDADHQFVSDSDKNNWNNKLNKPVEIQTNLEENQIGYDEVNEKFYIGLNNKNVLIGGASALDNIKIVNGFFSGNSQPTIIRNTKTREDGTLISPVFVDVQCVEYTGGDLGEVSVSYTSELINIYNTGSFTGAFQCMIVYPLGSVNR* |
Ga0102684_100089 | Ga0102684_10008940 | F081510 | MKLPKINVKAIKAGAKTTYNTAKILGKKYAPVVLVTTGLVGYGIAVYQGIKSGKKLEATKAKYEAKDAAGEEYTRLEVIKDVTKDVAVPVAIAVASTAAIGLGFAIQTNRLKAVSAALTAVTEEHARYRLQCKEVLDEETFKKIDTPMDQVTVEEDGKEAQSFVPKEGLMYGNWFKYSANYASDSPEYNEQWIRESIRVLEEKIARKGLLNFSDMLDQLGFDVPKAALPFGWTDTDGFYIEYDIMEVWNAEEQVHEPQIYVRWKCPRNLYATTNFRDLIPGRKELA* |
Ga0102684_100102 | Ga0102684_10010214 | F081510 | MKFNVNAIKSTAKTTWVTTKILGKKYAPYILLGAGLVGYGYSVYEGIKSGKKLEATKAKYEQMDAQGDPYSRMEVVTDIAKDVAVPVAVATASTAAIVLGFAIQTNRLKAVSAALAMVTEEHARYRLRAKTVLDEETFKKIDAPLETKTVEVDGEEIEVESIVPNEGDFYGMWFKKSHKYASDSPEYNEGVIKEADNVLTEKMMRKGMLTFAEVLDILGFEVPKAALPFGWTDTDGFYLEWDAHEVWNDDKQETEIQFYVRWKLPRNLYATTNFHDFIPKKTRKELK* |
Ga0102684_100174 | Ga0102684_10017412 | F067846 | MSIIADWERQEFNKWDKQCSKEDDYNRAIEMEIEAIKENISNCDDDVICVFREKMLDYDEVISAFDDDAFNDDEFIKAVALGTDYEEMRIKILTAMAEDRLEQLEEDYRKGYILND* |
Ga0102684_100303 | Ga0102684_1003039 | F054110 | MNGRRYVVDTRQSWSKYDKPCKVYIVSRMYTEEEYKLTFPEKYKKGKTFKEKQLYKKESEYSSTKQHEVLLFLVRTYKGGD* |
Ga0102684_102168 | Ga0102684_10216811 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKYLQDNEFFVLSDIILSAKTYGDTCFMDAELYSSEAKKTYDEALREFDKLVISEDEILRAVGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAYDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGSLKKDDKIINQLSYDFLEYIKNLSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSARIGQMVNSIELDIY |
Ga0102684_105582 | Ga0102684_1055825 | F103433 | NKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSSNDASKIPAYIKRVSIFLAALTAFLLALVVQKIRALFGRITASVFVVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYMLIGLWADYVVKRTVKYE* |
Ga0102684_107436 | Ga0102684_1074362 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDKE* |
Ga0102684_107655 | Ga0102684_1076556 | F046432 | MWEMTESKLSNIISKYQLPMDDYLVEIDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRRIETLELASDAEDEISKYLFGMYSIFEIKS* |
Ga0102684_109863 | Ga0102684_1098637 | F103430 | KAKLYKTKELEVDKIISTVNANHEAFTLDYPVIGYWRGETEETAVLYLSDERQKVMNTLNELKEVLDQEAIAYQIENDLQLI* |
Ga0102684_111724 | Ga0102684_1117242 | F105376 | MNEKPEVSAKEFGALQAKVEYIKDGVDKHTVMLERIENIARDNVTQAQLKTYIAEHEKESEEKYVKRTEIEGVMNFWSLVTSNLAKLFAIALVGLAIYATNNLIQQNKAVTELQEEVQTQVRRK* |
Ga0102684_116564 | Ga0102684_1165642 | F046432 | MWEMTESELSEVISKYQMPEGRYLVEQEGSFGESEFFWVIKNQLTNQKYLLMNTYSHHGVEDEVEYYREEGFDNLEAIPRKIETLENASDADDEISKYLFGMYSIFEIKS* |
Ga0102684_116895 | Ga0102684_1168953 | F097527 | MIYFKMEKIGNSTHNKEKKTRSENLVFNTIPAAGVEPAR |
Ga0102684_119059 | Ga0102684_1190591 | F032313 | MYRLLFLLFALTLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRW |
Ga0102684_119135 | Ga0102684_1191352 | F046433 | MSELSASLDVLGVLNPVTPPDLTLQARDTSRNNPVIYVAEDGYRGTRTSDPCFRKMRRETTEYKALNEFLYFMKMVEPYVPDHIKDDARELRNGLVFLGMSELHFAATALAKRLRYHLEVDNKPVYIDVGNSLSQCRVKNEMKSSQYILSLVLSKFPDDEFEECEGRLKVYAGRGEIDKSSKILFLDDWIIGGDQVKERIAGFEAYNNPGAHKVSVLVMAASSNYIDNGIGADPLWGEATYPVEAYYRLKNDHDDWGMSRVTGIHSSTDRTFGCEVDDIAYLAIEGGILKGERIDRLTLPALVNIVRPYRNGENFDGLSRFRQLLEKG* |
Ga0102684_119443 | Ga0102684_1194431 | F097527 | MIYFKMEKIGNSTYNKEKKTRSENLVFNTIPAAGVEPAR |