Basic Information | |
---|---|
IMG/M Taxon OID | 3300008503 |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052918 | Ga0111013 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 159632143 reassembly |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 116698126 |
Sequencing Scaffolds | 17 |
Novel Protein Genes | 17 |
Associated Families | 16 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 4 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1 |
All Organisms → cellular organisms → Bacteria | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | |
---|---|
Location | USA: Maryland: National Institute of Health |
Latitude (°) | 39.0042816 |
Longitude (°) | -77.1012173 |
Altitude (m) | N/A |
Depth (m) | N/A |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F032313 | Metagenome | 180 | N |
F046433 | Metagenome | 151 | N |
F054109 | Metagenome | 140 | N |
F066860 | Metagenome | 126 | N |
F072446 | Metagenome | 121 | N |
F077405 | Metagenome | 117 | N |
F078842 | Metagenome | 116 | N |
F081455 | Metagenome | 114 | N |
F085820 | Metagenome | 111 | N |
F089055 | Metagenome | 109 | Y |
F092229 | Metagenome | 107 | N |
F095629 | Metagenome | 105 | N |
F099452 | Metagenome | 103 | N |
F099454 | Metagenome | 103 | N |
F105379 | Metagenome | 100 | N |
F105380 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0111013_100002 | Not Available | 133457 | Open in IMG/M |
Ga0111013_100603 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 17912 | Open in IMG/M |
Ga0111013_100886 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 14657 | Open in IMG/M |
Ga0111013_102479 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 7613 | Open in IMG/M |
Ga0111013_103620 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 5645 | Open in IMG/M |
Ga0111013_104798 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 4432 | Open in IMG/M |
Ga0111013_104864 | All Organisms → cellular organisms → Bacteria | 4378 | Open in IMG/M |
Ga0111013_105398 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 3991 | Open in IMG/M |
Ga0111013_105747 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Prevotella | 3782 | Open in IMG/M |
Ga0111013_106114 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3585 | Open in IMG/M |
Ga0111013_107200 | All Organisms → Viruses → Predicted Viral | 3111 | Open in IMG/M |
Ga0111013_107254 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae | 3089 | Open in IMG/M |
Ga0111013_107959 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas | 2861 | Open in IMG/M |
Ga0111013_113873 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1723 | Open in IMG/M |
Ga0111013_118034 | Not Available | 1337 | Open in IMG/M |
Ga0111013_122606 | Not Available | 1067 | Open in IMG/M |
Ga0111013_128157 | Not Available | 857 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0111013_100002 | Ga0111013_10000253 | F054109 | MVGRPKSKKGTKVHTAFKIYPTDKERAQNMADKLGMSLSAYINKAVLEKVERDEKSEN* |
Ga0111013_100603 | Ga0111013_10060315 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSFMVDLSPHINNLLVNISDLKNLDKITQLQPAADNPEIAIHRPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGAFRINIGGYMIDIPKSAIPTLKSDHVVATVYNAPNKNFNILRFKITKRNGIIVNQSMLFLPY* |
Ga0111013_100886 | Ga0111013_10088616 | F099452 | MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTSWYKKWFTYDQENSRLIDLSKFNFRSLERFEKVEYLKDVEHYDFEGVIEVDSYSLYDILAEENGLNLFCFAAENILLNHGFFNNTDYQLYDVPEEYIDDQEVCMYMCLLNKDNIDFIDKDTYEDTVLWDIVKDRLFGYVYWSIRDSIEEDTRTRAR* |
Ga0111013_102479 | Ga0111013_1024796 | F105380 | MPILELDASQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSREESYFPNGKRLKLKRLCKGSMMSDVSSSDYDAWRNNAITRASQFVNKFVSARGNSDLDFVGSALRNKVPSHNIKKCAVFLGKITKEQENVNTYEEFMEWTKNTKWFK* |
Ga0111013_103620 | Ga0111013_1036204 | F089055 | LKVEKMNSTPECVTKTPEIEAREKLAAIFSDAEQRDNSKVNPELGKTAIDIKMDFADNEAVDLCNQTLGSYGKSLDYINNNPLKTVQAIGSSLQLFREDKTKESCK* |
Ga0111013_104798 | Ga0111013_1047983 | F095629 | MMRELIICVCLLGCFGIANANNIEQPKEVKIVHNDDSVALHKKIHQLEKRIERLEELLKKEGK* |
Ga0111013_104864 | Ga0111013_1048644 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSSEAKKTYDEALREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHNESSVNTLFRTSYIKYSKESNDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCVIGGKGWLGMADSARIRQMINSIELDVYEVNS* |
Ga0111013_105398 | Ga0111013_1053985 | F092229 | MNEEYRFKHIPEVILRNVKFIRENNIDIGTGDDVLECMMDINPVLRQRIYDDYDLAKDVAERRFHTTIEELDLTTILQKCTTRPYIAILNNIYFRYFNSKLIDDMFKLGESTKVLDLAIEYECEYYTVNSAKTNIRRYMQQAYFDKYAADADIISSHRVLSDPQVNAVKSAEFTYDLLVAARSENFNPEMVRDIFLKYGLKTNSSRNLYNRMDNNLSLFYYLEDYLEEYVNTGKFTYGSQEYHTIKEFKYLPLMNVLTQLTRSNPSGYVLNHKLELVKG* |
Ga0111013_105747 | Ga0111013_1057472 | F032313 | MACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLIWVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPRFVGGSIAKEFTCRNGVILRHMQGIDINCVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKNAVEFLQQGHNIWGPFDWYYGRNSGRSEVTLEAK* |
Ga0111013_106114 | Ga0111013_1061144 | F046433 | MIELPASPDALSELSPVAPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPLFMKSRYKTTEYEAINDFVQFIEMTKHYLPDYMEYCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQCRVKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKVLFLDDWIISGDQVKKRIAGFAVDNDPESHEASVLVMAASSEYIDNGIVADSLWGEATYPVEAYYRLKNDHNDWGVSRVTGIHSSTDSAFGYEVDDIAYRAIEGGILKGERIDRLTLPALVNIVRPYRNGENFDGLSRFRQLLERE* |
Ga0111013_107200 | Ga0111013_1072005 | F081455 | KSKKDIPPVEEIGAETVDIIDAAEEAIQQPLQNTDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPEGDPNKRLRYELIRHQGREKDLVIRLSTVVNGTTKYYADIYPDLNKIDLDHHLISSAKK* |
Ga0111013_107254 | Ga0111013_1072543 | F072446 | MKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVEHHYHSIPGIVPERTTYRVPVIPRSVEDKTKKEYNDMELGKEAHLVFRATVHGDTINRQKRELEAISRQLNALTLTSIGTSPVLCGVKSIEAVGVAERGNTYDLSREMKLRIRDYFGRVKYSSSGIVTLNCENTESMTAKYVVPLARIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGKVLSFQHKLPSKSVLQELPNKSVLENYQQYGFEREATYFTTLWPLPDYKYNEREW* |
Ga0111013_107959 | Ga0111013_1079594 | F099454 | MKASKLLWAVIMALTFVFTSCDRLTDEPTLEDRGYKYFDSTAQRKSFRVVTASGKPYNHKIDWHIIGILDSKSDTYLTKKVDTLSNGDFRISYDWVSFTVREKKSVIDVEVQKNETGEDRSVKFVAQANH |
Ga0111013_113873 | Ga0111013_1138733 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSFARGFITGIGGWVLALLGPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFKTSKTIAIISGFVVVLSLFVAVTFGIAEDKE* |
Ga0111013_118034 | Ga0111013_1180342 | F085820 | MLFLATTFFSCETGEPAPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTHADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLANGTEIEKELSKNRTK* |
Ga0111013_122606 | Ga0111013_1226061 | F054109 | RGMKVHTAFKIYPDDKARAQAMADKLELSLSAYINKAVLEKVARDEKSED* |
Ga0111013_128157 | Ga0111013_1281572 | F077405 | QGRALPTELFPRLLVAKQRGVFYGFIVLCQIKFVKKVFDWLKIIQK* |