Basic Information | |
---|---|
IMG/M Taxon OID | 7000000727 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0053321 | Ga0031240 |
Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 1, subject 763577454 |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 112258145 |
Sequencing Scaffolds | 12 |
Novel Protein Genes | 12 |
Associated Families | 10 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
Not Available | 4 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 4 |
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 1 |
All Organisms → Viruses → Predicted Viral | 1 |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | National Institutes of Health, USA | |||||||
Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F033081 | Metagenome | 178 | Y |
F077404 | Metagenome | 117 | N |
F080164 | Metagenome | 115 | N |
F080166 | Metagenome | 115 | N |
F081455 | Metagenome | 114 | N |
F085820 | Metagenome | 111 | N |
F092229 | Metagenome | 107 | N |
F099452 | Metagenome | 103 | N |
F103436 | Metagenome | 101 | Y |
F105378 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
C3698104 | Not Available | 706 | Open in IMG/M |
C3698358 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 707 | Open in IMG/M |
C3709064 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 786 | Open in IMG/M |
C3719699 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 888 | Open in IMG/M |
C3738730 | Not Available | 1207 | Open in IMG/M |
C3755218 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 1959 | Open in IMG/M |
C3757230 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F0040 | 2151 | Open in IMG/M |
C3758172 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ctYA416 | 2261 | Open in IMG/M |
C3765622 | All Organisms → Viruses → Predicted Viral | 4278 | Open in IMG/M |
SRS014470_WUGC_scaffold_1500 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1021 | Open in IMG/M |
SRS014470_WUGC_scaffold_15456 | Not Available | 881 | Open in IMG/M |
SRS014470_WUGC_scaffold_47773 | Not Available | 6082 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
C3698104 | C3698104__gene_153398 | F085820 | FYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVVNECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLADGTEIEKELSKKGKK |
C3698358 | C3698358__gene_153535 | F080166 | VEQIFELNYYLTFKLEVTFNTIHKKINNEIKENFHSEYVVGANKLTTNLRYKYQMRLSPRGEKIGIVIDWDNYDDLCTVIEESINICDPENKMSPFKRLYSTTGDLLDIKCDSLKVRYLHLEDRWNNKVDLIPFVLVDDNRGTLTEAMRFRFNNDLTFDVPVSRLKGFRRFLMTYNPVLHAGAMARYMAITPLLGTNRQNMLR |
C3709064 | C3709064__gene_159362 | F103436 | MMATSKYGCCDKSDAEILENLKDWVSKCDVSSKREALKKIDSAFALWGAGEYVSAVHLLDENEVFLEKSDWPYYALGIEILKARKHEFFNE |
C3719699 | C3719699__gene_165266 | F080166 | VEQIFELNYQLTLKMEVTFNNTIKRINTEIKENFHTEYVVGANKLTTNLRYRYRMRLSPRGETVGVIIDWDNYDDLCTVIDEAIDICDPGNKTSPFKRMYSTTGDLLDIKCDSLKVRYLHLDDRFGNRLDLMPFVLIDDHNGTLTEAMKFRFNNDLIFDVPVSRLKGFRRFLMTYNPLLHAGAMARYMSMTPLLGTNRQNMMK |
C3738730 | C3738730__gene_176926 | F080164 | MVGLALCAAPQVTLRERASAFPLITEKDASEIDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLIVGDTVAVHQDLMDDFAKRCRTTLGLGVRTAPKLFGIKGMHVYGVQKDKSRQAVDEQVTLHLPGFEKAEKPLLYKGQEGRLVLCEYYESHRSDLLLDAANARPEIFGELRPVIDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPARS |
C3755218 | C3755218__gene_188945 | F092229 | DVLECMMDINPIVRTKIYDDYEFAKDVAERRFGSTIEKLDLRTVLQKCITRPYNSILNNIYFRYFNSELIDDLFKLGQSPKVLDLAIEYECEYYTVNAAKTNIRRYNTDAYYNKFAADSNIISSHRSLHDPQVNAVESAKFTHELLMASRSENFNPEMVREIFVKYGLKPNASRNLYNRINDNLNLFYYIEDYLEEYKEEGKFIYGTREYKILKELRSLPLMVVLTQLTRKNDSGYILNSNLELVKG |
C3757230 | C3757230__gene_190656 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFIQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFR |
C3758172 | C3758172__gene_191433 | F081455 | VIDLDDLSPIYYSIVLFKAKEKHIMETMKRDLELEQMLENATNILYQMADLAICGKKPNTEEKDIPPAEEIGAEKVDIIDSAEEAIQQPLENKDSSIAVNFSQMVNKPKEEVKTEVNSVPPEGETKVNVLFPKTEHILGNYVDYDSFIKIKESNTDKVVRAVRLLNYKMSDQNAAAAFAQFVSTFNPEGDPNKRLRYELIRHQGREKD |
C3765622 | C3765622__gene_199763 | F099452 | MDKTYAELLQETLSKIYELKDLNNRDRGKALTIFIGERLNRELLLSSMNIFNLYKEIVNLDDVSLLNDLRKTPWYKDWFIDDKRNSDLIDLSKFNFRSLERFEKESYLKDVEHYDFKKVIEVDSYSLYDTLAEENGVDLFKLAAENILINHGFFNNTDYNLYDIPDKYMEDIEVSLYMCLLNSGNMDFMDKKTFKSTELFYIVKNNICGTIFFTLFDRINEDTRTRAR |
SRS014470_WUGC_scaffold_1500 | SRS014470_WUGC_scaffold_1500__gene_1279 | F033081 | MYPPDLIMVHRPQKGVMAWLFKKGMPQDPKPVFVWPRLVTEIENAGYFSRRKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPAGWATGAAWTVGMTGVFLMGNFTNYTPSQRFLHKTKATRCEAYNTLLLLALWEEQAFRAGSEKWSWRERVRASMCFGLAHIVNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRSQIIATAAA |
SRS014470_WUGC_scaffold_15456 | SRS014470_WUGC_scaffold_15456__gene_13684 | F085820 | MRSTFYLFAMLFVATTFFSCETGEPAPRATWGEIVNPIEAFMYPRDLKVFAGDNDGRRWLILVIPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVANECHFHRTWLTQGVKAIRVVRTQADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGDRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPDGLKQRIVLRLADGTEIEKELSKKRTK |
SRS014470_WUGC_scaffold_47773 | SRS014470_WUGC_scaffold_47773__gene_52546 | F105378 | MKVSVYVDKLKKWVPISSDEILDRNKNLSDVKDKDAAITNLGLYDKFISKEALQSGFLPDVFTPENIQTDADHQFVSDSDKNNWNNKLNKPVEIQTNLEENQIGYDEVNEKFYIGLNNKNVLIGGASALDNIKIVNGFFSGNSQPTIIRNTKTREDGTLISPIFVDVQCVEYTGGDLGEVSVSYTSELINIYNTGSFTGAFQCMIVYPLGSVNR |
⦗Top⦘ |