Basic Information | |
---|---|
IMG/M Taxon OID | 3300006475 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052508 | Ga0100234 |
Sample Name | Human buccal mucosa microbial communities from NIH, USA - visit 2 of subject 158883629 |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 21424045 |
Sequencing Scaffolds | 5 |
Novel Protein Genes | 5 |
Associated Families | 5 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Buccal Mucosa → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F077405 | Metagenome | 117 | N |
F077781 | Metagenome / Metatranscriptome | 117 | N |
F101360 | Metagenome | 102 | N |
F103431 | Metagenome | 101 | N |
F103434 | Metagenome | 101 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0100234_100208 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 7052 | Open in IMG/M |
Ga0100234_100358 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 4750 | Open in IMG/M |
Ga0100234_101021 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 2170 | Open in IMG/M |
Ga0100234_104662 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 718 | Open in IMG/M |
Ga0100234_105002 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 689 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0100234_100208 | Ga0100234_1002083 | F103434 | VVGFRPRRGRYLENGPHVVEVTLAVVKEGRTGRRFERGETFVVDKVLVQPSAGNALKATENRVIRGDLTDETTLKVFGTGRKWPGGPHSWVKIIKGPESLVGKTFQQAGEPLTYDASPMTRHWSVRCDTLGTESR* |
Ga0100234_100358 | Ga0100234_1003582 | F101360 | VSKESALRRAAIAAHIAKVASQEKKKALKELEEYMAPGDTSKPQDDGLQVGTVSVSAPQPRYQVVDENALVTWLEWNKPDAVHKVPAPWFVATAALEGFIKQTGEVPDGVEVVQGDPRISVRVSGAQEEAIRELISTGDISLIEIEGGDA* |
Ga0100234_101021 | Ga0100234_1010214 | F077405 | SNSRPRPWQGRALPTELFPRLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQK* |
Ga0100234_104662 | Ga0100234_1046623 | F103431 | MINLDGLIVGMLFFIQLFLQSIAWGVAIAHFLHAERGNAAAAAFDGAFGENIADCHAEDDNDKNAESQKEGFHVCIPEG* |
Ga0100234_105002 | Ga0100234_1050021 | F077781 | PIAAPARPASAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGALSVRTADSLYLGFPRPHPGTPGLGRFWNFLALQSLSETPSHARMPRVTVARTSPETLEISPLRAAT* |
⦗Top⦘ |