| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006244 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052793 | Ga0099354 |
| Sample Name | Human buccal mucosa microbial communities from NIH, USA - visit 2, subject 764811490 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 21672235 |
| Sequencing Scaffolds | 6 |
| Novel Protein Genes | 6 |
| Associated Families | 6 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 3 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Buccal Mucosa → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F018385 | Metagenome | 235 | Y |
| F027205 | Metagenome | 195 | N |
| F067847 | Metagenome | 125 | N |
| F077405 | Metagenome | 117 | N |
| F081456 | Metagenome | 114 | N |
| F105380 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0099354_100013 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 56676 | Open in IMG/M |
| Ga0099354_102305 | All Organisms → Viruses → Predicted Viral | 1709 | Open in IMG/M |
| Ga0099354_103158 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1256 | Open in IMG/M |
| Ga0099354_103312 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1205 | Open in IMG/M |
| Ga0099354_106915 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 608 | Open in IMG/M |
| Ga0099354_108053 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 526 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0099354_100013 | Ga0099354_1000131 | F077405 | LLVAKQRGVFYGFILLCQIKFVKKFFDWLKIIQK* |
| Ga0099354_102305 | Ga0099354_1023052 | F067847 | MFSHIIRVRGIFDDEPTTKKLYFHMSRREMFDFIKRYDNVTNFEKWLQAAIDNEDLYTMMKFFDDLIGTSYGERQGERFVKSEQIKESFLNSPEYEELFDQLMDNPALVREFYNGILPEKIMKQVQQDPKYKELDDKLKETELNNL* |
| Ga0099354_103158 | Ga0099354_1031584 | F081456 | MFEEPPIYYILISLIFLIVFGAISFATWLVWLTAVSFFVKLVITAIGFLFCAMTVILYTISAE* |
| Ga0099354_103312 | Ga0099354_1033121 | F105380 | MSILELDSSQYVKQGRIFKKFDSALLDSYMDGRQTEYSINLSELDDQISDGIVYADHEGKMIYKFGAKKILQTAITNGLTITGLASEFKMKHYSFWVPDLYFISYSSFNPNSDLYIAYRSKDAKKICLTNIWSGSGNVDFYSPNGGRLVYNRLCKGRMMDDIGTDDYEAWKRTPVNRASNFVNKFISARGNSDLD |
| Ga0099354_106915 | Ga0099354_1069151 | F018385 | GVLGLMAEFVNRWDPYAEVPIETHRDPVKDDHLIYGVNVPHFTVTVYSPDGRVNKYWNSRILEDMLGYCRIACPRDGKILKFKWSDWSVYMFTHDGLNELVFMPDSGRKIITQLFEKEVK |
| Ga0099354_108053 | Ga0099354_1080531 | F027205 | IMRAVKESEEFERKALAEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFVVDVSQVRNRDLADEIEKDLFAFMDYLLDEYDIPRRIRK* |
| ⦗Top⦘ |