Basic Information | |
---|---|
IMG/M Taxon OID | 3300006496 Open in IMG/M |
GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052537 | Ga0100375 |
Sample Name | Human saliva microbial communities from NIH, USA - visit 2, subject 763577454 |
Sequencing Status | Permanent Draft |
Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
Published? | N |
Use Policy | Open |
Dataset Contents | |
---|---|
Total Genome Size | 75609468 |
Sequencing Scaffolds | 14 |
Novel Protein Genes | 14 |
Associated Families | 14 |
Dataset Phylogeny | |
---|---|
Taxonomy Groups | Number of Scaffolds |
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 3 |
All Organisms → cellular organisms → Bacteria | 2 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 4 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctWKa2 | 1 |
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1 |
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 1 |
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 1 |
Not Available | 1 |
Ecosystem Assignment (GOLD) | |
---|---|
Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Type | Host-Associated |
Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Saliva → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
Alternative Ecosystem Assignments | |
---|---|
Environment Ontology (ENVO) | Unclassified |
Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
Location Information | ||||||||
---|---|---|---|---|---|---|---|---|
Location | USA: Maryland: Natonal Institute of Health | |||||||
Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
Zoom: | Powered by OpenStreetMap © |
Family | Category | Number of Sequences | 3D Structure? |
---|---|---|---|
F033081 | Metagenome | 178 | Y |
F046433 | Metagenome | 151 | N |
F048299 | Metagenome / Metatranscriptome | 148 | Y |
F049707 | Metagenome | 146 | N |
F077405 | Metagenome | 117 | N |
F077781 | Metagenome / Metatranscriptome | 117 | N |
F081456 | Metagenome | 114 | N |
F081510 | Metagenome | 114 | N |
F084362 | Metagenome | 112 | N |
F095632 | Metagenome | 105 | N |
F101360 | Metagenome | 102 | N |
F103433 | Metagenome | 101 | N |
F103434 | Metagenome | 101 | N |
F105379 | Metagenome | 100 | N |
Scaffold | Taxonomy | Length | IMG/M Link |
---|---|---|---|
Ga0100375_1000001 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 82166 | Open in IMG/M |
Ga0100375_1000007 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 61368 | Open in IMG/M |
Ga0100375_1000008 | All Organisms → cellular organisms → Bacteria | 56410 | Open in IMG/M |
Ga0100375_1000036 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 21850 | Open in IMG/M |
Ga0100375_1000037 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 21139 | Open in IMG/M |
Ga0100375_1000063 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 15710 | Open in IMG/M |
Ga0100375_1000319 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctWKa2 | 7939 | Open in IMG/M |
Ga0100375_1000343 | All Organisms → cellular organisms → Bacteria | 7680 | Open in IMG/M |
Ga0100375_1000590 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 6046 | Open in IMG/M |
Ga0100375_1006984 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1572 | Open in IMG/M |
Ga0100375_1024222 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct6uZ8 | 707 | Open in IMG/M |
Ga0100375_1031320 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 598 | Open in IMG/M |
Ga0100375_1032734 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 581 | Open in IMG/M |
Ga0100375_1036653 | Not Available | 538 | Open in IMG/M |
Scaffold ID | Protein ID | Family | Sequence |
---|---|---|---|
Ga0100375_1000001 | Ga0100375_100000162 | F103433 | MILVMKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRVLFGGITASVFAVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINIWKQAVPVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE* |
Ga0100375_1000007 | Ga0100375_100000726 | F046433 | MIELPTSPDALSELSPVAPPKLLSQAQDASRDNLMVYVKADNYLGTETSDPSFMKSRCKTTEYEAINDFVQFIEMAKRYLPDYMEDCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVKERIAGFEVDNDPEIHEASVLVMAASEDYLVDGISAYSQYGGATYPVEAYYILKNSPDAGDMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGEDFDGLSRFRQLLERE* |
Ga0100375_1000008 | Ga0100375_10000081 | F084362 | ENTTPPAKPSAPEADAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDKQSETEAEQKGFWWEK* |
Ga0100375_1000036 | Ga0100375_100003612 | F033081 | MYPPDLIMVHRPQKGVMAWLFKKGMPQDPKPVFVWPRLVTEIENAGYFSRRKFSILAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPAGWATGAAWTVGMTGVFLMGNFTNYTPSQRFLHKTKATRCEAYNTLLLLALWEEQAFRAGSEKWSWRERVRASMCFGLAHIVNIWYSFAAGIALSVTGFGFLLVYLWYYRKYRSQIIATAAAATVHALYNAIALSLIAVVLAIDIAKLL* |
Ga0100375_1000037 | Ga0100375_100003713 | F103434 | VVGFRPRRGRYLENGPHVVEVTLAVIKEGRTGRRFERGETFMIDKVLVQPSAGNALKATENRDIRGDLTDETTLKIMGTGRKWPGGPHSWVKIIKGPDALVGKTFQQAGEPLTYDASPMTHHFSVRCDTLGTESR* |
Ga0100375_1000063 | Ga0100375_10000639 | F101360 | MAGTELRGRESPVSKENALRRAAIAAHVAKVASQEKKKALKELEEYMAPGDTSKPMIDGMQVGTVSVSAPQPRYQVVDEKALVAWLEWNKPDAVHKVPAPWFVATAALDGFIKQTGEVPDGVEVVQGDPRISVRISTAQEEAIRDLISTGDISLLMIEGGDA* |
Ga0100375_1000319 | Ga0100375_100031913 | F081510 | MKLPNMNTIKVAAKTTYTTSKILTKKYAPFILLGVGLAGYGYSVYEGIKSGKKLEKTKAKYEELDQANIPYSKKEVVMDIAKDVAVPVAVATASTAAIVLGFAIQTNRLKAVSAALAMATEEHARYRLRAKTVLDEETFKKIDAPLETKSVEVDGKEIEVESIVPNEGDFYGRWFKYSSNYASDDPEYNEAWVREVDDLMTARISKVGMITFAEVLDALGFEVPKAALPFGWTDGDGFFLEWDTHEVWNDDKQEYEAQLYVRWKTPRNLYATTNFKDLMPKKTRKELN* |
Ga0100375_1000343 | Ga0100375_100034311 | F095632 | MSFKETTGYKVVSLVASTSASITAGAVVGALCPPAGVVLTAIYGVGSSVLGTYVGDKAGRQYAETLAETIDSVKTPQTN* |
Ga0100375_1000590 | Ga0100375_10005903 | F049707 | VSEYRSPHNDGHDPYILIWEYGNDIRRAEFSERWAEYDETGWTVWYFRLVDGGIMTFSSREWEQKDDVNHLTTIWMRPSLYDSEKKTS* |
Ga0100375_1006984 | Ga0100375_10069843 | F077405 | SNSRPRPWQGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKVFDWLKIIQKQKVRLK |
Ga0100375_1024222 | Ga0100375_10242222 | F105379 | MVIHFPLSQSDIESLLSISKLLKCDKILYDRNYVNPIIGVGPEKSYFQTTSYMVDLSPHINNLLVNISDLKNLGKITQLEPAKDNPEIAIHKPVVSVFNWDAEYVKACMNSLREYQIDDNIIARTDEFHNTDCYNELMAGSASTGASRINVGGYMIDIPKSAMPTLKSDHVVATVYNAPNKDFNVLRFKITKRNGIIVNQSMLFL |
Ga0100375_1031320 | Ga0100375_10313201 | F077781 | PPPIAAPARPASAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGALSVRTADSLYLGVPRPHPGTPGLGRFWPFLALQSLSETPSHARMPRVTVARPSPETLEISPLRAAT* |
Ga0100375_1032734 | Ga0100375_10327342 | F081456 | PIYYILISLIFLIVFGAISFATWLVWLTNVAFFVKLIITAIGALFAAFTVILYTISAE* |
Ga0100375_1036653 | Ga0100375_10366532 | F048299 | QKTWVQSLGQEDPLEKEMATHSSILAWRIPWTEEPGRLQSMGSQRVRQDSATK* |
⦗Top⦘ |