| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300007196 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0063646 / Gp0052642 / Ga0103270 |
| Sample Name | Human saliva microbial communities from NIH, USA - visit 1, subject 763961826 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 110290466 |
| Sequencing Scaffolds | 22 |
| Novel Protein Genes | 25 |
| Associated Families | 21 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1 |
| Not Available | 4 |
| All Organisms → cellular organisms → Bacteria | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus parasanguinis | 1 |
| All Organisms → Viruses → Predicted Viral | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Saliva → Human → Human Microbial Communities From The National Institute Of Health, USA, HMP Production Phase |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |

| Location Information | |
|---|---|
| Location | USA: Maryland: National Institute of Health |
| Latitude (°) | 39.0042816 |
| Longitude (°) | -77.1012173 |
| Altitude (m) | N/A |
| Depth (m) | N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F018385 | Metagenome | 235 | Y |
| F027205 | Metagenome | 195 | N |
| F032313 | Metagenome | 180 | N |
| F033081 | Metagenome | 178 | Y |
| F036281 | Metagenome | 170 | N |
| F046433 | Metagenome | 151 | N |
| F051211 | Metagenome | 144 | N |
| F054109 | Metagenome | 140 | N |
| F054110 | Metagenome | 140 | N |
| F066860 | Metagenome | 126 | N |
| F067847 | Metagenome | 125 | N |
| F068942 | Metagenome | 124 | N |
| F072446 | Metagenome | 121 | N |
| F077405 | Metagenome | 117 | N |
| F078842 | Metagenome | 116 | N |
| F085820 | Metagenome | 111 | N |
| F092230 | Metagenome | 107 | N |
| F097527 | Metagenome | 104 | N |
| F099454 | Metagenome | 103 | N |
| F103433 | Metagenome | 101 | N |
| F105378 | Metagenome | 100 | N |

| Scaffold | Taxonomy | Length |
|---|---|---|
| Ga0103270_100008 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 224115 |
| Ga0103270_100189 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella | 41434 |
| Ga0103270_100494 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 24260 |
| Ga0103270_100909 | Not Available | 15573 |
| Ga0103270_100968 | All Organisms → cellular organisms → Bacteria | 14862 |
| Ga0103270_101245 | All Organisms → cellular organisms → Bacteria | 12331 |
| Ga0103270_102118 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 7964 |
| Ga0103270_103510 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 4999 |
| Ga0103270_103974 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 4427 |
| Ga0103270_105687 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 3108 |
| Ga0103270_106650 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2621 |
| Ga0103270_108308 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella | 2065 |
| Ga0103270_108998 | Not Available | 1894 |
| Ga0103270_109097 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → unclassified Saccharibacteria → Candidatus Saccharibacteria bacterium | 1871 |
| Ga0103270_109278 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1828 |
| Ga0103270_109835 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1715 |
| Ga0103270_111820 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus → Streptococcus parasanguinis | 1413 |
| Ga0103270_112259 | Not Available | 1361 |
| Ga0103270_112602 | All Organisms → cellular organisms → Bacteria | 1322 |
| Ga0103270_112659 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1316 |
| Ga0103270_116125 | All Organisms → Viruses → Predicted Viral | 1010 |
| Ga0103270_129696 | Not Available | 558 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0103270_100008 | Ga0103270_100008103 | F105378 | MNINVYVDNIKKWVQISSDEVLDRNKNLSDLKDKNAAIINLGLYDKFISKEALESGFLPDIFTPENIVTDSNHQFVTDAEKNKWNNKLNKPIANQDHLEENQIGYDELNEKFYIGLNNKNVLIGGASALDNIKVVNGFFSGNSQATVIRNTKQNQNGEFVRPIFVDVQCTEYTGGDLGEISVTYTAEAINVYNTGSFTGSFQCMIVYPLGSANR* |
| Ga0103270_100189 | Ga0103270_10018935 | F054110 | VNYQPTIKKLLKALQMNGRRYVVDTRQSWSKFDKPCKVYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVKTYKGGD* |
| Ga0103270_100494 | Ga0103270_1004945 | F054109 | MTGRPKSKKGVKVHTAFKIYPKDKERAQIMADKLDMSLSAYINKAVLEKLANDEKSEA* |
| Ga0103270_100909 | Ga0103270_1009094 | F105378 | MNVKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSIHQFVTDEEKNKWNNKLNAPVPMQDHLANNQIGYDSVNSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNEAGQLITPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE* |
| Ga0103270_100968 | Ga0103270_10096810 | F103433 | MIFVMKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWSGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARAIAGLSPDDASKIPAYIKRASIFLAVLTAFLLALVVQKIRALFGGITASVFVVMLSFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELIHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFVSLTDYYGSSDKAASAINARASDRGISGIRSMRAYAVGNFKILRPESYNFINKIVNLDNMANNGGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYILIGLWADYVVKRTVKYE* |
| Ga0103270_101245 | Ga0103270_10124510 | F036281 | MITLIKVDEGPVDIHELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYSYKEYFDYWTAREGRPAPFFYESRQYHVKSFMRVPGMPELLITAEREKDHWYTFRLSDGLKSKFTRHTITNEKGHQSYDWVLKNAQWELDTIRYF* |
| Ga0103270_101245 | Ga0103270_1012452 | F018385 | MAEYVNQWESYKELSIENDRDPVLDNPIIYGVNVKHFTLTVYSPEGRVNKYWNARILKDQVGRCRIACPRDGKILCFAWFEWTSYMFSHDGLNELVFMPRMNSRLPSTLWNTKEVN* |
| Ga0103270_101245 | Ga0103270_1012459 | F027205 | VASRLIVSADDILKAVKESEEFERKALTEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPRRIKRSTK* |
| Ga0103270_102118 | Ga0103270_1021189 | F092230 | MRQAHKRMVDKLKTHLLKVFLPLFIVCIILVAFFRQIGCGSDGDYAFQISEWGAKLKNIYGTDFINKEIIVRDNAVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQSTEEKEARDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYEFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCMLFLLIV* |
| Ga0103270_103510 | Ga0103270_1035101 | F077405 | NSRPRPWQGRALPTELFPHLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQKQKVRLK* |
| Ga0103270_103974 | Ga0103270_1039741 | F068942 | MIRKILSLPTLALCFTLGSAFFAGCNEDYIKNTETKVRWSNVKNPKYGDYINITLKAEGETFTTVGDHSRISFSGSVSTLDTFTRHDFSESDKDTAYYKDIVIYLTRNKSKGTATLKLVAPPNRTQQPKQFDFSIEVTPPAMYIF |
| Ga0103270_103974 | Ga0103270_1039743 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENNDGFVTEVHWSNVKNPEYGNAINITLKAEGETFTTVGNHSWISFSGSVSTLDTFTRHRFSEVDKDTAYYKDIIIYMTRNERERTTTLKLVAPPNRTQQPKQFDFSIEVTPPAMYIFKVRQPALPAKAQ* |
| Ga0103270_105687 | Ga0103270_1056874 | F099454 | MALTFVLTSCDRLTDEPTLEDRGYKYFDSTAQQKSFRVVTASGKPYNHKIDWHIIGILDPKSENYLTKKIDTLSNGDLKISYDWVAFIVRENKSVIDVEVQKNETGQDRSVDFVAQDNHKGLASPSMTVIQRAK* |
| Ga0103270_106650 | Ga0103270_1066502 | F105378 | MNIKVYVDKIKKWVQISSDEVLDVNKNLSDLKDKEAAITNLGLYEKFISKEALESGFLPDVFTPDNIVTDSTHQFVTDEEKSKWNNKLNAPIPIQDHLENNQIGYDSANSKFYIGLNNQNVLLGGSSCFDNIIVVNGFFSGNSQPTVIRNNKFNETGQLIIPVFVDVQCVEYTAGDLGEVSVSYTADAISIYNTGSFTGSFQCLIVYPLGSVNE* |
| Ga0103270_108308 | Ga0103270_1083083 | F072446 | MSSKLHKRRSFTPRHIHYQNRSGLGMTTGPKHSTTNLTIFLRIYKLGDTLLENIPPMKKLLFLLSGLCLYCLAACDNDHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDRTEKEYNDMELGKEAHLVFRATVHGDTINRHKKELKALSLQLNRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIITLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTLWPLPDYKYNEREL* |
| Ga0103270_108998 | Ga0103270_1089982 | F032313 | MSNCNWRAQSVSIGSFKIVLMYRFLILLFALTLMACDNDTPQEKPREQEKHEVPVPVPKPQFDEVGERIWYGRTPAMRLDSTDYGAGLISVFGMLTSKIPKQRFDSLFKQTVWEVKDIRVVETDLSLAKKNPGILGWVTTTEFTCRNGVILLHWQGIDVNHVDTVNYVYDEVDNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK* |
| Ga0103270_109097 | Ga0103270_1090972 | F066860 | MTTKKQKLQKQQAIDTWIVIALWVSAIWFSLARGFITGIGGWVLALLAPWALIVSCICLAIISRQMKKRHASKDHLTTIVRVSFIVMSISLFICGLAMPDFSDMETFSTLSVYTNNAISFETSKTIAIISGFVVVLSLFVAVTFGIAEDRE* |
| Ga0103270_109278 | Ga0103270_1092782 | F033081 | MHTDMHTDITVVYRPKKGVIAWLFRRAMPQDTRPTFVWSRLVTEIENAGYFSRRKFCVVAVGLIIMTIATIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATVTAWTVGMAGGVFLMGDFTNYTPSQKFLHKIKATRCEVYNTILFLALLEEQAFRSGSERWNWRERVRASVCFGLLHITNIWYSFAAGIALSATGFGFLLVYLWYSRKYRNQIIATAAAATVHALYNAIALSLITVVVAVYLAIDIAKLL* |
| Ga0103270_109835 | Ga0103270_1098351 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDDILRAAGECGIEMNRNIAEVDRSELSKKLREVQISPWRKQIDMAYRKAHNESSVNTLFCTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLSYDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMSDSARIRQMINSIELDVYEINS* |
| Ga0103270_111820 | Ga0103270_1118201 | F097527 | MIYFKMEKIGNSTHNKEKKTRSENLVFNTIPAAGVEPARPCGHWILS |
| Ga0103270_112259 | Ga0103270_1122592 | F085820 | MYEKTTFLLLLSENDYLCVVLIRNCSNFICALMRSTFYLFAMLFLATTFFSCETVEPSPRPTWGEIVNPIEAFMYPRDLKVFADGEEGRRWLILVVPDSTKSSFAPTSKSTPAEVARYKELSQLVGNPTEPVANECHFHRTWLTQGVKAIRVVRTHADGRDEDVTAQCGNLYFYTDKQIFDCQFKCGNRSIFAKPLGETVEADYLWLPGRDVFGLVAPLNPAGLKQRIVLRLADGTEIEKELSDKRKK* |
| Ga0103270_112602 | Ga0103270_1126022 | F051211 | MRVKKAIKVFEKIRDLPYGTSGSNEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLQIPEYILKIRRQQYERHVILRVFIDEFAYDVDPSIDIGLTPMLPMACWDGKSSTTTMAPLRYLRVYRPHSLHERILSQLRRKIFRSNPESFYTAIDIWLATIRVS* |
| Ga0103270_112659 | Ga0103270_1126591 | F046433 | MIELPTSPDALSELSPVAPPKLLSQAQDASRGNLMVYVKADNYLGTETSDPSFMKIRCKTTEYEAINDFVQFIEMIKHYLPDYMEDCAKELIDELAFLGMPELNFAANALAKRLRCHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFSGDEFEEYEGRLKVYGGRGEIDKSSKILFLDDWIVSGDQVRERIAGFEVDNDPEDHEASVLVMAASGDYLDNGISAYSQYGGTIYPVEACYVLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEGIDELSLPALANIVRPYRNGEDFDGLSRFRQLLERE* |
| Ga0103270_116125 | Ga0103270_1161252 | F067847 | MFSHIIRVRGIFDDEPTTKKLYFHMSRREMFDFIKRYDNVTNFEKWLQAAIDNEDLYAMMKFFDDLIGTSYGERQGERFVKSEQIKESFLNSPEYEELFDQLMDNPSLVREFYNGILPEKIMKQVQQDPKYKELDSKLKETELKNL* |
| Ga0103270_129696 | Ga0103270_1296961 | F054110 | LPSHCVRCIDYNRYCVVLDVNYQPMIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLIFPHKYKKGKTFKQGQLYKKESEYSSTKQHE |