| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006498 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052511 | Ga0100374 |
| Sample Name | Human supragingival plaque microbial communities from NIH, USA - visit 1, subject 159591683 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 151863366 |
| Sequencing Scaffolds | 16 |
| Novel Protein Genes | 27 |
| Associated Families | 27 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter | 1 |
| All Organisms → cellular organisms → Bacteria | 3 |
| Not Available | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Supragingival Plaque → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F018385 | Metagenome | 235 | Y |
| F022002 | Metagenome | 216 | Y |
| F027205 | Metagenome | 195 | N |
| F030786 | Metagenome | 184 | N |
| F036281 | Metagenome | 170 | N |
| F040685 | Metagenome | 161 | N |
| F043991 | Metagenome | 155 | N |
| F047127 | Metagenome | 150 | N |
| F049707 | Metagenome | 146 | N |
| F051211 | Metagenome | 144 | N |
| F051212 | Metagenome | 144 | N |
| F051214 | Metagenome | 144 | N |
| F061926 | Metagenome | 131 | N |
| F061927 | Metagenome | 131 | N |
| F063778 | Metagenome | 129 | Y |
| F071327 | Metagenome | 122 | N |
| F074985 | Metagenome | 119 | N |
| F078842 | Metagenome | 116 | N |
| F081454 | Metagenome | 114 | N |
| F081456 | Metagenome | 114 | N |
| F084362 | Metagenome | 112 | N |
| F085821 | Metagenome | 111 | N |
| F095630 | Metagenome | 105 | N |
| F095632 | Metagenome | 105 | N |
| F097526 | Metagenome | 104 | Y |
| F101358 | Metagenome | 102 | Y |
| F103430 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0100374_100013 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 223347 | Open in IMG/M |
| Ga0100374_100019 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Campylobacteraceae → Campylobacter | 208104 | Open in IMG/M |
| Ga0100374_100399 | All Organisms → cellular organisms → Bacteria | 43772 | Open in IMG/M |
| Ga0100374_100408 | All Organisms → cellular organisms → Bacteria | 42730 | Open in IMG/M |
| Ga0100374_100443 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 40315 | Open in IMG/M |
| Ga0100374_100623 | Not Available | 31051 | Open in IMG/M |
| Ga0100374_101183 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 17801 | Open in IMG/M |
| Ga0100374_102127 | All Organisms → cellular organisms → Bacteria | 10682 | Open in IMG/M |
| Ga0100374_102419 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cardiobacteriales → Cardiobacteriaceae → Cardiobacterium → Cardiobacterium valvarum → Cardiobacterium valvarum F0432 | 9479 | Open in IMG/M |
| Ga0100374_103258 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis | 7183 | Open in IMG/M |
| Ga0100374_103573 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 6590 | Open in IMG/M |
| Ga0100374_103866 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces → Actinomyces timonensis | 6131 | Open in IMG/M |
| Ga0100374_105412 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4457 | Open in IMG/M |
| Ga0100374_106444 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 3754 | Open in IMG/M |
| Ga0100374_111251 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 2120 | Open in IMG/M |
| Ga0100374_122400 | Not Available | 998 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0100374_100013 | Ga0100374_100013202 | F095630 | MRYSPLYKTSKDNGLLAHVYEHLLAQYVLKYLQDRGFFISSDIILTAKTYGDTCYMDVELYNPAAPNAYNEALQAFDKHTIPEKAVRRVVSECGIEMNRAVLELKQDELMSNLSRMQSSDWRQQGEMTYRKSYDKSSVNTLFCVPYLKYGKKSKKLFPEYVLEYSIDEEYINSPIDQALAAIVMQAVALNFLVAVRENYTVYDRGDQWSEASLSVGYRMFLGLAKEDKQITSQLKHEFAAYIQYLLKSPFCSNLQKALLRCSCNLEQVLLGRSTLNNILGGCVIGGRGWLEMADDARIKQMIAAIQLDVYDI* |
| Ga0100374_100019 | Ga0100374_10001971 | F085821 | MNDVFERLAPLAQTKKQQTAVAKFIEKGFKFTRQPDAQKFTFLVYDFFVAGNMPAVQLCIDYLVAQGYPTAENEKKFLNWVFMEPVYYLKYFISDASTQKKLHHNLLNLWYESTRRQILEQDNGKHDEAYIDEWVKANVNKTLSTIRSGETIASCKEDVAKHSRTLNDEYEQNIGLLDEYCRALLYTAETTQSQQELVAQTQNHIALLKDLYKKVNKIK* |
| Ga0100374_100399 | Ga0100374_10039914 | F074985 | MTRSVISEEDIVELTDGGWYKTPRIIKGKDFLAHIHDTYASGNAMYVEFKASEGEVRILEYRRLYDVDTESAVLFTINTYPQESILLKNIEEYEFIQYRPQQAWKAIHMGSTKRFSVSDFDELYIDQTFRKMTPVAFTHNNQSWTVMGLELASAETGWFIYLKRRDSDFMTRLNFNRDQKFLYNPISGSWSLDDPTQEIKDLEEIKQALRADAILDVTVSGVPMKLIRVQEIAKGVLFFVFQDEEKNKRYYYNRPAIKLRIVTDSETGEQKYLLDHIKAMHID* |
| Ga0100374_100399 | Ga0100374_10039915 | F036281 | MITLIKVDEGPVDIYELRMQYLAKLKETDGVMLPTFIYRNKDLFVTEFKPTCDDQWIMYMTNAEGLITKMRIKNGDLMSNGSVLFLAEERKTYNAKEYYDYWTAREGRPAPFFYESRQYHVKSFMRVPGSTDLWITAERETGHWYTFRMSDAQKSKFTRNTMTNEKGHQTYDWVLENVEWADDTIRYF* |
| Ga0100374_100399 | Ga0100374_10039916 | F027205 | VASRLIVSADDILKAVKESEEFERKALSEARKRDRDEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYILDEYDIPRRIKRSAK* |
| Ga0100374_100399 | Ga0100374_10039919 | F043991 | VSKKNPSVIDYFDLNGDLNEEAYEFEEVKLEDYIDKRSNVKPSWVGKYSHQMHFDLPDDTEVSFYKGLNIVYADINFAGGIRTILFKCRQKKNLTRFISRVLEIAQGDPSNVHPDFRA* |
| Ga0100374_100399 | Ga0100374_10039920 | F051214 | MPGKIVAHDTHLQIDTEFIELKDCFEAFRRGVEYRDKNDVDDILVICNAPDIIEYQLKNGDSFIVTYDPIHRIIVMRVFLHDEDITIKPIYIYNNREYQIACEFLRQVMHDKIDLKDEWIA* |
| Ga0100374_100399 | Ga0100374_10039922 | F018385 | MAEYENQWGPYKEHSIEKDRDPALDDPIIYGVNVKHFTVTVYSQDGRVNKYWNARILKDDLGCCRIACPRDGKILCFNWVHWTAYMFTHDGLNELVFMPGSSRKTISRLYYEEMK* |
| Ga0100374_100399 | Ga0100374_10039933 | F081456 | MFEEPPIYYILISLIFLIVFGAIAFATWLVWLTPIAFMAKLVMTAIGFLLCAITVILYTISAD* |
| Ga0100374_100408 | Ga0100374_10040825 | F095632 | MSFKETTGYKVVSLVASTSASITAGAVVGALCPPAGVVLTAIYGLGSSVLGTYVGDKAGRQYAETLAETIDSLQTPQNN* |
| Ga0100374_100408 | Ga0100374_10040844 | F049707 | VSEYRSPHNDGHDPYILIWEYGNDIRRAEFTERWAEYDETGWTVWYFRLVDGGVMTFSSREWDQKDDVNHLTTIWMRPSLYDIERKTS* |
| Ga0100374_100443 | Ga0100374_1004434 | F051211 | MKAKKAIRIFKKIRDLPYGTSGSDEVWSCYQKCVLLKQELQHIGITSQLLIGVFDWQDLPIPERILNLRRQRHERHVILRIFIDGSTYNIDPSIDIGLAPTLPIAHWDGTSSTATMASLKHLRVYRPHSLHERILSRLRSKLFRGNPKEFYIAIDKWLADTRAHHSS* |
| Ga0100374_100623 | Ga0100374_10062344 | F084362 | MSLMNCTFTVRWSDEKNKPHAKTYATEADAKRAKKWLLEHGVRSVDIAVKINNKPAGSLKDDKQSETEAEQKGFWWEK* |
| Ga0100374_100623 | Ga0100374_1006236 | F103430 | MIEQITIKAFIGSDNKTKKLEVDKIISIVNTNHEAFTLDYPVVGYWRGEAEETAVLYLSDDRQRVMNTLSELKEVLDQEAIAYQIENDLQLI* |
| Ga0100374_101183 | Ga0100374_10118317 | F081454 | MKAFKLVLLLFITSASLVFGQEKRYFFKHEFQPNSKYLIKYKTDMDGGYKFVGSKEVIDKIGMDGVKMTINSDIESAISTQKKQGNNVPFILEYTKYFYKAEINGETVNRKIPLQGVKLIGDIVNGKKMEVKNVEGNIDENTKKILIESIKQFSAIDTDFPKEGLKIGDSFDIVVPYKQSTQMGDIEMIMNIKYKFLKVEKEEAYFDMLIDFVMGDKNVKNMDLSASGDGKGFLLFDMKNNYFTSQNIDMTINLKLKTELLTLENTSKAKSVITQQKIK* |
| Ga0100374_102127 | Ga0100374_1021277 | F030786 | MKRTRIHKVVFQMLVVMVVTGSLQMLLKNGSATKGGNTMGTKKISLADITDGDDSKEGAIKVKYFDDDGADKIFENNNNRILNTINSQHISFNSQSAEYSKPQLFLLYRSLKVDC* |
| Ga0100374_102419 | Ga0100374_1024193 | F097526 | MTENSTMATTIYTQDDYCRLCERLNDDTVAALLHAHIDAFAADGNRQLLKRLTEAMRIAAEFEQAGRNGHPDPAELERRERWDSHCKRLQQHARMAGDQITNADNAAVSRLTKQCEKNGSRDTAIPSPHDDGYGFTAGLRDFPLNASQTALLWRMAVLTIAEMTDTTPELTAHYLNGTGGEHLGRALAGKTVYPVTVVSNLAWLLHEQHKSGQLQRDLLSAAKAYENRNTDNRLEDAMTKP* |
| Ga0100374_102795 | Ga0100374_1027951 | F061927 | MKNPIFILSAMLILGACSSDSTQKATEKIKNAYSEGLRKTFIKEGIKSCIENSGLKESEAREYCECAMNKLNESLSN |
| Ga0100374_103258 | Ga0100374_1032584 | F101358 | LTQPITTDGMPQERATHPDTYIIPKHENVPQYLWNVLRSSGQLDEGWIDREIINKDDVTLVRMTKLVNRSNIYQLEKCVPLAVLQEQNIDFTNAYLQKAYGVMVKDGRLQPAADTTHATHQNDSTDEQETEVLLAQRGDTYDHLGGDSARQLVGSVAAKASGEVDNSPLVQRALECIR* |
| Ga0100374_103573 | Ga0100374_1035733 | F040685 | MKKLLFKLFFALTLTSISLHGQEKIQQVEVHIFGGMALYSSHYTINFLYKEFEAKQVMGEPAELPKKILLLNPPDKWRVFTKKINLDRFKKLRDGPSEQAFDGQDEVIIIKTDKKTYRKMNASGNDHDREVWYDLLQIIAKEFGKKGIYE* |
| Ga0100374_103573 | Ga0100374_1035734 | F051212 | MKKFFFIFVLCWLHSCNGTEKAMATSPDTQKTSISEKQNAEKIERIIYSETGGDTGGKNVHLVITKDSIIYRLTEGVTDKKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTDKKEYSKTNIQNNTTWDYITKQIIDIKYSKLYNHLNLKK* |
| Ga0100374_103573 | Ga0100374_1035736 | F047127 | MKKTFAFILLSIISLAKAQLTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERRQAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDKPEIAKEVDQMVGDRDIREILEKYNRK* |
| Ga0100374_103866 | Ga0100374_1038666 | F063778 | MPETISEGAQQKLLQQLQNALGLVADADTSAHDVAAITHSAADGHQLTEVMLQQMTAIDAYLKNCQTSINDAIGNIEAIPLDPPPED* |
| Ga0100374_105412 | Ga0100374_1054125 | F078842 | MIISSIYKTVDNDGLIAHIYEHLLAQYVLKRLQDNELFVLSDIILSAKTYGDTCFMDAESYSPEAKKTYDEAVREFDKLVIPEDDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKARNESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLKKDDKIINQLGCDFLEYIKNLSSSVFCDNLQKALVRCSDSHKQVILNRSTLNAILGGCVIGGKGWLEMADSARIRQMVNSIELDIYEVDS* |
| Ga0100374_106444 | Ga0100374_1064443 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLLLQVHLSYKGTISEVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASEFSSYFKLFEYGIIILTCAGMITYVYIFLKNNQTLTLKIVVIALATALLFEYAYPWRLIFG* |
| Ga0100374_111251 | Ga0100374_1112513 | F061926 | MMIKTAKHIKTFLASVLLLIFVMNVSGLFVRLHHQEIHQKTEKIAECSDKVCYHKVHLQTKSDCDCGFLCTLNYFYILPEKPQTEFHVNEYFSYFSSYKIFVSERIILLWQSRAPPVLS* |
| Ga0100374_122400 | Ga0100374_1224001 | F022002 | MSSRLALTLGLLLALLLSLPSYAQDGQSKEPIDRTISGFTLGVTTPAEARAIIQRQGGEIEETQAWSDEVVYAITGLKYARRPTLSVRLYFYKGHLRSISFVFGDLKIFEQIESGLENKYGTMAEGKATSKMRVKGIADAFTSLEVVVHSFEDDGHVGFAYAYISYTDLELDRAYSAENENEI* |
| ⦗Top⦘ |