| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006524 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052545 | Ga0101033 |
| Sample Name | Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 158479027 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 154100708 |
| Sequencing Scaffolds | 25 |
| Novel Protein Genes | 28 |
| Associated Families | 24 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria | 5 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 2 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 4 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 2 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1 |
| Not Available | 1 |
| All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F030786 | Metagenome | 184 | N |
| F032313 | Metagenome | 180 | N |
| F033081 | Metagenome | 178 | Y |
| F041827 | Metagenome | 159 | Y |
| F046431 | Metagenome | 151 | Y |
| F046432 | Metagenome | 151 | Y |
| F047127 | Metagenome | 150 | N |
| F051212 | Metagenome | 144 | N |
| F051213 | Metagenome | 144 | N |
| F055792 | Metagenome | 138 | N |
| F058221 | Metagenome | 135 | N |
| F061925 | Metagenome | 131 | N |
| F061927 | Metagenome | 131 | N |
| F063777 | Metagenome | 129 | N |
| F064818 | Metagenome | 128 | N |
| F068942 | Metagenome | 124 | N |
| F071327 | Metagenome | 122 | N |
| F071328 | Metagenome | 122 | N |
| F077404 | Metagenome | 117 | N |
| F078842 | Metagenome | 116 | N |
| F089055 | Metagenome | 109 | Y |
| F090516 | Metagenome | 108 | N |
| F099454 | Metagenome | 103 | N |
| F105380 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0101033_100002 | All Organisms → cellular organisms → Bacteria | 179210 | Open in IMG/M |
| Ga0101033_100003 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 178667 | Open in IMG/M |
| Ga0101033_100353 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279 | 29662 | Open in IMG/M |
| Ga0101033_100548 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 23686 | Open in IMG/M |
| Ga0101033_101324 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 14683 | Open in IMG/M |
| Ga0101033_101395 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 14255 | Open in IMG/M |
| Ga0101033_101599 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 13154 | Open in IMG/M |
| Ga0101033_105139 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 6063 | Open in IMG/M |
| Ga0101033_106127 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 5271 | Open in IMG/M |
| Ga0101033_107112 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonia → Candidatus Nanosynbacterales → Candidatus Nanosynbacteraceae → Candidatus Nanosynbacter → Candidatus Nanosynbacter lyticus | 4653 | Open in IMG/M |
| Ga0101033_109021 | All Organisms → cellular organisms → Bacteria | 3768 | Open in IMG/M |
| Ga0101033_109442 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 3612 | Open in IMG/M |
| Ga0101033_114295 | All Organisms → cellular organisms → Bacteria | 2337 | Open in IMG/M |
| Ga0101033_114327 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 2331 | Open in IMG/M |
| Ga0101033_115204 | All Organisms → cellular organisms → Bacteria | 2190 | Open in IMG/M |
| Ga0101033_115589 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctHip2 | 2132 | Open in IMG/M |
| Ga0101033_115821 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 2098 | Open in IMG/M |
| Ga0101033_118184 | All Organisms → cellular organisms → Bacteria | 1797 | Open in IMG/M |
| Ga0101033_120926 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae | 1535 | Open in IMG/M |
| Ga0101033_121619 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 1475 | Open in IMG/M |
| Ga0101033_122972 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium | 1369 | Open in IMG/M |
| Ga0101033_125768 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1192 | Open in IMG/M |
| Ga0101033_134679 | Not Available | 823 | Open in IMG/M |
| Ga0101033_136161 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria | 781 | Open in IMG/M |
| Ga0101033_136370 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae | 774 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0101033_100002 | Ga0101033_100002181 | F046432 | MQMTENELSEVISKFQMPEGRYSIEQEGSFGRGEFFWIIKNQSTNQKYLLMNTYSHHGVESELECYREEGFDNLEAIPRKIETLEIPSDAEDEISKYLFGFYSIFEIKS* |
| Ga0101033_100003 | Ga0101033_10000317 | F099454 | MAFTFVFTSCDRLTDEPTLEDRGYKYFDSTAQRKSFRVVTASGKPYNHKIDWHIIGIRDSKSDTYLTKKVDTLSNGDLKISYDWISFTIREKKSVIDVEVQKNETGEDRSVKFVAQDNHKGLASPSMKVIQQAK* |
| Ga0101033_100003 | Ga0101033_10000318 | F051213 | MKASKLLWAVVMALTFVFTSCDPFSQNEPTIEGDRYKYFDSSAQRQSFRVVNGSGKPYNHKVDWHIIGIQEENSDTYLTKKVDTLSNGDFIISYDWVTFTVKENKSVIDVEVQKNETGKDRSVFFATSNSYKQAYLPNMIVTQRAK* |
| Ga0101033_100003 | Ga0101033_10000319 | F051213 | MKASKLLWAVVVALTFVFTSCDWVGDEPTIEGKLDKFFDSQAQRKSFRILTGSGKPYNHKVDWHIIGITDPYSDTYLTKKVDTLSNGDLKISYDWVSFTVRENKSVIDVEVQKNETGKVRAVYLNTNTSGRHITLPDMRVTQRAK* |
| Ga0101033_100353 | Ga0101033_10035314 | F055792 | VEIAGEALDSTSAVAHRILLLTTQLGESLLASLRTEDGVIAEAMVTGALERDLAIDCALEEVRPVFVDESDDGTEAGTTWSRHPLETLQKEGYILFEGSMLPCKACRVDPRSSVKSLDLEPRIIGEAIEPVALPDVTRLDESIALQGIGSLRDLLMTPDVSETDYLQTSREEGTDLLQLMSIIARKYQLFHTFVS* |
| Ga0101033_100548 | Ga0101033_10054810 | F071327 | MEDLFNSVYSTHKGISFSTVVVFGAFIFLILQVHLSYKGRISDVLRKTSFFSMILLYIQGILGVFLGIYSPEFSEASGFSSYFKLFEYGIIILTCAGMITYVYMFLKSNQILTLKVLIIALAAALLFEYAYPWRIIFG* |
| Ga0101033_101324 | Ga0101033_10132419 | F071328 | MQGLICHVNEGCKMSRKHWTFINLIRYIEEYERNPLLIERMKWKFIPEGECIVEFVELCKHLVLERTIDSKESLTTAIYLRYSSQLLLKKKRAIRRLGIEKENISAILRQCGRHYKEYGDDEHRVFFLDTDVNIYFCKHYQLPIYILQRIEFSNKEYRPFILKVLPWKKDEW* |
| Ga0101033_101395 | Ga0101033_1013951 | F051212 | MKKFFFIFVLYWLHSCNGTEKAMPTSPDTQKTSISEKQNTEKIERVIYESGGDKSGKNVHLVITKDSIIYRLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSEELIMDLPTTKIIIKTDKKEYSKTDIQANKTWDYITKQIIDIKYSQLNNHLNLEK* |
| Ga0101033_101395 | Ga0101033_1013953 | F047127 | MKKTFAFILLSIISLAKAQQTDIRYIPVISTDTISTKANLYPHVLSKNKFNPLVFENGFKVGERRQAVRLWDKDIFYLEFTDRKMNKRVFRQIPELKKNGKLFEVMLQGDVSWYRRYFSYRADTWDANYEHEDYFVKGDEIVNIPVKGRYKKKLKALLSDRPEIAKEVDQMVGDRDIREILEKYNSK* |
| Ga0101033_101599 | Ga0101033_1015998 | F064818 | MGKRKKYPIPNNINGNTPKLYFINHKFIVVLLPYFTYMDKDEFLKKLIAFIADNSSELHPKIFKNKIRFGINKNSYTEMRFDYSQNYKGFYLQLASYNKEVGDFFEQEMGNSFLKMLEDESKEFRNLFFVQNSFQISHYYYGFPIMTNDNTGHLYPEMGTTIFNDILRNLQANHFKFIQAAEVLSPDLLHYIKKFPSCFFNTALVALLIIEKNLLSLNDERVQGLFEYDNMVTKNECKLFSPFDLIFGKKDYQQTAKQRILQRR* |
| Ga0101033_105139 | Ga0101033_1051399 | F030786 | MKRTKIHNVVFQMLVVMIVTGSLQLLLKNGSAAKGGNAMGAKKISLADITGGDDSETGVLKVKYFDDEGVDKIFENNNNRILNTINSQHISYNSQSAEYSKPQLFLLYQSLKVDC* |
| Ga0101033_106127 | Ga0101033_1061273 | F061925 | MSWEYSINLDSEEAVSSVVTDLKICELFSSSTTDYIDWKNPKSIDSTPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI* |
| Ga0101033_107112 | Ga0101033_1071123 | F089055 | MNSTPECVTKTPEIEAREKLAAIFSDAKRYDGNSGVKPELGKAAIDGKDIKNIIKVNSADNEAVDLCNQALGSYGKSLDRINNSPLETVREIGDLLQSFREDKTKESCR* |
| Ga0101033_109021 | Ga0101033_1090213 | F078842 | MIISSIYKTADNDGLIAHIYEHLLAQYVLKRLQDNEFFVLSDIILSAKTYGDTCFVDAELYSSEAKKTYDEALREFDKLVIPEYDILRAAGECGIEMNRNIVEVDRSELSKKLREVQISPWRKQIDMAYRKAHDESSVNTLFRTSYIKYSKESDDLFRECVLEYSIDESHMQTPVDQALAAIVIQIVALNFLTVVREKYTVYDRGDQWSEASISVGYRMFLGLLEKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSVLNAILGGCVIGGKGWLEMASSAQIRRMINSIELDVYEVDL* |
| Ga0101033_109442 | Ga0101033_1094425 | F077404 | MAQQIIMTHKLAAAALSLKEPTQIGNTQNPFAMNTLKTIFTCFFTLCFMMVANSYAQKTDSINTEAGKSVLHRNAIYIPPALEQYADTALLHQRFNVENKGNYLYTPFTEDNEPTIPFNYGFLHPLGERFYNCFMGKVDRILRPKADKGFIILTSYLVVLGDSYAFDTSNKDTSKLADLKYLDFRHIKRDFSYGHPYQGFTHNDRIELSNFVQSYGRQAALETANAWVMASYPFSLQSTKFENRYTRGRKLILTDGHSTLYLYFLMIDSVAPNFDTEVLPYIKGVFRFNRFW* |
| Ga0101033_114295 | Ga0101033_1142955 | F046431 | ITIVSILLIIMLTYTQILVFAAESELTLTPKPETNNIHLKWTGPQNSSYRVYQKKPGSSTFETIGLTDFSNTDEEVKVLNVYPVSIAEYNTPDVNVTYLDGSSETIPKSALLKVWMEGRNSYRRV* |
| Ga0101033_114327 | Ga0101033_1143272 | F063777 | MKLLLYLCVPFLFIFGILLLIGWGIYSGISSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLKKEDYLPDENSQEYKIFDDFCAKSNEYLDDGYIFYRLTDEKSATELNGAIISEFQEDVGNYILLQNLILEDNQLKNQLISFNKNTGKITVLADIKDFFWLDFDSETKTINGYNNKEQIEITISE* |
| Ga0101033_115204 | Ga0101033_1152042 | F033081 | MCPPDLMVVHRPRKGVMAWLFNRVMPTDSRPVFVWPKLVAAIEDTGNFGKRWLTAIATGLIIVTIATIKALLMIPGLDSSVVGLLTSIFETFLPARWATGAAWIVGTTGVFLMGNFTSYTKKQKSLQSLKATGCGMYNTLLLFALLEEQAFRSGSERWNWRERVRASVCFGLLHVTNIWYSFAAGIALSVTGFGFLMVYLWYYRKYRSQIIATAAAATVHALYNAIAISLIAVVVAVYLAIDIAKLL* |
| Ga0101033_115589 | Ga0101033_1155891 | F105380 | MSILELDATQYVKQGRIFKKFDSNLLDSYMDGRQNNYNINLAELDDQISDGIVYADRNGKMIYKFGAKKIIQTAITNGLEISGLSKDLEMKHYSFWVPDLYFVSFYSFNPNEDLYIAYRSKDEEFICLTNIWPDGSSYAEDYFPNGDRLTLKRLCKGSMMSDVSSSDYDAWRNNAVTRASQFVNKFVSARGNSDLDFVGSALRSKVPSHNIKKCAVFLGTITKEQENVNTYEEFMEWTKNTKWFK* |
| Ga0101033_115821 | Ga0101033_1158212 | F041827 | MKHFLSALALGCLLLSCNRDLENDETPAPAPQKEKLVLLKQLSEGSTVTFQYKNGNEIESVNIDGVGKSDIDYEYDTYGRIVKERRFHRRYDRGETNITYQYDNQGRLVSSHAISTKFYPGTGLTPRCSVEKKHTYTYQGNKVIVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIEQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNHGIERIEDLRYIDNIKTRDFHDGDYWEYRYRYDNGSTDNDYPNGVGIHARSHNDPTYDEYLYEISANRSYIKEE* |
| Ga0101033_118184 | Ga0101033_1181841 | F090516 | PLSPLFGKNKMTTFERYFIENDEVISLDKSSEFTDVIVGIGFLPQDMSENTDFKKKTLEKYGFSSSTALADDFRKRVLNIDEPIPENFEKDGIGYVYTVISGYDTFYNRMYMFGIHCFNGDFNVTYFDLENDAETGDYYKEHELYSQAKGYRWLDPESDYYEDVLAWEALNKLATDIYFHLEDKLDVKIDIKPIPEEEKVVPTQEHLAKFLAFCGVEQDVIDENKERLLKALEEYTPDEYEGVSEVMAEMMEYSHKIQRAEPVIEIIREYGVCRFSDWKFYAEELEEYILDLADFSDWKWEYPADTYSADLFPYMRKQLSLYHLWLCHLDEGADAYLFLLFSEKDMPEIMKLARILDLPLKAYFK* |
| Ga0101033_120926 | Ga0101033_1209263 | F058221 | MGKIKNFQDLKNQKEELRAEIKEIESVLSFENPRKSFGVITNGVTEKYLGGMMDSSLAQNAFFLADKFLFPSLKVGSAKLLSNVLLKRTKPSMKKTLIGLGVVVLAPVVIMRIKKRLDDFQQRETAKSLSKLI* |
| Ga0101033_121619 | Ga0101033_1216194 | F046431 | MKILKKITIVSILLIIMLTYTQILVFAVESELTLIPNPETNNIHLKWTGPQNSSYKVYQKKPGSSNFETIGLTDFNNVDEEVKVLNIYPTIEGLPMVNVTYLDGQTETIPKSGLLKVWMEGRNSK* |
| Ga0101033_122972 | Ga0101033_1229721 | F061927 | LNYIIMKNPIFILSAMFILGACSSESAKKAYNDSFRKTFIEEGVKSCIENSGLKESEAREYCECAMNKINENLSNDEIIDISMDNPPKDLDERIDKAISSCVENKP* |
| Ga0101033_125768 | Ga0101033_1257683 | F046431 | KKITIICILIAIILTYMQTLVMAAESKLTLTPKLETNNIHLKWTGPQNSSYRVYQKKPGATQFETIGLTDFSNNAIDEEVKVLNIYPQESNTDGRLWPTLNSSQVANIAKVLPKVQVTYLDGQTETIQKTALLKVWMEGRNSK* |
| Ga0101033_134679 | Ga0101033_1346792 | F032313 | MYRLLILLFAITLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTRVLEMRTSSIPKQRFDSLFKQTVWEIKDICAVETDLSLAKKIPKFVGGSITKEFTCRNGVILRHMQGIDINLVDTVNYVYNEDLNEIVLEGTGIRWYVLRLNKYAVEFLQQGHNIWGPFDWYYGRNSGRSEVMLEAK* |
| Ga0101033_136161 | Ga0101033_1361611 | F089055 | MNSTPKCVTETLEIKAREKLAVIFSDAEQRDNSKVNPELGKTAIDIENTSRMNSADDGAVYLCNQALGSYGKSLDYINNSPLEAVQAIGNSLQLFREDKTKESCR* |
| Ga0101033_136370 | Ga0101033_1363702 | F068942 | MIRKILSLPTLALCFTLCTALFAGCGENYDGSVTEVHWSNVKNPEYGNAINIMLKAEGETFTTVGNHSWISFSNDASTLDTFTRHRFPEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFKFS |
| ⦗Top⦘ |