NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007728

3300007728: Human tongue dorsum microbial communities from NIH, USA - visit 2, subject 686765762 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007728 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052831 | Ga0105754
Sample NameHuman tongue dorsum microbial communities from NIH, USA - visit 2, subject 686765762 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size122909038
Sequencing Scaffolds21
Novel Protein Genes21
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4734
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2795
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00401
All Organisms → cellular organisms → Bacteria → Terrabacteria group1
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria2
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes3

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Tongue Dorsum → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F018385Metagenome235Y
F027205Metagenome195N
F032313Metagenome180N
F040149Metagenome162N
F043235Metagenome156N
F046432Metagenome151Y
F046433Metagenome151N
F047508Metagenome149N
F053092Metagenome141N
F054110Metagenome140N
F068942Metagenome124N
F071329Metagenome122N
F072446Metagenome121N
F080164Metagenome115N
F085820Metagenome111N
F092230Metagenome107N
F094007Metagenome106N
F095630Metagenome105N
F097527Metagenome104N
F098763Metagenome103N
F103433Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0105754_1000067All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47342585Open in IMG/M
Ga0105754_1000069All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis42063Open in IMG/M
Ga0105754_1000110All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47331619Open in IMG/M
Ga0105754_1000370All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47316375Open in IMG/M
Ga0105754_1000582All Organisms → cellular organisms → Bacteria12843Open in IMG/M
Ga0105754_1000583All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27912839Open in IMG/M
Ga0105754_1000601All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27912667Open in IMG/M
Ga0105754_1000768All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 47311030Open in IMG/M
Ga0105754_1000772All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27910993Open in IMG/M
Ga0105754_1001082All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2799283Open in IMG/M
Ga0105754_1001173All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2798908Open in IMG/M
Ga0105754_1002182All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae6400Open in IMG/M
Ga0105754_1003795All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis4637Open in IMG/M
Ga0105754_1004237All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis4327Open in IMG/M
Ga0105754_1006193All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473 → Alloprevotella sp. oral taxon 473 str. F00403411Open in IMG/M
Ga0105754_1022123All Organisms → cellular organisms → Bacteria → Terrabacteria group1276Open in IMG/M
Ga0105754_1022172All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1273Open in IMG/M
Ga0105754_1022540All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1253Open in IMG/M
Ga0105754_1027479All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1042Open in IMG/M
Ga0105754_1037292All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes768Open in IMG/M
Ga0105754_1046232All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes612Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0105754_1000067Ga0105754_100006717F085820MRSTFYLFAVLFLATTFFSCETDEPAPRARREIVNPIEAFMYPRDIKVYADDDDGRRWLILVLPDSTKSSFAPTSKSTPAEVARYKELSKLVGNPTEPVVNECNFRRTWLTQGVKAIRVVRTQADGRDEEVTAQCGNLSFYTYKQIFDCQFKCGDRSIFAKPLGEVVEADYQWLPGRDGFGLITPPNPDHLKQRIVLRLADGTEIEKELSEKGKK*
Ga0105754_1000069Ga0105754_100006923F054110VFDVNYQPTIKKLLKALQMNGRRYVVDVRQSWSKYDKPCKVYIVNRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD*
Ga0105754_1000110Ga0105754_100011018F080164MVGLALCAAPQVTLRERASAFPLITEKDATEVDAPYAWRLPVVPLSLDNREIRNFAKFPLLPSLSGGILTVRVLVVGDTVAVHRDLMDDFAKRCRATLGLGVRTAPKLFGIKGMHVYGVQKDGSRQAVDKQVTLHLPGFEKAEKPLHYKEQTGQLVLCEYYESHRGDLFLDVANAHPEIFGELCPVVDFHFPVELRRAYAWLLLEIELEDGTKLSTSLQHYDEQTSILDHPDRS*
Ga0105754_1000370Ga0105754_10003708F032313MYRFLILLYALTLMACDNDTPQEKPREQEKHEVPVPKPKPQFDEVGERIWYGQTPAMRLDSTDYGAGLTSVFGMLTSKISKQRFDSLFKQTVWEIKDIRVVETDLSLAKKNPGIMGWVTTTEFTCRNGVILLHRQGINVNHVDTVNYVYDEVGNEIVLEGTGIRWSVLRLNKNAVEFLQRGRTMWGPFDWYYGRNSGRSEVTLEAK*
Ga0105754_1000582Ga0105754_10005823F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDTSKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFGGITASVFAVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTIAISVLVPIIFFELVHKNVKIINLWKQAVPVFAATVVAFFGAYWVNFISLTDYYGSSDKAANAINARASDRGISGIRSMRAYAVGNFKILRPETYNFINQLVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFVQSILFWTILGYLIILSSRKIIGKKYSRPLLWSMNFSVIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0105754_1000583Ga0105754_10005836F047508MLGHRLVEGCVKYPYLRRIGEYLRHSFDTEDVGWVVKRSKLCALMEHIYYLWGDTYALSKALCTVYEAVTNGIDLIEGLYEVLFFENVEDNLYAARVVRNVKVALDLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGKLQGGAATVEDQDFHKVLYYMVRCELILSSP*
Ga0105754_1000601Ga0105754_10006014F053092MQSDQGLILCLTHALLVLGALILEPAEMEDTMDDHTVQLFGILIAKELGIATHRIKADEHVPRDHIPLTLVEGDDIGIVVMIEKVLIGLQDALITTELVAELADTTVIASSDLTDPVAKDTLSEARLLDVFVSIVSYKLRFFRHK*
Ga0105754_1000768Ga0105754_10007681F068942MIRKILSLPTLALCFTLGSAFFAGCNENYIENTETKVRWSNVKNPKYGDYINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFPEVDKDTAYYKDIVIYLTRNESKGTATLKLVAPPNRTRQPKQFKFSVSVNPPATYFFNVRQPALPAKAQ*
Ga0105754_1000772Ga0105754_10007728F098763MDRRASLHILSPALYLATKLFEAVVWATISYDSSDEVEGKGGMNAVPTAVDE*
Ga0105754_1001082Ga0105754_10010824F043235MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRYPEGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDTLIQVSGTAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEGFHVREALAIIVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLLPGSTH*
Ga0105754_1001173Ga0105754_10011733F040149VADEGDEELRWEVLIEEQGIPVLFVEVEAWYDGRVSSSEILRSVGVALEREPRLTPVWSHDSEDAIDYFIYDVLVPEGHALTAVGERETVVAQLLNIHRYVYYP*
Ga0105754_1002182Ga0105754_100218210F092230MRQAHKRTVDKLKAYLLKVFFPLFIVCIILVAFFRQIGCGSEGEYAFQISEWGAKLKNIYGTDFINKEIIVRDNIVRVDGIRCLYAVNQNEDGLSIYLLLPGGDYLTHNYVGSSFVRFSNSSEYINMAYGEGSVEVSDSTSTGEVQNTEEKEAHDKVDEAINSMRHLFASAIMVNLRVVELYKILTVCMILIVIAMTIGYYSYLKPETVYGFYCKLRRKEKYPSDVNLVKRIGFLVIILPPCILFLLIV*
Ga0105754_1003795Ga0105754_10037953F046433MIELPASPDALSELSPVAPPKLLSQSQDASRDNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIETTKHYLPDYMEDCAKELIDELAFLGVPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEADNDSKIHEASVLVMAASGDYLDNGISVYSQYGGAIYPVEACYRLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKREGIDELSLPALANIVRPYRNGEDFDGLSRFRQLLEKG*
Ga0105754_1004237Ga0105754_10042374F094007MKSKTVEVLELARPNRAGIIDVVDSDGNVVPLDYSGEDFVPDVNSYSDKDFTKRNSIIVEMCDLLGGTRRRAGFAEYHRGRGNYDRARRIERNRGSDISEVGRLAMNACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRPGWNKGCIDNNA*
Ga0105754_1006193Ga0105754_10061931F072446MKKLLFLLSGLCLYCLAACDNEHEPTKPVRPFHGDTLAQIAWNFRFIVENHYHSIPGIVPEGTTYRVPVIPRSVEDRTEKEYNDMDLGKEAHLVFRATVHGDTINRRRKELDAIALQLGRLTETSIGTSPVLCGVKSIEAVGIAENGNTYDLRGEMKLRIRDYSYRLKYPSGIVTLDCENTESLTAKYVVPLGRIREYELAEHIQPELKFYLPVKRCMDFSSIRFAITLFNGEVLSFQHKLPSKSVLQELPNKSVLENYQQYGFIPESTYFTTLWPLPDYKYNEREL*
Ga0105754_1022123Ga0105754_10221231F097527MIYFKMEKIGNSTYNKEKKTRSENLVFNTIPAARGAQAHPP
Ga0105754_1022172Ga0105754_10221721F046432MWEMTESKLSEIISKYQLPMDDYSVEIDGAFGRGEFFWVIKNQSTNKKYLLVNTYSHHGVESELECYREGGFDNLEAIPRKIETLENASDAEDEIPKYLFGMYSIFEMKS*
Ga0105754_1022540Ga0105754_10225402F095630MIISSLYKTVENNGLLAHIYEHLLAQYVMKALQGRGFFISSDIILTAKTYGDTCFMDVEFYSPEAQDAYNEALRLFDEWDIPRSAALRAATECGIEMNRLVAELAQDELLHELSAVQSSPWRQQSDMTYRKADSKSSVNTLFHAPYIKYGLESKNLFPEYVLEYSVDEKYIQSPVDQALAAVVMQAVALNFLVMIREKHTVYDRGDQWSEASKSVGYRMFLGLIKKDDSIVHQLKCEFMEYVQFLSASSFCNNLQAALVRCSHSYEQVLLGRDTLNSILGGCVIGSKGWLKMADNTLIGQILKAIEIDVYDI*
Ga0105754_1027479Ga0105754_10274793F071329MLPVAKIIISGLSSIGAGMIASKLTKPIVSNANGIAKILLWFGSVSTGVAASAIVAREVELQFDATVKAVQEARDHVEIED*
Ga0105754_1037292Ga0105754_10372921F027205VLIVSADYILKAVKESEACERKALSEARKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPRRIKRSTK*
Ga0105754_1046232Ga0105754_10462322F018385MADLINSWLPYQELSIEKDRDPVTDDEIIYGNNVKHFTLTVYSPEGRVNKYWNARILKDQVGYCRVACPREKKILCFNWVNWTAYMFTHDGMNELVFMPDSRRRTVSQLSLDN*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.