NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007978

3300007978: Human throat microbial communities from NIH, USA - visit 1, subject 763496533 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007978 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052996 | Ga0111424
Sample NameHuman throat microbial communities from NIH, USA - visit 1, subject 763496533 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size100998577
Sequencing Scaffolds21
Novel Protein Genes22
Associated Families20

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 2793
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 4732
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis3
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Oribacterium → Oribacterium parvum1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → Porphyromonas bobii1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Throat → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F033081Metagenome178Y
F041827Metagenome159Y
F042387Metagenome158N
F043235Metagenome156N
F045567Metagenome152N
F046433Metagenome151N
F047508Metagenome149N
F054110Metagenome140N
F058221Metagenome135N
F061927Metagenome131N
F068942Metagenome124N
F071328Metagenome122N
F072446Metagenome121N
F077405Metagenome117N
F078842Metagenome116N
F090517Metagenome108N
F094007Metagenome106N
F095633Metagenome105N
F099454Metagenome103N
F103433Metagenome101N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0111424_100001All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279463504Open in IMG/M
Ga0111424_100005All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 279263750Open in IMG/M
Ga0111424_100009All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473184908Open in IMG/M
Ga0111424_100012All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella → Alloprevotella sp. oral taxon 473134253Open in IMG/M
Ga0111424_100020All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Prevotellaceae → Alloprevotella112349Open in IMG/M
Ga0111424_100046All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → unclassified Porphyromonas → Porphyromonas sp. oral taxon 27981477Open in IMG/M
Ga0111424_100196All Organisms → cellular organisms → Bacteria35326Open in IMG/M
Ga0111424_100659All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis15699Open in IMG/M
Ga0111424_100998All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria11625Open in IMG/M
Ga0111424_101512All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis8444Open in IMG/M
Ga0111424_101757All Organisms → cellular organisms → Bacteria7510Open in IMG/M
Ga0111424_102539All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Saccharibacteria → Candidatus Saccharimonas → Candidatus Saccharimonas aalborgensis5639Open in IMG/M
Ga0111424_102587All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Negativicutes → Veillonellales → Veillonellaceae → Veillonella → Veillonella tobetsuensis5552Open in IMG/M
Ga0111424_104878All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Oribacterium → Oribacterium parvum3295Open in IMG/M
Ga0111424_111113All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium1634Open in IMG/M
Ga0111424_115373All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1221Open in IMG/M
Ga0111424_115436All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1218Open in IMG/M
Ga0111424_123155All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium832Open in IMG/M
Ga0111424_124299All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales794Open in IMG/M
Ga0111424_125563All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium756Open in IMG/M
Ga0111424_136772All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas → Porphyromonas bobii522Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0111424_100001Ga0111424_10000124F043235MDEVVPSDEGHLLIDLCDDDPRSLCGGLGIVTRHPKGAIALFIGLAHRDQCDIDRIDTIPKEVWEFMEVTREIVDSLIQVSGAAILVKEVKDGMYMPHHLWAEVPRLGKVQHVEDFHVREALAIIVEGFGEAAGGCHSMAKDQEVPALYCRSHGFEGGRSMASNVLFPGSTH*
Ga0111424_100001Ga0111424_10000198F047508MLGHRLVEGRVKYPYLRCIGEYLRHSFDTEDVGWVVKRSELCALMEHIYYLWGDTYALSKALCTVYEAVTNGIDLIEGLYEVLFFENVEDNLYAACVVRNVKVALNLLSFGVTEGDEGVVDPYALFVPRGQDLVVGELDEGELQGGAATIEDQDFHKVLYYMVRCELILSSP*
Ga0111424_100005Ga0111424_10000563F099454MALTFVFTSCDRLTDEPTLEDRGYKYFDSTAQRKSFRVVTASGKPYNHKIDWHIIGILDSKSDTYLTKKVDTLSNGDFRISYDWVSFTVREKKSVIDVEVQKNETGEDRSVKFVAQDNHKGLASPSMKIIQQAK*
Ga0111424_100009Ga0111424_10000914F071328MSQKHWTFTHIIRYIEEYERNPLLIERMKWEFILEGEYIVEFVKLCKHLVLERTIDPKKHQAAAIYLRYSLQLLLKKRRAIRRLGVEKEYVSAILRQYGIHYREYGDNEHRVFFLDTGINLYFSKHDQSSIYIIQRIEFSNEEYRSFILKVLPVKKSEW*
Ga0111424_100012Ga0111424_10001291F072446MHEFDAQAHKNKFNPPTHPQMKKLVFLLFGLCLYGFTACDSDHEPTKPVRPFHGDTLAQIAWNFPYIVEHHYHSIPGIVPEGTTYRVPVIPRSVEDRTKKEYNEQDVDKVAHLVFRATVHGDTLNRHKKELETLARQLHALTLPTIGTSPVLCGVKSIKSVGVAENGRTYDLSWEMKLRIRDYSSRRKYNSGRIVTLDCEDTESMTARYVVQLGQIREHELAEHIQPELKFYLPVKRCMDFSSIRFDITLFNGEVLSFQHKLPSKSVLQELPNNSVLENYKQYGFEREATYFTTLWPVLEKEIER*
Ga0111424_100020Ga0111424_10002093F068942MIRKILSLPTLALCLTLGSALFAGCNEDYIKNTETKVRWSNVKNPKYGDPINITLKAEGETFTTVGDHSQIFFGYDASTLDTFTRHRFSEVDKDTAYYKDIVIYMTRNERERTTTLKLVAPPNRTQQPKQFDFSIEVSPPGTYFFNVRQPALPAKAQ*
Ga0111424_100046Ga0111424_10004651F045567MRQRDGHDTLTEELEGSITPLLYRAEGEARRPWVRMVTEDVVHTSTHRVEDALLPVDGDILTPRDGTHIVQTERVVVVLVSQEDRIDMIDTETCGLVVEVRATVDEDTLPTLGDDEGGGAQTTVTSIRAMAHRAATAYLGDTSAGARTEKNYLHVSRRESYHGKEMPSLSRERGCVVERVVR*
Ga0111424_100196Ga0111424_10019614F103433MKEWSKNKPGVVFFFVVWFILSISFIGNFFGTGLWNGWFDGFQKDSSAIVEKTAYCKNKYDYKGPLIAADSKDYNKIMMSQDCNPSQVKPYVSQYGLQARVIAGLSPNDASKIPAYIKRVSIFLAVLTAFLLALVVQKIRALFDRITASVFAVMLAFSPWIAGYARNIYWIEPMLIAPFVISFVGYQYFKKSKKLWLFYIIESVAMFLKLLNGYEYVSTISISVLVPIIFFELVHKNVKIINLWKQAAPVFAATVVAFFGAYWVNFMSLTDYYGSSDKAANAINARASDRGISSIRSMRAYAVGNFKILRPETYNFINQIVNLDNMANNSGKTYKYIIVNVVNYLLLPAITLPVHINGMFGEFIQSILFWTILGYLIILSSRKIIGKKYSRPFLWSMNFSAIGAFCWLALMPGHALPHAHINGIIFYIPLLLFVYVLIGLWADYVVKRTVKYE*
Ga0111424_100659Ga0111424_10065914F095633MRNYENSTEVGCREGLTEGELRTMGTLAMEATEELKKTTISKEAVLLGGVPFGSWDEFAKAVQEMAAHIYEPIPVKINTKRLIATAFLDDGGEMSVEEHSVPEEVFIDLSRTRCVVDADRNHKSYEFTCPALMEHPDGKLYLTRKAYVISVIDVNGSQEVDFNIIYGGLN*
Ga0111424_100998Ga0111424_10099815F094007MKLKTVEVLELARPNRAGVIDVVDSDGNVVPLDYLGEDFVPDANSYSDEDFTKRNRIIVEMCDLFGRIRRRAGFAERHRGRGDYDRARRIERNRGSDISEVGRLAINACEACPLKLDCELYGKLGGAVLSDVLDYKKVRTATSLTKAGKKRSGWNKGCIDNNA*
Ga0111424_101512Ga0111424_1015125F033081MHTDITVVYRPKKGVMAWLFRRVMPQDARPTFVWSRLVTEIENAGYFSRRKFSILAVGLIIMTIAMIKMLLFVPGLNQSVVSLLTRGLETFLPTGWATVTAWTVGMAGVFLMGDFTNYTPSQKFLHKIKATRYEMYNILLLLALLEEQAFRSGSEKWNWRERVRASVCFGLLHIANIWYSFAAGIALSVTGFGFLLVYLWYYRKYRIQIIATAAAATVHALYNAIAISLIAVVVAVYLAIDIAKLL*
Ga0111424_101757Ga0111424_1017574F078842MIISSIYKTADNDGLIAHVYEHLLAQYVLKRLQDNEFFVLGDIILSAKTYGDTCFMDAELYSPEAKKTYDEALREFDKLVIPEDAALRAASECGIEMNRNIAEVDRSELSKKLREVQISPWRKQIDMAYRKAHNESSVNTLFHTSYVKYSKESDDLFREYVLEYSIDESHIQTPVDQALAAIVMQIVALNFLTVVREKYTVYDRGDQWSEVSISVGYRMFLGLLKKDDKIINQLSCDFLEYIKILSSSVFCDNLQKALVRCSDNHKQVILNRSTLNAILGGCIIGGKGWLEMADSVRIRQMVNSIELDIYEVNS*
Ga0111424_102539Ga0111424_1025392F046433MIELPTSPNALSELSPVAPPKLLSQAQDASRGNLMVYVKADNYLGTETSDPSFMKSRYKTTEYEAINDFVQFIEMIKHYLPDYMENCAKELIDELAFLGMPELNFAANALAKRLRHHLEVDNKPVYIDVGNSLSQYRAKNEMKSSQYILSLVLSKFPDDEFEEYEGRLKVYGGRGEIGKSSKILFLDDWIVSGDQVKERIAGFEVDNDPESHEASVLVMAASGDYLDNGISAYSQYGGATYSVEACYRLKNSPDAGGMSRVTGIHSSTDNTFGYEVDGIAYCAIERGILKGEKIDELSLPALANIVRPYRNGKNFDGLPRFRQLLEKE*
Ga0111424_102587Ga0111424_1025872F054110VVLDVNYQPMIKKLLKALQMNGRRYVVDTRQSWSKFDKPCKVYIVSRMYTEEEYKLTFPHKYKKGKTFKQGQLYKKESEYSSTKQHEVLLFLVRTYKGGD*
Ga0111424_104878Ga0111424_1048784F061927MKNPIFILSAMLILGACSSESAKKAYNDSFRKTFIEEGVKSCIENSGLKESEAREYCECAMNKINENLSNDEIIDISMDNPPKDL
Ga0111424_111113Ga0111424_1111132F041827MKKLVLLDELREGSTITFQYKNRNEIESVNIDGVGNSDIDYEYDTYGRIVKERRFHRRYDYGETNITYQYDSQGRLASSHAISTEFYPGTGLTPRCSVEKKHTYTYQGNKVTVKIEMGADTCSAIPETGKEKTITLLVENGRVTKTFDEQGNIKQTIEYYNTKNALRNIKGFPALVVEFYIRPLTYELPYYNLGIERIEDLRYIDNIKTRDFHDGDYWEYRYSYDKENTYNGDYPNGVGIHARVHNDPTYDEYLYEISADRSYIKEE*
Ga0111424_115373Ga0111424_1153731F077405PWQGRALPTELFPRLLVAKQRGVFYGFIVLCQIKFVKIFFDWLKIIQK*
Ga0111424_115436Ga0111424_1154361F077405QGRALPTELFPRLLVAKQRGVFYGFISLCQIKFVKKSFD*
Ga0111424_123155Ga0111424_1231551F042387MKRIILFFMAGVFLVSFSRGNDKKTDETLANSAKMQLPTKVTIAENKKVISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKANYQYNNGKLSSVIDEVVISDSGIQYKRVVTREYHYNGSEVSVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPNGHFYLNPNNMVVVDEEVTYDAKNSPYKNIKGFSVLAVEFCGLDEDENTIADYLNFRWVSHNPTLIQKSTNLYGSGADSSEYKFQYEYKNNFP
Ga0111424_124299Ga0111424_1242992F058221NKMGKIKNFQDLKNQKEELRAEIKEIESVLSFENPRKSFGVITNGVTEKYLGGMMDSSLAQNAFFLADKFLFPSLEIGSAKLLSNALLKRVRPSMKKTLIGLGVAVLTPIVIMQIKKRLDDFQQRETAKSLSKLI*
Ga0111424_125563Ga0111424_1255631F041827ILKLFFMKHFLSALALGCLLLSCNRDLENNENNETPAPPKEERLVLASLFEYISNVRFQYKNGNEINRMTINEASIDFEYDTYGRIVKERRFDHKSDYGETIITYQYDNQSRLTSSHAISTQYYPDTGYTPRCSVEKKHTYTYQGNKVTVKIEMGTDTCSAIPETGKEKTITLFVENGKVVKSLDENNQIIETIEYLNTKNTLRNMKGFPPLVVEFYIRPLTYELPFYNDIEHIEDLRFIDNIKTRDFHNGS
Ga0111424_136772Ga0111424_1367721F090517LLLGLLFLASSCKNKKDTPRLQLSSVELRQTVWNGTLEYKNPKRDSYSVYLNFLSDSEVEVSGYDLKDPTYSRDLQVRYYYTITDRILTLKAQVNRELRPPMDQNTWYLIRKEPSLLVFQANAGNPDLEATLTLRKKL*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.