NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008102

3300008102: Human supragingival plaque microbial communities from NIH, USA - visit 1, subject 404239096 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008102 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053103 | Ga0114249
Sample NameHuman supragingival plaque microbial communities from NIH, USA - visit 1, subject 404239096 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size133971712
Sequencing Scaffolds21
Novel Protein Genes24
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae10
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Propionibacteriaceae → Arachnia2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium2
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan1
All Organisms → cellular organisms → Bacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Oral Cavity → Supragingival Plaque → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F040149Metagenome162N
F040685Metagenome161N
F042387Metagenome158N
F043990Metagenome155N
F051213Metagenome144N
F061925Metagenome131N
F061927Metagenome131N
F063777Metagenome129N
F063778Metagenome129Y
F064818Metagenome128N
F064819Metagenome128N
F071329Metagenome122N
F077405Metagenome117N
F077781Metagenome / Metatranscriptome117N
F080165Metagenome115N
F092231Metagenome107Y
F097490Metagenome104N
F097525Metagenome104N
F101359Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0114249_1000009All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae86977Open in IMG/M
Ga0114249_1000011All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae82030Open in IMG/M
Ga0114249_1000019All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae71577Open in IMG/M
Ga0114249_1000034All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae60764Open in IMG/M
Ga0114249_1000071All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes42922Open in IMG/M
Ga0114249_1000090All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae37833Open in IMG/M
Ga0114249_1000156All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae29078Open in IMG/M
Ga0114249_1000603All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae14520Open in IMG/M
Ga0114249_1002277All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae6435Open in IMG/M
Ga0114249_1002918All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae5525Open in IMG/M
Ga0114249_1003034All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae5391Open in IMG/M
Ga0114249_1003380All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Propionibacteriaceae → Arachnia5044Open in IMG/M
Ga0114249_1004682All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales → Actinomycetaceae → Actinomyces4016Open in IMG/M
Ga0114249_1011453All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Weeksellaceae2038Open in IMG/M
Ga0114249_1012311All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae1924Open in IMG/M
Ga0114249_1020547All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Porphyromonadaceae → Porphyromonas1255Open in IMG/M
Ga0114249_1028289All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → Propionibacteriaceae → Arachnia962Open in IMG/M
Ga0114249_1029506All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium929Open in IMG/M
Ga0114249_1037750All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → Flavobacteriaceae → unclassified Flavobacteriaceae → Flavobacteriaceae bacterium758Open in IMG/M
Ga0114249_1048952All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan609Open in IMG/M
Ga0114249_1049699All Organisms → cellular organisms → Bacteria601Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0114249_1000009Ga0114249_100000928F063777LKVLNIKISALPPENNSFCMKLILYLCIPFLLIFGILFLIGWGIYSGINSIISAVKEDFFGIKDKTNIKTSKNILFENEQFKLKKEDYLPDENSQEYKIFDDFCVKSNEHLDDDGYIFYKLTDKKSATDLNGAIISEFQEDIGNYILLQNLILEDNQLKNQLISFNKTLGKSRFWLILKTFFG*
Ga0114249_1000009Ga0114249_100000946F097490VFLKTQKMNKKMNVFCKIILPLLCIISCSERKEIEVYNMKIDENKKEVLVEIRNNTENNYYLLSPIVSIMAKHLQYIDGEMIEGQIHHKKLDSIVCSVCIWDDICKEEYYAMRDIVLLPKKSVKKIKYKYDNEEYIEIETVHIGFPYNGYYNEIGKKMQFMLKKKLDSSNIIKGYEFYNKDIETMTIKM*
Ga0114249_1000011Ga0114249_10000113F042387MKRIILFFMAGLFLVSCSRENDKMNDETLANSAKMQLPTKVTIAENNKIISKRFEYQNDNELKEIIDEGSGERIVFVYEKDFITSKIRYSQAGEELGKTNYQYNNRKLSSVIDEVVISDSGIQYKRVVTREYSYNGSEVLVNENIKYHSESYAYNLRDENFTHTYVLNGENITKIHHEISKNVPGGHFYLNPNNMVVIDEEVTYDAKNSSYKNIKGFPVLAVEFCGLDKDENTIADYMNFRWVNHNPTLIKKSINLYGSGADSSEYKFQYEYKNNFPIKAKLDINNQTVTTMVYEYNK*
Ga0114249_1000019Ga0114249_100001943F080165MKNKKFLLAILLSLTACNNKTKTISTLDLEKTIIDYKDLPSKVKERVFYGEAMKLGEEDEERFQDFQETNNPKKYEYYTKQDPQLAWVHYPYIRNKKTKQEYSIDKDGPMGGRYIIYGDSLYISNHYNIYEEDSLRYTFTRYILR*
Ga0114249_1000034Ga0114249_100003438F061925MSWEYSFNLDSEESVSLVVMDLKNCELFSSSTTDYIDWKNPKSIDDIPYDARFYTDKKKTIYIAINSFSKNIFSALKSILAKQNYHLTDCDTDEEVTLEHIFRSVI*
Ga0114249_1000034Ga0114249_100003459F097525MRQKKIMFKIQNAYQKIIFSIHGHRERKDNFEDWLKVEVKVKDDLEGKYYTRVSECMLFSEVLGLLEWFEQISVDKEKSTEIGFIEPELAFEYQNKKLTVLLCYDVAPVSYGEEPYQLTFSLDDKTLAMIIKELGEAVASFKRE*
Ga0114249_1000071Ga0114249_100007143F071329MRPRYSKGILMLPVAKIIISGLSSIGAGMIASKLTKPIVSNANGIAKILLWFGSVGTGVAASAIVAREVEKQFDETVKAVKEARDHIEIED*
Ga0114249_1000090Ga0114249_100009038F064818MDKDEFLKKLIAFIADNSSELHPKVFKNKIRFGINKSSYTEMRFDYSQNYEGFYLQLASYNKDVGDFFEQEMGNAFLKMLEDESKEFRNLFFVQNSFQLSHYYYGFPIMTNDNTGHLYPEMGTTIFNDVLRNLQANHFKFIQAAEVLSPDLLHYIKKFPSCFFNTALVALLIIEKNLLSLDDERVQGLFEYDNMVTKNECKLFSPFDLIFGKKDYQQTAKQRILQRK*
Ga0114249_1000156Ga0114249_100015621F040685MKKLLFKLFFALAFTSISLHGQEKIQQVEFSSFGGRAVYSSRYTLNSLKKEFSAKPLMGQKEELPKEISLPNTPKNWEAFTKKINLDKFKKLRDGPSQQDFDGQDEVIIIKTDKKTYRKMNARGNDHDREVWYSLLQIIAKEFGKKSIYE*
Ga0114249_1000603Ga0114249_100060317F061927MKNSIFILSAMFILGACSSDSAQKVTEKIKNAYSDGLRKTFIEEGVKSCIKNSGLKESEAREYCECAMNKLNESLSNDEIVDISMDNPPKDLDERVDKAIDSCIGE*
Ga0114249_1002277Ga0114249_10022777F043990MIKKLGIIFTFGVIILGIVVYANHKIERSWIEGEFGVNMSNMNIDEKYRKEEWAPNGDGEKIIILTYDQLDSSFTKLNKLPIKEDLPPNRIPKQFLNITNGYYKYVVDENDDWNLGILIVDTTRKEICIYYQIL*
Ga0114249_1002918Ga0114249_10029181F101359MKKFLSKRSILVGALALVLGFIVSSCSRDKDDDAIYTAKLQVQHHSNNSRNSAVNGSVTYRDANGNKRKIYLRSGMSESISFEVNKGFKSFVEVKATDIYGNLFVKWTVTKTYTGARVQNWSTNFTSYTGQGITDSYKETVK*
Ga0114249_1003034Ga0114249_10030346F064819MILKDNQGKDRIKFSDIVRLNKEEPVFVKMKVRISTETMVSEENLDIERSDLEDLTRNLKDLCKYKIRKFFFQNIDETIEIVFSINDIGTIAVEGKMYDESYMNSINFSFQTDLNGIATFSKEISQELKKNINNKKKQYARY*
Ga0114249_1003380Ga0114249_10033807F092231MIIFPPQEFITWANALAQFPWPIQLNDFLPYAEQLGWKPTSRPTWFNVLPDNDYALTNISSDRDGNVRNLFLPMASDETEDDEGAAELNDHFASCVVAGRKEWGEPFLLEAGDGPSVTWRFLGDMFAQVTSGRWSVLFNFFTPEGRRQFL*
Ga0114249_1003380Ga0114249_10033808F092231MIIFAPQEFITWVNMLVQFPWPIQVDDFVPYAERLGWTPTPRPTWFNVNANNDTRNVILGDDRSGNVRDLFLSMARNEAEDDEGAVELNDHFVSCVAAGRKAWGEPFLLEAGDGPSVTWRFPGDMFAQVTSGRWSVLFNFFTPEGRRQFL*
Ga0114249_1004682Ga0114249_10046821F063778MPETISEGAKQQLLQQLQDALGLVENADTSAHDVAAITHSAADGHQLTEVMLQEMTAARGHLKSCADQIEYAISSIKAIPLDPPPED*
Ga0114249_1011453Ga0114249_10114533F061927MKKSIFILSVMFILGACSSDSAQKATEKIKNAYSEGLRKTFIKEGIKSCIENSGLKESEAREYCECAMNKLNESLSNDEIVDISMDNSPKDLDNRIEKAISSCIGE*
Ga0114249_1012311Ga0114249_10123111F077405GRALPTELFPHLLVAKQRGVFYGFILLCQIKFVKKFFDWLKIIQK*
Ga0114249_1020547Ga0114249_10205472F051213MNLYKILGALILALSFVFTSCDWVGNEPTIEGDRYKTFDSSAQRKSFRVVNASGKRYNHKVDWDIVGIKQLWVKTYLTKKVDTLSNGDLKISYDWVSFTVKERKSVIEVEVQKNETGQDRSVFFVTSNNRTQSHRPNIVVTQKAK*
Ga0114249_1028289Ga0114249_10282892F092231MIIFSPQEFITWTNRLVQFPWPIQLKDFPPYAEKLGWKPTSLPDEFGVCPDKDDEITILSSDRDGNVRNLFLYMASNEAEDDEGAAEMNDHFVSCVATGRKAWGEPFLLEAGDGPGVTWRFPGDMFSQVTGGFTAVLFNFFTPEGRRQFL*
Ga0114249_1029506Ga0114249_10295061F043990VVYANHKIERSWIEGEFGVNMNIDEKYREEEWAPNGDGEKTIILTYDQLDSSFTKLNKLPIKEDLPPNGIPKQFLNNTNGYYKYIGDENDDWNFGMLIVDTTRKEICIYNQIL*
Ga0114249_1037750Ga0114249_10377502F040685MKKLLFKLFFALAFASISLHGQEKIQQVEVHIFGGMALYSSHYTINFLYKEFEAKQVMGEPAELPKKILLLNPPDKWRVFTKKINLDKFKKLRDGPSEQAVDGQDEVIIIKTDKKTYRKMNAYSNDHDREVWYDLLQIIAKEFGKKGIYE*
Ga0114249_1048952Ga0114249_10489521F077781SSWAPARPAPAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGDLSVRTADSLYLGFPRPHPGTPGLGRFWPYLALQSLSETPSHARMPRVTVARTSPATLEISPLRAAT*
Ga0114249_1049699Ga0114249_10496992F040149VADEGAKEFRWKVLIEEQGIPVLFVEVVAWYDGRVSSSEILSSFGIALEREPRLTPVWHHDSEDAIHDFIYDISVPKGHALTAVRERETVVMQLLNIHRMF*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.