NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007362

3300007362: Human stool microbial communities from NIH, USA - visit 1, subject 675950834 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300007362
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052693 | Ga0104788
Sample Name: Human stool microbial communities from NIH, USA - visit 1, subject 675950834 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 145922181
Sequencing Scaffolds: 14
Novel Protein Genes: 25
Associated Families: 24

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 2
Not Available | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides salyersiae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Oscillospiraceae bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F026592 | Metagenome / Metatranscriptome | 197 | Y
F029444 | Metagenome | 188 | Y
F042095 | Metagenome | 159 | N
F050794 | Metagenome | 145 | N
F052660 | Metagenome | 142 | N
F055775 | Metagenome | 138 | N
F057001 | Metagenome | 137 | Y
F058555 | Metagenome | 135 | N
F075481 | Metagenome | 119 | N
F076064 | Metagenome | 118 | N
F077313 | Metagenome | 117 | N
F078005 | Metagenome | 117 | N
F080673 | Metagenome | 115 | N
F081453 | Metagenome | 114 | N
F083451 | Metagenome | 113 | N
F083452 | Metagenome | 113 | N
F089053 | Metagenome | 109 | Y
F089590 | Metagenome | 109 | N
F089591 | Metagenome | 109 | N
F089592 | Metagenome | 109 | N
F090514 | Metagenome | 108 | N
F101356 | Metagenome | 102 | N
F105374 | Metagenome | 100 | N
F106193 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0104788_100361 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 53276
Ga0104788_100533 | Not Available | 42204
Ga0104788_101173 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 22633
Ga0104788_101391 | Not Available | 19176
Ga0104788_101514 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium | 17374
Ga0104788_103135 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 8053
Ga0104788_104156 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 5691
Ga0104788_104240 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides salyersiae | 5553
Ga0104788_107204 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 2802
Ga0104788_107460 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2671
Ga0104788_115546 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1098
Ga0104788_115926 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1066
Ga0104788_115930 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Oscillospiraceae bacterium | 1066
Ga0104788_127767 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 598
Ga0104788_127767All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes598Open in IMG/M

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0104788_100361 | Ga0104788_10036134 | F078005 | MKKSKFVKELERIIDMVKAEDDGFEYGGKVIFYKEDDDNYEISVKNIEMNLMVEANTMASMDDRTFACLMSEVYKQKFTKAIMMSEDEDDEDN*
Ga0104788_100361 | Ga0104788_1003615 | F080673 | LATIGMNMNEIELLRLQDEALSYLRDNITKDEAYYVLTTDKDMIEILIADKKDGSKRIKILDMEYTIEKDDMLLLFDTDGIIDECLLAASYIGVNMYFRRQDVNAILNNINREKVMKYPYIAIQLDNIQTIEKRRVIFEITGHRIDDNKEKIDFMFVYFMARIL*
Ga0104788_100361 | Ga0104788_10036154 | F083451 | MIMDKNEREKQVLDLLMSRKDIRKLVEKSNECYSKMDFVGAMKYRQEIKDIVDRESKIMLTKSESLIGLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFVKFDNLDRLMAECKKEIDYLMKGTSKSFQISFAVRSDELREMIENMVGDNIREGYDMFKEEAKMTKETDRSKIEEFNKKLDHDQM*
Ga0104788_100533 | Ga0104788_10053311 | F106193 | VGCNTCKEKALKAERERIERSMMNRPSSTVVSDREYASRSTAGCMVMLDPLKTMERDVVSIYKQTRTIGDVGIVYLNMQKKIREWIKNLPYGCPPDEEVQEMRKEILNGRAEHIKP*
Ga0104788_100533 | Ga0104788_10053318 | F075481 | MRLRISLKAVFCLGLSLSLSSCGSRRQVSDTSIDSRLISRIETMIDEVMDRRIVEIKTSDLNADIVITEREFDTDKDVDPATGERPVSSQTDTHIVIGRRDSTVTADSLGVNKTRNDIKDLDNKTNIKSKDVDDKKESRWPIVWIVAGILMILLVLVYILKKIKVL*
Ga0104788_100533 | Ga0104788_10053319 | F089590 | MKDKDMIERVGALWNIALAYGASCWAYFQPVHHLLTVLLIVLIANFLARLAQSVRGWKLRRSRRRRFSFKRWLREVRFTDILKEFALSCFIVMTLCVIYKTLYPIEEEASMILTVTKYGVYIALVGYVMLFLNTIGDAFSDAYLVKVFKAVFKRINVFKMFSFSKNIPDETFDDIKKIADDEVKDKS*
Ga0104788_100533 | Ga0104788_10053321 | F042095 | MAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVIGNPFNTYVQFRSRKPGMANVDWGDGIKEQFPMTKVQGQNDYRIIFRSLAIQYRKNPNTTWWFRKEDGSQYIPVDNHLYADGRSDVQRSVAIDFTCDIYYAEIMTCKMTAFPIVDTPGLESLIVHNTTYANDGIPVDKLSRSKKLTYISLEDVGTRMTVMPEAMTSKTEVYYLNMFNMLDLRNIESSGIRNIKNMKNLQTLNLSSCYLDRYIKEFNDLPKLTSLNITSGPPDMWNYFDINTLPSFEVDKINPNITGFVFLDDWMNGERRTGWNDDNMSGRGLDHLTGFTANHSNSLRMDKLPDYIYEMRAITRFNVNASTHSQKRSDDFVNSFYDLVVGWDQITMTSVAKDGKRNQFYSLSVSMYNAIFPTENQRPSGTEQAPEGFVKGQSNGSPATPMEKIYVLKNNYAQRWTIKPA*
Ga0104788_100533 | Ga0104788_10053328 | F089591 | MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLVKYYLNRKDDETIRSIQNLLVYWMSLDNAGRMDDPESTSFNLPDDYLWFSNIKGVFSYKGCEVTDFVMWEAKNENIHELLGDENNRPSYDYRETFYSIGNGKVVVYESGFRTEEVKMTYYRRPVRVDLSGYINAAGIQSTDIDPELPDYLVEEILDMVAKQFSLNENELQRYQLDKDNVASFK*
Ga0104788_100533 | Ga0104788_10053329 | F089592 | MKEILKSRKVLAEVNGFNIMSDTLYEVVGKHDGSAPQAFQDANIAKAPFPENATHVCCPWDDFSKAYNTGFYPRSRCYNGLDKNEIDKLVKQRVDNIMKPFEEMSQMDLSQTNLEFWDDAKDKIFMGKVYNTANTVDLFYLYLAVFSGMLTPQEMDGDPVFMNSMFCFVEKDNMKDFVQQREINKMNISYKFISALKKGGDDRQAVIDLLLYIGIVTRPDFTEDEYYTGSLSNWMNEKKTNVDYLLDIWDRSLEGDFKEVLEFYRIVNVLQRNGRINMTPSGLQYNGQIIGPDVRTSAEFLATKKDFINIKANVLDEYEEIMSMSNIDDKSKTKKVKDIKKKDDVEEGDKAKEE*
Ga0104788_101173 | Ga0104788_1011734 | F090514 | MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADVLPRLTGANWFVILLTIVPAGSRTRAGSSLPLIPASNIFLKQTPRRGLPLAWNAGAVPFLFCG*
Ga0104788_101391 | Ga0104788_10139110 | F050794 | MLYTSNYFVDVRQRQLLDHTYALSRDQAFDYMTEFNKRLSDKVGIKCTMDILLPTDDDNANIIIEHNGIIKKLMREAEKLELDTDAIKAMMRDLLNELKDDIDLNILIFDVSQLLIKYNLFRLEAITEQEFKGSFVRMDSRNMEIKKLTLSDIKKVVMMMEDRYDYALYMTEEYN*
Ga0104788_101391 | Ga0104788_1013917 | F077313 | MGKYVIKRKIPKYQEAGEVTPIMPGNVVGLQGIGVEPLVSSTQIGFDIQQPDINTIDTSDLSALVDSNKKVDKSGSTDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGAKTANTITTIASGISGIMGLARNVVSGIASEKGTRTNIRLAQEREARQRRQSQMQYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVEQPGEAPMEAMGQKHADGGTPVSLEQGTKVITDDTTIEPDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALEKLKKNDKIDDENTRRLNASVLSKAINDSNDTVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDIAEAKKELLKKVAGIRQKMEIGGTSLFGRKLTFRPIENRFNNDPNYFGYQRQGTDGSYGGINTDERLNYYKTFNPVAYDAYMGASEGTRARALQDAIYGQTSSWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKLGLFTTSRPMIGLDVVTEEQHKALNDAGITHFSQLFSDKNKDVVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGPMDIGDVENPDVKLDMPELIDPNTLPKTNTNAGKSNSGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVEANRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGRYINEVEQGNVAQRTWADNVNSQSWANTYDKNIAQRQAYQQRILQGLAINDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG*
Ga0104788_101514 | Ga0104788_1015142 | F058555 | MMGTKISLLQKMKSNFDKILTEAYIPKDIQAKKDELGCLRLPAGSLVCPVDYKPVTNKDGKKVTAVKYSNKKDNIRGSGMVIEKKCKQVVAYLTIVNVQKHVFLRNRMREGYRDRIEINTDDFIDILSDGIAYFCYRHVIEDCHEDIDYQLKTLKAYAEGEIRIALSDIMIYSYKAKKNEDTKDIFVGKKTSVYKCMNKNLSSDERRNMANKSRKLDRVRILSKIIFRARTRNVHHIYKVTKRKTVKFNVAYLLNELNKKLAGIGMREISQSTIYRYISMFLDMCKKSISDLYEEVVKNNGVVNTKDNNNVTIGHIRASYKGSVLYILISTDYIINVFLGKKSAEMSKAG*
Ga0104788_101514 | Ga0104788_10151420 | F083452 | MSGRIKIKPKNKDKKPDIDVFKVIENRFKNMNELRDLIDMDPRKGLVRIRDGPGFREVERGGCLHRNYLNLLEEELGAKLSIDLIERYIKR*
Ga0104788_101514 | Ga0104788_10151425 | F057001 | MTGKDLNKVQNEVKKTSEKTLTGAVKAWCQLFKSGKEINEILKDNDIKVDKAIVSALVNLAKDKEIVIQLCKEILPRVNNTFCSYKEVEREYYDKNDKDKNKKLKMNEIEDVAILGSFHKRFGYNEPIEFDFGIYYETFNDADKRIVKCAVPIKRYTFSLIAKCITYYLTHPKNDR*
Ga0104788_103135 | Ga0104788_1031355 | F076064 | LKKTIPGRALENLFYLQYDLPPEASFHATTEEAKRPDELYMRKLLPELTRLKLQPRHVVANDEAYYAVMKGVMLFTPEAEKLMLTEDYFSARRQIRLCAPDLKRRNETRRHPMPVLKLY*
Ga0104788_104156 | Ga0104788_1041563 | F105374 | LRPGFGAAENIRYLVLSKGVFAMKKRVALLVALCIWKVVAAQTPYGEMPERFRPDTLPCRLGGGVCFGMDGLDAAIPRGEGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRTRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVARSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP*
Ga0104788_104240 | Ga0104788_1042407 | F055775 | MAQIAQQDNLVIEVTTTAAALDGNTKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTAGDYPKYTIDIINANSGTVAAIALN*
Ga0104788_107204 | Ga0104788_1072041 | F026592 | RRTVCQNRLRTASPQGIAALAVQGGVATLTERSDATFAGKQFSSADRE*
Ga0104788_107460 | Ga0104788_1074601 | F052660 | MCILRVLPENTPEKIGQERAGTEWTVVKSKIRLCIRNRSYGQFLHGGILMGIALPIPSHRAKSHDFACWWPAAAGHS
Ga0104788_108894 | Ga0104788_1088942 | F101356 | VGELIXXXXTDVGELIPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFCVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDNSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ*
Ga0104788_115546 | Ga0104788_1155463 | F026592 | CCLRTASTQGIVALAAQGGVATLTEQSDATFSVVQFSSADRE*
Ga0104788_115926 | Ga0104788_1159262 | F081453 | MSPFFLLAGDNIIEKHISANPVCGTNIKAPEPAVKGTPGRKFM*
Ga0104788_115930 | Ga0104788_1159302 | F029444 | MEVSEQLTGFEPEDLMSWTVSNLRRPFREDFSLEKSGIIAEKGAQIFGRRFVGFDGPKKAAPFFNF*
Ga0104788_127767 | Ga0104788_1277673 | F089053 | ATERSCVSKMEGPRPDKFAFGVRIGRVVDGCRPIKGCIRKWIYGSANGQPELSTAEEVNVKNMEVSL*




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice