NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300006502

3300006502: Human stool microbial communities from NIH, USA - visit 1, subject 764588959



Overview

Basic Information
IMG/M Taxon OID3300006502 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052541 | Ga0100528
Sample NameHuman stool microbial communities from NIH, USA - visit 1, subject 764588959
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size164448925
Sequencing Scaffolds11
Novel Protein Genes30
Associated Families30

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides fragilis1
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Dorea → Dorea longicatena1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM1581
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii2
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F026592Metagenome / Metatranscriptome197Y
F029444Metagenome188Y
F042095Metagenome159N
F047125Metagenome / Metatranscriptome150N
F050794Metagenome145N
F051936Metagenome143N
F055715Metagenome138N
F055775Metagenome138N
F057001Metagenome137Y
F058555Metagenome135N
F064725Metagenome128N
F068855Metagenome124N
F070093Metagenome123N
F073656Metagenome120N
F075481Metagenome119N
F076653Metagenome118N
F077313Metagenome117N
F078005Metagenome117N
F078006Metagenome117N
F078693Metagenome116N
F080673Metagenome115N
F083451Metagenome113N
F083452Metagenome113N
F085718Metagenome111N
F089590Metagenome109N
F089591Metagenome109N
F089592Metagenome109N
F093883Metagenome106N
F102167Metagenome102N
F106193Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0100528_100124All Organisms → Viruses → Duplodnaviria → Heunggongvirae144491Open in IMG/M
Ga0100528_100238All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides fragilis94701Open in IMG/M
Ga0100528_100603Not Available44852Open in IMG/M
Ga0100528_100978All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales26924Open in IMG/M
Ga0100528_102759All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Dorea → Dorea longicatena8076Open in IMG/M
Ga0100528_104248All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM1585049Open in IMG/M
Ga0100528_105349All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales4021Open in IMG/M
Ga0100528_105713All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii3751Open in IMG/M
Ga0100528_107349All Organisms → cellular organisms → Bacteria2923Open in IMG/M
Ga0100528_116862All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1199Open in IMG/M
Ga0100528_127903All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes673Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0100528_100124Ga0100528_10012410F080673MDEIIKLQDEILSYLRNNITKDEAYYILTTDKDMIEVLISDKKDGSKRIKILDMEYTIERDDMLLLFDTDGIIDEYLLTSSHIGINMYFRRQDIRDILSKKLEVMEYRYIKIQVDNIPVVEKRRVILDLTGHRVDRDDRDKIDFMFIYYMARLCE*
Ga0100528_100124Ga0100528_100124140F076653MTIRDKYFGWKDIFFGRFVHCCNEKSDQPQGSNIPLAKINFDNKTGYVEDGTINIAELLQYLWINNKVYGCEYAPIDISSVLQTLIRLTENAKFIFDDQPGIHDMIPYRGFFLRDDFSSGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSILAKELGYDMNSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDSFLGKKYRKDLRTFIYRGIVFFLDRIWHTSLFDRMGVKMKYNAYYCYAATSGIWYDKGFKRRLAKRFNRSLSGGGELFGANLACMVCDRKDIDWEALRFWLEKYDDPTDKGMVNSPIQFMYLYLYYTFNK*
Ga0100528_100124Ga0100528_100124151F083451MIMDKNEREKQVLDLLMSRKDIRKLVEKSNECYSKMDFVGAMKCRQEIKDIVDRESKIMLTKSESLVSLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFVKFDNLDRLMTECKKEIDYLMKGTSKSFQISFAVRSDELREMIENMVGDNIREGYDIFKEEAEMVNETDRSKIEEFNKKLDHD*
Ga0100528_100124Ga0100528_100124171F078005MEKSEFVKKLEKIIDMVKTEDDGFEYGGKVIFYKEDDSNYEVSVMNIEMNLEVEANVMAGMDDMDFTCLMSEVYKQKAVKAIMMEKDDDEDN*
Ga0100528_100124Ga0100528_10012418F057001MISKEINKVQNEIEKSSEKTLTGAIKAWCQLFKSSKEVNEILKENEIKVDKSIVPALVNLAKDKEVVIQLCKEILPRVNNTFCAYKEVEREYYDKNDQDKNKKLKMNEIEDIAILGSSHKRFGYNEPIEFDFGIYYETFNGADKRIIKCAVPIKRYTFNLIAKCVTYYLTHPKNDR*
Ga0100528_100124Ga0100528_10012423F083452MSGRVKIKSKDKDKKPKIDVFKVIESRFKNMNELRDLIDTDPRKGLVRIRDGAGFREVERGGCLHQNYLNLLEEELGTKLSIDLMDKYIKRNI*
Ga0100528_100124Ga0100528_10012427F102167MDINQIKTYLPSGWDVVDLIDHGIIDLDIMNGKMIGEYVAVLMIKSYDKITESHNLTTFSFHDKDMGGLRRLVSNAIMAVGLRNNPLTGDGNTAIK*
Ga0100528_100124Ga0100528_1001243F085718MVIEFDFEIYKPMVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREPELWVVVISYMDNKDKRQKMVLPNFFSRNIGGNIYLQGSSKSNVSYYVGRLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDKATV*
Ga0100528_100124Ga0100528_10012443F058555MGTKIGILHIMKSNFDKILTERYTPRNIQAKKDELGCVKLPAGSLICPVDFKPVTNKEGKKVTAIKYSLKHEEYHGSGIQISDECKMAMIYLIIINVFKHVFLRNRMHGGNRDQIEINTNDFIDILSDGCAYFCYRHVLRDSHEDMNYQLISLKAWAEGEIMIALSDIIKYKHRASKTPRIKDMFVKKGESVYTCLDKNLDSNTRRWMANKSRKLNRVKMLSKIIFSARNRNINKIYKVTKKRTVKFNVSYLMDRLNIKLSKEGMMLISQRTVYRMIKEVLSMCCKTISDLYDEVKKNNGIVNTKDRKNVTIGHLRLSYRGKIMHIIIAEDFIKDVFLGVKGSEMSKAG*
Ga0100528_100124Ga0100528_10012449F093883MMEEDKDIKKEIRDYLKEEADTHIRHWIAIKRESKLLYSDIEDRTKKIALKSSSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFIIQLATSFGMDLDLDTTYASTAKSIIEDRTSGFVFYDDKERLRYADKELEDMFHDMSVTEVSKIGVVQSYELLMKQYNEFKDIIKNNINSI*
Ga0100528_100124Ga0100528_1001245F078006MDNTLKRAAAELKEAGCRVFAWQDDTYNRSWSKGDYIMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSRGSGSGCCVKEEATFDLATALDVLNGPLPRWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI*
Ga0100528_100124Ga0100528_10012456F089592MKEILKSKKVLVEVNGFNIMSDTLYEVVGKHDESAPQAFQDANIAKAPFPENATHVCCPWDDFSEVYNTGFYPRSRCYNGMDKDEVDKLVDQRVNNIMKPFENISQKDLSQTNFEFWDDAKDKIYMGKVYNTANTVELFYLYLAVFSGMLTPQEMDGDPIFMNSMFCFIEKDNAKDFVQQREINKMNISYKFIDALKKGGKERQAVIDLLLYIGIVTRPDFTEDDYYTGSLSNWMNEKKTNIDYLLDIWDRSLEGDFKEVLEFYRIINVLQRNGRINMTPSGLQYNGQIIGPDTRTSAEFLATKKDLISVKVNVLDEYEELMSISNIDDKTKIKKVNDVKKKEDVDEGDKVNTEE*
Ga0100528_100124Ga0100528_10012457F089591MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLIRYYLNRKDDETIRSIQTLLVYWKSLNKINHIDDPESTSFGLPDDYLWFSNIKGAFSYNGCEVGDFVMWEAKNENVHELLGDDNNRPSFDYRETFYTIGDGKVVVYEDGFRTDEVRMTYYRNPVRVDLAGYINAAGERSTDIDPELPDPLVEEILDMVAKQFNLNENELQRYRFDKDNVASFR*
Ga0100528_100124Ga0100528_10012463F042095MAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVVGNPYNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGQDNYRIIFRSLAIQHRKNPNTTWWFRKEDGSQYVPIDNHAYADGRKDVQRAVSIDFTCDIYYANIQVCKMTSFPIVDIPGLEFLIVSHTLYVNDGIPVDKLSRSKKLIYIDLQNVGQRMTVIPEAIISKTEVYYLNMFNMLDLRDIESSGIRNIKNMKNLQALELSSCYLDRYIKEFNDLPKLTSLKIHPGPSDMWNYFDINTLPFFEVDKINPNITVFYFLDDWVSGERRTGWNDDNMSGRGLEHLTSFIAAHSNSLRMDKLPDYIYEMRAITWFNVHASTHSQKRSDDFVNSFYDLVVGWDQITMTSVAKDGKRNQFYSLSVSMYNAIYPTENQRPSGTEQAPEGFVKGQSNGSPATPMEKIYVLKNNYAQRWTIKPE*
Ga0100528_100124Ga0100528_10012465F089590MKDKDMIERVGALWNIALAYGASCWAYFQPVHHLLTVLLIVLIANFLARLAQSVRGWKLRRSRRRRFSFKRWFREVRFTDILKEFALSCFIVMTLCVIYKTLYPIEEEASMILTVTKYGVYIALVGYVMLFLNTIGDAFADAYLVKVFKAVFKRINVFKMFSFSKNIPDETFDDIKKIADDNFK*
Ga0100528_100124Ga0100528_10012468F075481MRLRISLRAVFCLGLSLSLSSCGSRRQVSETSIDSRLISRIETMIDEVMDRRIVEIKTSDLNADIVITERKFDTDKDIDPATGERPVSSVTDAHIVIGRRDSTVTADSLGVNKTRNDIKDLDNKTNIKSKDVDDRKESRWPIVWIVAGILMILLVLVYILKKTKIL*
Ga0100528_100124Ga0100528_10012471F106193VGCNTCREKALRAERERIERSMMNHSSSTVVSDREYASRSTAGCMVMQDPLQTMERDVVSIYRQVRTKGDGVGVSYLNMQKKIREWIKNLPYGCPPDEEVQEMRKEILDGRSKHIKP*
Ga0100528_100124Ga0100528_10012488F064725MTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAVMSEAVVKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITERIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYDWDNYDITISGNTMYTGFDGFIYIIYRATPKDDDGLPYIPETALGYLEDYVETYIKMKIFENAAVNGLIQGAGDAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSYERMWPNAFDKYIKFI*
Ga0100528_100124Ga0100528_10012496F077313MSKYVIKRKIPKYQEAGEVGSYMLGNMDGIQGLGIEPLVNTNQGLPAPVNPLGIYSLDTPDQLRTKYANAFDQDNVFPASFKGSLQRIAENYQDNGITLNNITVNDVDKSKTGSGETDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGAKTANTITTIASGISGIMGLARNVVSGIASEKGTRTNIRLAQEREARQRRQSQMQYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVTQPGEAPMEAMGQKHADGGTPVSLEQGTKVITDDTTIEPDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALEKLKKNDKIDNENTRRLNASVLSKAINDSNDTVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDIADAKKELLKKVAGIRQKMEKGGSSLFDYLLTFRPVENKYNNKDNTFGYQRQGQDGSYGGINTDERLEYYKTFMPLAYDAYMSAPKATAAKALQDAIYSTTGGWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKASYHDRMRDNKFGQYSSSRPMIGLDVVTEEQHKALNDAGVTHFSQLFSDKNKDVVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGPMDIGDVEEPDVKLDMPELIDPNTLPKTNTNAGKSNSGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVETNRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGRYINEVEQGNVAQRAWADNVNARTWADTYDKNIAQRQAYQQRILQGLAINDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG*
Ga0100528_100124Ga0100528_10012499F050794MQDLRIQRVKVLMMLYTSNYFVKVRQKQLLDHTYALSRDQAFDYMTEFNKRLSDKVGIKCTMDILLPTDDDNANIIIEHNGIIKKLMKEAEKLELDTDAIKAMMCDLLDELKDDIDLNILIFDVSQLLIKYNLFRLDAITEQEFKDSFVRMDSRNMEIKKLTLSDIKKVVEMIETRYNRFVW*
Ga0100528_100238Ga0100528_10023886F051936QEGTGQVLFFPLALRGKAFGFSVLQEHAVMTPVISISKKMLIVRHLSIHIYIKK*
Ga0100528_100603Ga0100528_10060354F055775MAQIAQQDNLVIEVTTTAAALDGDTKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTTRDSPKYTIDIINANSGAVEAIALN*
Ga0100528_100978Ga0100528_10097827F029444MEVSGQLTGFELEDLKFWTVSNLQRPFREDFSLKKSGIIAEKESQIFGRRFVGFDGPKKAAPFFNF*
Ga0100528_102759Ga0100528_1027591F070093NFESLLLAEFCPFRIDPPMEPEQERLSKLYSGSGIHSLPKLNPN*
Ga0100528_104248Ga0100528_1042486F068855VPIISKCPSSSVPTVEAQGPQVRKSLALQGLQAEKERENANVQNAGWLHSFDADYHYRGSDLHYHVNVSAALSEGGATW*
Ga0100528_105349Ga0100528_1053492F055715LTKANEFDKINELLIERTAKKFERVSKNKLKKFLTNEKFCDKINELIQVGTAEILDN*
Ga0100528_105713Ga0100528_1057133F047125MDVVLLLMVLGVMLSGFWAADALDHIRREIIRQEGKRRGWWS*
Ga0100528_107349Ga0100528_1073493F073656MTMEQDQEQMQGALYVAVDDDNKIIAMERSRRGDEGFRALLDEFTDYAANCGAIPSVLFFDIRTADAALLPRIEAAEHRYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVILEDLRRCRAVETERGTVLFTQERDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWAFVDPQHGESLYVSNGIRDLTGLQIIMQRGYGYLMQNAEKYWNREFVFRSGFDNIERKYASDLSDEGRTAKREEQYNLAAYILDRKFPIRRQPSAEIPPMQAEGIRTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGTEFHLKRH*
Ga0100528_116862Ga0100528_1168622F078693MVLAPVRGAERFFIKANCSLLMSKENQKTTSDFDALDPRERGCSPLSDPEGVVETEKS*
Ga0100528_127903Ga0100528_1279031F026592CLRTASPQGIAALAAQGGVATLTERSDATFSVGQFSSADRE*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.