NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007641

3300007641: Human stool microbial communities from NIH, USA - visit number 3 of subject 159227541 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300007641
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052764 | Ga0105527
Sample Name: Human stool microbial communities from NIH, USA - visit number 3 of subject 159227541 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 133576380
Sequencing Scaffolds: 12
Novel Protein Genes: 34
Associated Families: 33

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 1
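
The lineage strings above use " → " between taxa, so they can be split programmatically. The following is a minimal Python sketch, assuming a lineage has been copied from this page as a plain string; `parse_lineage` is a hypothetical helper name and is not part of NMPFamsDB or IMG/M.

```python
def parse_lineage(lineage: str) -> list[str]:
    """Split an arrow-delimited lineage string into its taxon names."""
    return [taxon.strip() for taxon in lineage.split("→") if taxon.strip()]


# One lineage copied from the Dataset Phylogeny table.
lineage = (
    "All Organisms → cellular organisms → Bacteria → Terrabacteria group → "
    "Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium"
)

taxa = parse_lineage(lineage)
print(taxa[-1])   # the most specific taxon assigned to the scaffold
print(len(taxa))  # number of levels in the lineage string
```

The last element is the most specific assignment; shorter lineages (for example, ones ending at "Bacteria") simply yield fewer elements.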

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F029444 | Metagenome | 188 | Y
F032312 | Metagenome / Metatranscriptome | 180 | N
F042095 | Metagenome | 159 | N
F044555 | Metagenome / Metatranscriptome | 154 | N
F047755 | Metagenome | 149 | Y
F050794 | Metagenome | 145 | N
F051936 | Metagenome | 143 | N
F056682 | Metagenome | 137 | Y
F057001 | Metagenome | 137 | Y
F058555 | Metagenome | 135 | N
F064725 | Metagenome | 128 | N
F067720 | Metagenome | 125 | Y
F073656 | Metagenome | 120 | N
F074899 | Metagenome / Metatranscriptome | 119 | N
F075481 | Metagenome | 119 | N
F076064 | Metagenome | 118 | N
F076190 | Metagenome | 118 | Y
F077313 | Metagenome | 117 | N
F078004 | Metagenome | 117 | N
F078005 | Metagenome | 117 | N
F078006 | Metagenome | 117 | N
F080673 | Metagenome | 115 | N
F083451 | Metagenome | 113 | N
F083452 | Metagenome | 113 | N
F085718 | Metagenome | 111 | N
F089054 | Metagenome | 109 | N
F089591 | Metagenome | 109 | N
F089592 | Metagenome | 109 | N
F093883 | Metagenome | 106 | N
F094005 | Metagenome / Metatranscriptome | 106 | N
F098313 | Metagenome | 104 | N
F105375 | Metagenome | 100 | N
F106193 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0105527_100038 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 168149
Ga0105527_100050 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 157272
Ga0105527_100105 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 116246
Ga0105527_100112 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 111819
Ga0105527_100947 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 27214
Ga0105527_101106 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium | 23300
Ga0105527_102150 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 9883
Ga0105527_103285 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 5553
Ga0105527_106626 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2244
Ga0105527_108768 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1647
Ga0105527_113994 | All Organisms → cellular organisms → Bacteria | 994
Ga0105527_119220 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 718

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0105527_100038 | Ga0105527_100038195 | F089054 | MLTKGKFLVSFEVPGHTKEYTEGFTEEMVIPYRTEELRPYLRYPNQEINNNHLHSEHIRLQIREMLQIPLRDITIIDIISLP*
Ga0105527_100038 | Ga0105527_10003879 | F105375 | MKNNETFQTTQHLDKLVINLGLQIQELFSLDLEEILDYSNNLMNLLVNAYVENQCLALSAMISKQDGFAIYSFLFQTPDTSNGAADALVSFAMNFTDGEANIKSINRISSNIMQITFTV*
Ga0105527_100038 | Ga0105527_10003889 | F044555 | MKTTNPSSRITISQNGNQILTCKVYKEPNYILSMSNEEILELISGLDYMGNLPTVPDLEKPIQIQVSTTRQIPLEQNKEVQTKIKEIIYNNLYDTLIDELKGTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHDK*
Ga0105527_100038 | Ga0105527_10003895 | F032312 | MARIKDYDEDLSAPKLLRERARDSKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDALVCLGFTWFKLLGVALMMLLWPIVFIYALNDGIEGYPFKKYAIPYIFILVAWFIISLYGLVS*
Ga0105527_100038 | Ga0105527_10003896 | F094005 | MKKEVIKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLEGQFTRLDGKGNAIILPCTTENEQKYNSFVAYHQSKKSLEAIKKWLDDNGKHKDDETTEKVITLDKKLKKLIEKLNE*
Ga0105527_100050 | Ga0105527_100050104 | F085718 | MVIEFDFEIYKNGDYDKVYLRNGKEPRVLCDNGKGNSPMVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREPELWVVVISYMDNKDKRQKMVLPNFFSKNIRGNIYLQGSSKSNVSYYVGRLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDEATV*
Ga0105527_100050 | Ga0105527_100050106 | F078006 | MGDNILRKAADELKKAGCRVFAWQDDTYNRGWSKGDYTMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSCGSGSGCHIKEEATFDLETALDVLNGPLPRWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI*
Ga0105527_100050 | Ga0105527_100050112 | F080673 | LATIEINMDEIELLKLQNEALSYLRDNITKDEAYYILTTDKDIIEILIADKKDGSKRIKILDMEYTIEKDDMLLLFDTDGIIDECLLVASYIGVNMYFRRQDVNAILNNINREKVMKYPYIAIQLDNIQTIEKRRVIFEITGHRVDYDKVDFMFVYFMARIL*
Ga0105527_100050 | Ga0105527_100050118 | F057001 | MTSKDINKVQNEVKKASEKTLTGAVKAWCQLFKSGKEINEILKDNDIKVDKAIVPALVALAKDKETVIQLCKDILPRVNNTFCAYKEVEREYYDKNDKDKNKKLKMNEIEDIAILGSSHKRFGYNEPIACDFGIYYETFNGADKRIIKCAVPIKRYTFNLIAKCVAYYLTHPKNER*
Ga0105527_100050 | Ga0105527_100050124 | F083452 | MSGRVKIKIKDKKPKIDVFKVIENRFKNMNELRDLIDMDPRKGLVRIRDGAGFREVERGGCLHRNYLNLLEEELGAKLSIDLIERYIKR*
Ga0105527_100050 | Ga0105527_100050143 | F058555 | MGTKIRFLHIMKSNFDKILTERYIPRNIQTKKDELGCVKLPAGSLICPVDFKPVTNKEGKKVTAIKYSLKHEEYHGSGIQISDECKMAMIYLIIINVSKHVFLRKRMQDGNRDQIEINTNDFIDILSDGCAYFCYRHVLRDSHEDMNYQLISLKAWAEGEIMIALSDIIKYKHKASKTPRIKDMFVKKGESVYTCLDKNLDSNTRRRMANKSRKLNRVKMLSKIIFSARNRNINKIYKVTKKRTIKFNVSYLMDRLNIKLSKEGMMLISQRTVYRMIKEVLSMCCKTISDLYDEVKKNNGIVNTKDRKNVTIGHLRLSYRGTIMHIIIAEYFIKDVFLGVKGVEMSKAG*
Ga0105527_100050 | Ga0105527_100050148 | F093883 | MEEDKDIKKEIRDYLKEEADTHIRHWIAIKRESKRLYSEIEDRTKKIALKSSSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFIIQLATSFGMDLDLDTTYASTAKSIIEDRTSGFVFYDDKERLRYADKELEDMFHDMSVTEVSKIGVVQSYELLMKQYNEFKELKENATGKTKTDE*
Ga0105527_100050 | Ga0105527_100050154 | F089592 | MKEILKSRKVLAEVNGFNIMSDTLYEVVGKHDGSAPQAFQDANIAKAPFPENATHVCCPWDDFSKAYNTGFYPRSRCYNGLDKNEIDKLVKQRVDNIMKPFEEMSQMDLSQTNLEFWDDAKDKIFMGKVYNTANTVELFYLYLAVFSGMLTPQEMDGDPVFMNSMFCFVEKDNMKDFVQQREINKMNISYKFISALKKGGDDRQAVIDLLLYIGIVTRPDFTEDEYYTGSLSNWMNEKKTNVDYLLDIWDRSLEGDFKEVLEFYRIVNVLQRNGRINMTPSGLQYNGQIIGPDVRTSAEFLATKKDFINIKANVLDEYEEIISMSNIDDKSKTKKVKDVKKKEDVGEGDKEE*
Ga0105527_100050 | Ga0105527_100050155 | F089591 | MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLVKYYLNRKDDETIRSIQNLLVYWMSLDNAGRMDDPESTSFNLPDDYLWFSNIKGVFSYKGCEATDFVMWEAKNENIHELLGDENNRPSYDYRETFYSIGNGKVVVYESGFRTEEVKMTYYRRPVRVDLSGYINAAGERSTDIDPELPDPLVEEILDMVAKQFNLNENELSRYRMDKDNVASFK*
Ga0105527_100050 | Ga0105527_100050162 | F042095 | MAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVVDNSYNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGENNYRIIFRSLAIQHRKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYANIQVCKMTSFPIVDMPGLEFLVVSHTLYVNDGIPVDRLSRSKKLIYIGLSNMGQRMTVIPEAIISKTEVYYLNMFNMLDLRDIESSGIRNIKNMKNLETLDLSSCYLDRYIKEFNDLPKLKTLNITPAPSDMWNYFDINTLPFFEVDKINPNITDFYFLSDWMSGERRTGWNDDNMSGRGLEHLTGFIAANSNSLRMDKLPDYIYEMRAITWFNVNASTRSQKRSDDFVNSFYDLVVGWDQITMTSVAKDGKRNQFYSLSVSMYDAIHPTENQRPSGTEQAPEGFVKGSSNGSPATPMEKIYVLKNNYAQRWTIKPE*
Ga0105527_100050 | Ga0105527_100050167 | F075481 | MIRLRISLKAVFCLGLSLFLSSCGSRRQVSDTSIDNRLISRIETMIDEVMDRKIVEIRTSDLNADIVITERKFDTTKEVDPSTGERPVSSQTDAHIVIGRRDSTVTVDSLGIDKTITGVKDIDKKTDIEHKDVDDKKESRWPIAITSISVLLILLVLIYLLKKMKVL*
Ga0105527_100050 | Ga0105527_100050174 | F106193 | VGCNTCKEKALKAERERIERSMMNRVSSTVISDREYASRSTAGCMVMLDPLKTMERDVVSIYKQTRTIGDVGIVYLNMQKKIREWIKNLPYGCPPDEEVQEMRKEILDGRSEYIKP*
Ga0105527_100050 | Ga0105527_100050190 | F064725 | MTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAVMSEAVVKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITERIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYDWDNYDITISGNTMYTGFDGFIYIIYRATPKDDDGLPYIPETALGYLEDYVETYIKMKIFENAAVNGLVQGAGDAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSHERMWPNAFDKYIKLI*
Ga0105527_100050 | Ga0105527_100050198 | F077313 | MSKYVIKRKIPKYQEAGEVGSYMLGNMDGIQGLGIEPLVNTNQGLPAPVNPLGIYSLDTPDQLRAKYANAFDQDSVFPASFKGSLQRIAENYQDNGITLNNITVNDVDKSKTGSGETDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGAKTANTITTIASGISGIMGLARNVVSGIASEKGTRTNIRLAQDREARQRRQSQMQYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVTQPGEAPMEAMGQKHADDGTPVSLEQGTKVITDDTTIEPDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALEKLEKNNKIDDENTRRLNASVLSKAINDSNDIVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDVAEAKKELLKKVAGIRQKMEIGGTSLFGRKLTFRPIENRFNNDPNYFGYQRQGTDGSYGGVNTDERLNYYKTFNPVAYDAYMGASEGARARALQDAIYGQTSSWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKLGLFTTSRPMIGLDVVTEEQHKALNDAGITHFSQLFSDKNKDVVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGPMDIGDVEEPDVKLDMPELIDPNTLPKTNTNAGKSNSGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVETNRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGRYINEVEQGNVAQRAWADNVNARTWADTYDKNIAQRQAYQQRILQGLAINDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDAKIGPNGQLIADPQGDILSYRILYPAQEVTKGKKG*
Ga0105527_100050 | Ga0105527_100050201 | F050794 | MLYTSNYFVKVRQKQLLDHTYSLSRDQAFDYMTEFNKRLSDKVGIKCTMDILLPTDDDNANIIIEHNGIIKKLMKEAEKLELDTDAIEAMMRDLLDELKDDIDLNILIFDVTQLLIKYNLFRLEAITEQEFKNSFVRMDSMNMEIKKLTLSDIKKVVEMIEDRYSYALYMTEEYG*
Ga0105527_100050 | Ga0105527_10005053 | F083451 | MYKSEKEKQILDLLMSRKDIRKLVEKSNECYSKMDFVGAMRYRQEIKDIVDRESKIMLTKSESLIGLMNNADNEYKFNMLVWLHSMMCMADVFSGILEDFKDGVRKANGNSKFVKFDNLDRLMAECKKEIDYLMKGTNKSFQISFAVRSDELREMIENMVGDNIREGYDMFKEEAKMTKETDRSKIEEFNKKLDHDQM*
Ga0105527_100050 | Ga0105527_1000507 | F098313 | MREMEFDFVIYPLKLIITVGLDYKTLCDRFENMEPEHEGKWGDEDDMDKEASFANLVGDRDDDDKFAILWNFSSDDDLIMRNICHESFHIAMSVCQFCNMSLGFKVGEDEHAAYIAGFAGDCVSEFINSKNTD*
Ga0105527_100050 | Ga0105527_10005072 | F078005 | MKKSKFVKELEKIIDMVKAEDDGFEYGGKVIFYKEDDDNYEISVKNIEMDLTVEANTMASMDDRTFACLMSEVYKQKFTKAITISEDEDDEDN*
Ga0105527_100068 | Ga0105527_10006830 | F078004 | MSTPFYNNIIFFQITVFYVPAVSRRIRSLALLSAVSEHLSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDIHMIRHTLVIAVINTFYRLTVDTDGMAWMRQGITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF*
Ga0105527_100105 | Ga0105527_10010511 | F051936 | MKQEGTGQVLFFPLALRGKAFGFSVLQEHAVMTPVISIFFVMLIIRYLSIHISIKK*
Ga0105527_100112 | Ga0105527_100112100 | F029444 | MEVSEQLTGFELEDLKFWTVSNLQRPFREDFSLKKSGIIAEKESQIFGRRFVGFDGSKKAAPFFNF*
Ga0105527_100947 | Ga0105527_10094724 | F073656 | MTMEQEQDQEQTQAALYVAVDDGNKIVAMERSRRGDEGFRALVAEFTDYAANRGEIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCCKYGLDPLAEGNCSNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGTVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWAFVNPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRSGFDNIERKYASDLSDEGRAAKREEQYNLAAYILDRKFPIRRRPSSEIPPMQAEGIRTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGTEFHLKRH*
Ga0105527_101106 | Ga0105527_10110622 | F047755 | MESKKVKYLIDIINDMDIKDKLRLAICMSKSEWYGLIYNTKENYKKFDTMLKEIDEEYRTTLINFGQSKYFNINFAMAKLMEMETTEQNKVTLYLYNCIDKVNIYL*
Ga0105527_102150 | Ga0105527_1021503 | F074899 | MSGRKQWNKQAATSSFLDKKPLESLILQGMEGSTTLGKDEVGSSNLPSSSK*
Ga0105527_103285 | Ga0105527_10328510 | F067720 | MASTTYEHFVDINKMSAAQEQFRHITKMVTKCHRFAVLVDMVRNAGQLPQPFWLGAACGGGSCSAATVPART*
Ga0105527_106626 | Ga0105527_1066263 | F056682 | VAEKPHKQNTMKMQSRAGKVANQPIGQSKTHSASFGAFSSKNRITFPIQELGKIYENQEVL*
Ga0105527_108768 | Ga0105527_1087681 | F029444 | MDVSGQLTGFELEDLMTWTVSNLQRPFREDFSLEKSGIIAEKESQIFGRRFVG
Ga0105527_113994 | Ga0105527_1139941 | F076190 | SITISVKNVFMAASRVSGRVWSLVRITIPNDLKFVKIRVEKPQNNSLYPHFQREPL*
Ga0105527_119220 | Ga0105527_1192201 | F076064 | LKKTTPGRALENLFYLQYDLPPEASFHATTEEAKRPDELYMRKLLPELTRLKLQPRHVVANDEAYYAVMKGVMLFTPEAEKLMLTEDYFSVRRQIRLCAPDLKRRNETRR
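
In the table above, each protein ID appears to be the scaffold ID followed by a gene number, and most sequences end in "*", which conventionally marks a stop codon in translated sequences. The Python sketch below relies on both of those readings as assumptions rather than documented NMPFamsDB conventions. It takes three rows from the table, already split into their four fields, and reports the gene number, the residue count, and whether a terminal "*" is present; `summarize` is a hypothetical helper name.

```python
# Three rows copied from the Sequences table:
# (scaffold ID, protein ID, family, sequence).
rows = [
    ("Ga0105527_100105", "Ga0105527_10010511", "F051936",
     "MKQEGTGQVLFFPLALRGKAFGFSVLQEHAVMTPVISIFFVMLIIRYLSIHISIKK*"),
    ("Ga0105527_102150", "Ga0105527_1021503", "F074899",
     "MSGRKQWNKQAATSSFLDKKPLESLILQGMEGSTTLGKDEVGSSNLPSSSK*"),
    ("Ga0105527_108768", "Ga0105527_1087681", "F029444",
     "MDVSGQLTGFELEDLMTWTVSNLQRPFREDFSLEKSGIIAEKESQIFGRRFVG"),
]


def summarize(scaffold: str, protein: str, family: str, sequence: str) -> dict:
    """Derive simple per-protein facts from one table row."""
    if not protein.startswith(scaffold):
        # Assumption: protein ID = scaffold ID + gene number.
        raise ValueError(f"{protein} does not start with {scaffold}")
    return {
        "family": family,
        "gene_number": protein[len(scaffold):],
        "residues": len(sequence.rstrip("*")),  # length without the stop marker
        "has_stop": sequence.endswith("*"),
    }


summaries = [summarize(*row) for row in rows]
for s in summaries:
    print(s)
```

A row without the terminal "*" may correspond to a gene that runs off the end of its scaffold, but that interpretation is not stated on this page.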




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice