NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008132

3300008132: Human stool microbial communities from NIH, USA - visit 1, subject 159207311 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008132
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052974 | Ga0111230
Sample Name: Human stool microbial communities from NIH, USA - visit 1, subject 159207311 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 189412978
Sequencing Scaffolds: 23
Novel Protein Genes: 25
Associated Families: 25

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 12
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 2

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F039198 | Metagenome | 164 | Y
F042910 | Metagenome | 157 | N
F042936 | Metagenome | 157 | N
F043413 | Metagenome | 156 | N
F043945 | Metagenome | 155 | N
F050793 | Metagenome | 145 | N
F051934 | Metagenome | 143 | N
F056623 | Metagenome | 137 | N
F057385 | Metagenome | 136 | N
F058154 | Metagenome | 135 | N
F059106 | Metagenome | 134 | N
F068856 | Metagenome | 124 | N
F070133 | Metagenome | 123 | N
F073573 | Metagenome | 120 | N
F073656 | Metagenome | 120 | N
F074898 | Metagenome | 119 | N
F075480 | Metagenome | 119 | N
F078004 | Metagenome | 117 | N
F087213 | Metagenome | 110 | N
F088914 | Metagenome | 109 | N
F090484 | Metagenome | 108 | N
F090513 | Metagenome | 108 | N
F090514 | Metagenome | 108 | N
F091068 | Metagenome | 108 | N
F099406 | Metagenome | 103 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111230_1000019 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales | 139750
Ga0111230_1002011 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 12563
Ga0111230_1002542 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 10465
Ga0111230_1002841 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium | 9536
Ga0111230_1003701 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 7738
Ga0111230_1003759 | All Organisms → cellular organisms → Bacteria | 7654
Ga0111230_1003936 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 7362
Ga0111230_1004147 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 7009
Ga0111230_1004712 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 6283
Ga0111230_1005051 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 5909
Ga0111230_1005788 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 5269
Ga0111230_1006157 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 4997
Ga0111230_1006807 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides | 4582
Ga0111230_1008106 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 3896
Ga0111230_1010510 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3080
Ga0111230_1010969 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2962
Ga0111230_1015694 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2123
Ga0111230_1020349 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1666
Ga0111230_1032870 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1050
Ga0111230_1033951 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 1017
Ga0111230_1035991 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 960
Ga0111230_1043849 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 790
Ga0111230_1052492 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 653

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111230_1000019 | Ga0111230_100001958 | F090484 | MKLQLGRNINISLRLLEQWSDDPLFMELYALYCMIKISRRDSRIRFKNQKDLLHKLGIGYSKFKNMTGHPMFDELFRMTDSTFVARRYRVNGVQLTLGCGKVNIPKNRILIKIKKNEITNHEKVLDRIREAMFVNLVRNNESVLNSGETNSQAEVVDGSHSYYGLIDSTISNKTIALYLNVGLTKAKEIVSVAIQDKLVKRFENIQFITYVDNPRAYIEANEHNYPIGKLIPVYRHGAVFWQIANTWTLYKKGATNRWYFGEKDIEKGEKEKVSKKDDFNFFLKDNTHILRFLNAEEVVSEDGEILGIDRKKTKEEEARSLASVMAKEAHKDFWDGYERSTQNQIIRKYYRAIIAEDKKRRMDMFLNCLKQSYDKVSGWSKEKVATVKTGMADAEACCAEVGTSVAGVCGRVSRRMKSYNNTAPDKKAGFNEVRDMYAEFAGEMAKAVGSVSEDIYTYVKAEQFKEKIENMDISIKSLPNINTTVDNDKELDGESVFKDIPFEELSFYNDTYLYPSSQYSSL*
Ga0111230_1000019 | Ga0111230_100001977 | F099406 | MAEIGYNSKFEGQEVDSRLENVVQAAPGTGSESGKGGLIPAPPAGSQDGSKTLLSNMTWGDHVTKQYIDDAVSAAGWKKQIVSKLPTVEEAKDNVMYLVKDDVASKETKNVYNEYILVTEEGGTKVLESLGMVSTGVDSSYLDLSIFPRTSGTLDEDSYAKVLNAYNNNITLGKLSFYYFSLDYFSDNDNSELKIIAVLFNNTNSKEDVSGSYIDIEMVTYVVSQDKTYRAIANTATLSNDMLSYLKFMAKTPNVVTTLASLPIDAHNIIANVASATDLSMAVSAEDAGREWQVRVNNTTGTDITQPLPTSGLFQSMSGDSVVVPKNSFIELSIWYINDKLVIRVGEQA*
Ga0111230_1002011 | Ga0111230_10020114 | F073573 | MAAIRNKNKKHLPQGGFTMPTIVSFYQRFPNEAPPLNLSAFDHTGYTYAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMPFQVRRMELRTGDTTYAILPDNVQQYTKRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLPAAQKKAIRRFCMDCEESAITYQPAFLWYKDYCTKA*
Ga0111230_1002542 | Ga0111230_10025426 | F043413 | MTERKLREKLQSAYGTMPDATRAAFEHSLTHHRVETRHPVRMSRRMRTIVTLALMLMMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVTLNEAYDGDLLTLSVNDLVFDGTTVEVAMNVQPKAGKQVFMDMEVTAECEGRAYTLEIEGCGGGDFMSGMFLPDETDSWSDSPFGFDGVLWDDDMQSPPESVPIAWTMTFELLQPVWEVEYLPDAQYQALLAGGADTLEAYVQNNWRKHIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRRADTIIVQFTTDFADDYAHPELVGKRIGMGDYDLVIDNVNLSFMRANITMHYEFSAAYTEDEICQMTNLPNAWRVYVNGETDSGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGEKWDACHPDAEKGFTLELQE*
Ga0111230_1002841 | Ga0111230_10028417 | F051934 | MGLMRRLAAAALAVVLLLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQSTTLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQSIADLAAQENDGRLENTALITCAEEIARLSEEDRALYYYIEAFGLESGTDADIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNADVSFSLPASYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY*
Ga0111230_1003701 | Ga0111230_10037017 | F050793 | VLVPENVEMADIHAIQTRVKKHNLLTVVLLIPLFAGCALRESHETLGGIVFVLSMVLMVLGFVSGRGDEKRLPAYWAKYQQIHRMDSLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLADVREYATQEGYEPMDVDFGQMTDEEFQQFLDAHRI*
Ga0111230_1003759 | Ga0111230_100375910 | F058154 | MKNRTFQRLFPFLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVAMLGDT*
Ga0111230_1003936 | Ga0111230_10039361 | F043945 | HAPDHAMDWPPASGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLNGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAQYGLYVIAKDKTDEYQIVRFDVDTGKGEMMEFALEDEEDSHLLTDMPGIFQWSGTSSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNSAMYFVHDNTVWVMDYDEETSEFGEPHGIDKLPFFAEDTMCGFYWRGFYLIQTHDGLYLCHTGDETARAYLYGEVE*
Ga0111230_1004147 | Ga0111230_10041473 | F091068 | MNEKNQQVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFARAMAKIDRKAKRQKHRTVLHMLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMQPAKEEQTANVPRMDVPDGWEGLFFPTFLPDSLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWAEGKRMLRVTSIGLPADEAVLVAQSVKKIFAE*
Ga0111230_1004712 | Ga0111230_10047121 | F059106 | MHVIAWKIRVILLQTFAWIVDLKVRERLTEYLKKDIKYHRVIIGVLV*
Ga0111230_1005051 | Ga0111230_10050516 | F039198 | MQITNTEMQSKKYVQIYLTEEELQKKETKDLVQKYKKEKHSVAIFVTGKENYPEILKKIVTKQVELNKNVC*
Ga0111230_1005788 | Ga0111230_10057882 | F070133 | MKEEKRMKRLLGLLLAVMVMMGGMAGAQASTDNASMQLIRMNPLAFRKEPVELYSVTHTPNGSFVVIYFAEGEKTELQEMWMELFDSVGTSLLSAKLGEFDPNGEQIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTTKKLKFGAAPYAQHVGDYMVEKQAHSEDETPFRTVKITHIASGKSKKLMIYDWSFCAFPDQDGNLLIAQQNEKGNLEIRSYNAAMQESIVELSGDFLQNENVRDAACIGQTAYMRIRLTNEKSEILLYDITQQKITDSQTLLAVDDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLPLQHESCLYIFTDVEQADVTTIEMDEKSHSYFVCSYSISAGE*
Ga0111230_1006157 | Ga0111230_10061573 | F057385 | MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTAMAAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIKEKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEASANLGSMTEEQLMEELTQADTTAFMTMMNEILSGAEMAEVTEQPGDCDAAKNYVKVTVPPEKIAEMTKALLEMIHSVPSIGAYMDALFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPMSFTMTRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDSVEINVSGKGVGTDSSVVDVEIKETVDGVEQGFGVVVTTATTMDGEQGVRKVDVLVRFMGLDVVTITAETRTCDAKDALDVSKAQDLGAMTDSEFQTWFVKVMNNLQNLPMTLLMSLPESMLTLLMGGSN*
Ga0111230_1006807 | Ga0111230_10068076 | F073656 | MTMEQEQDQEQTQAALYVAVDDGNKIVAMERSRRGDEGFRALLDEFTDYAANRGEIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGAVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEVDPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWAFVDPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRSGFDNIERKYASDLSDEGRAAKREEQYNLAAYILDRKFPIRRRPSSEIPPMQAEGIRTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGAEFHLKRH*
Ga0111230_1008106 | Ga0111230_10081061 | F075480 | NQFGEEHRFRLIFGSAPDKPHFCYPAERARAAHTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGVYSPENGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLVPVRMEIRGSEQFPNIPMFGDFLTQNPDGTVSVGFQVDLSEEDVSHLKSCEVQLECRVGAFGKDGKATQWQKEILTATITFK*
Ga0111230_1010510 | Ga0111230_10105102 | F068856 | VKDSGASAEGNALAAAHAVLGKERKGMKRLTSILLALLMLVGMALAEETPDAALGDWYALPSDETVLRLTLREDGTFFFGTEGISGIEGKWRKTTDGEYNLAYTNRSSSLLDVIMSMVDSQAPAPDMTMTARLTESGLDVFYGSTAEGAVVHMARDAEELRTERTPRTDTPLEAFAGTWTMETMFLGTMQLTYTPEMGERQVFCTIDGLTMFPGAGLESFPEGTSFPLTFEDGVLRTTIPLTVQMAASSALVKEIVVDYDLTFFQTADGSLYATLRLSDVPDNPTTMFLLVPMEKE*
Ga0111230_1010969 | Ga0111230_10109695 | F087213 | MKKFFSLFLAVLLIFSCWTAATAESIDAVSGAPLDNDIEIVYKEHFDDMVTHYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADTALDARLQAILAACGLNPEDYDISVIRNLSGMPEPITGTNWYCTLIRKGVEVAEDETNPYDMVIVLYGDEMTVGAFVLNPEV*
Ga0111230_1013285 | Ga0111230_10132851 | F078004 | FLFRDLLFRKSSTGGLSAVAGSAALDIHMIRHTLIITVIDALYRLTVDADRMAWMSQGAAERIPPLSLLRKAFTAGSVTVAGVLATHHDVSLAAQTVLIIGTIFHNAF*
Ga0111230_1015694 | Ga0111230_10156942 | F042910 | GTGIGGGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITTLTISAAGKISFQGLTIAPGAAVRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC*
Ga0111230_1020349 | Ga0111230_10203491 | F074898 | MAMTQMAEKTSPDAPPNCAVVPLTKRTKCCILYVAKAFVLYRKECFFMRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMTAGLGETPFCSLTLPSGQIDLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEALGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHHLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFSQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPEAIGTIELTQLLDGLF*
Ga0111230_1032870 | Ga0111230_10328702 | F042936 | MWPGRPNVSGVDGGSGKGRVGKTKEGIMKTSGIVDRGEDTIRNFEKVEQEGAPTPPYLGNVYFRSRLKGKG*
Ga0111230_1033951 | Ga0111230_10339511 | F090513 | MLKIFAIAKYVRKMEWKLLHIKHILYMRPFGALKIAPQSVGNKNGTKAVLQGWAAAFVPYMLSFTYGNTA*
Ga0111230_1035991 | Ga0111230_10359911 | F056623 | PRPAHPAPARRQLLHLLYVREAAALQSMTLVRAEIPAAQKRDWKLLILPLAAAVLNLGTLALWLTLPNCVTPENSGWMGRFALCIVAQSVSLCAAIGSIVSAMRRKPPKPVVVADEIEFRRRLVEAEGRIALDAQTIEALFAQESVTIISTGAETAAELYASLYEMAQDARLDGDANQEKALSWPLSNAKRLLNAVGCEAVDYAPETAMFYDVMDADITQQRRPAIVQKADGIVQQRGLYLRKG*
Ga0111230_1043849 | Ga0111230_10438492 | F088914 | CCVFFVFWSRKEGANYQISVKSFSTLDSYDIIQLYIMTELTDGRVSDSLFSVPYTPKENIKNPGKFIVFSAWYAAKALEMDVK*
Ga0111230_1052492 | Ga0111230_10524921 | F090514 | MNLRIHALKKASRQRPGVKTAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADILPRLTGAKWFVILLTIVPAGSRARAGSSLPLSSASIFF*
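
The rows of the Sequences table can be exported to FASTA for downstream tools. The following is a minimal sketch, not part of NMPFamsDB itself: the function name to_fasta, the header layout, and the line width are illustrative assumptions, and the trailing "*" is assumed to be a stop marker that should be dropped from the residues.

```python
def to_fasta(rows, width=60):
    """Render (scaffold_id, protein_id, family, sequence) rows as FASTA text.

    Hypothetical helper: the header layout is an illustrative choice,
    not a format defined by NMPFamsDB.
    """
    records = []
    for scaffold_id, protein_id, family, sequence in rows:
        # Assumption: a trailing "*" marks the stop codon and is not a residue.
        residues = sequence.rstrip("*")
        header = f">{protein_id} family={family} scaffold={scaffold_id}"
        # Wrap the residues at a fixed line width.
        lines = [residues[i:i + width] for i in range(0, len(residues), width)]
        records.append("\n".join([header, *lines]))
    return "\n".join(records) + "\n"


# One row copied from the Sequences table above.
rows = [
    ("Ga0111230_1004712", "Ga0111230_10047121", "F059106",
     "MHVIAWKIRVILLQTFAWIVDLKVRERLTEYLKKDIKYHRVIIGVLV*"),
]
fasta = to_fasta(rows)
print(fasta)
```

The family and scaffold identifiers are kept in the FASTA header so that each record can still be traced back to the Associated Families and Associated Scaffolds tables.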

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice