NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000300

7000000300: Human stool microbial communities from NIH, USA - visit 2, subject 159814214



Overview

Basic Information
IMG/M Taxon OID7000000300 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052823 | Ga0030589
Sample NameHuman stool microbial communities from NIH, USA - visit 2, subject 159814214
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size96220883
Sequencing Scaffolds20
Novel Protein Genes22
Associated Families22

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales12
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium2
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F029444Metagenome188Y
F039147Metagenome164N
F041208Metagenome160N
F042910Metagenome157N
F045105Metagenome153N
F047125Metagenome / Metatranscriptome150N
F051935Metagenome143N
F056623Metagenome137N
F057385Metagenome136N
F058154Metagenome135N
F059982Metagenome133N
F060985Metagenome / Metatranscriptome132N
F068856Metagenome124N
F073573Metagenome120N
F073574Metagenome120N
F075480Metagenome119N
F077320Metagenome117N
F082714Metagenome113N
F087213Metagenome110N
F089053Metagenome109Y
F091068Metagenome108N
F101192Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C1832884All Organisms → cellular organisms → Bacteria1285Open in IMG/M
C1848454All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2526Open in IMG/M
SRS043701_LANL_scaffold_10863All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii598Open in IMG/M
SRS043701_LANL_scaffold_12981All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8244Open in IMG/M
SRS043701_LANL_scaffold_14322All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales14931Open in IMG/M
SRS043701_LANL_scaffold_15949All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10527Open in IMG/M
SRS043701_LANL_scaffold_16781All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2326Open in IMG/M
SRS043701_LANL_scaffold_1756All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes15273Open in IMG/M
SRS043701_LANL_scaffold_1873All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae8216Open in IMG/M
SRS043701_LANL_scaffold_21315All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium13104Open in IMG/M
SRS043701_LANL_scaffold_22457All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales754Open in IMG/M
SRS043701_LANL_scaffold_26256All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales12968Open in IMG/M
SRS043701_LANL_scaffold_29088All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales13458Open in IMG/M
SRS043701_LANL_scaffold_30097All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6736Open in IMG/M
SRS043701_LANL_scaffold_3031Not Available609Open in IMG/M
SRS043701_LANL_scaffold_31019All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10773Open in IMG/M
SRS043701_LANL_scaffold_31313All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae7290Open in IMG/M
SRS043701_LANL_scaffold_31395All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium11868Open in IMG/M
SRS043701_LANL_scaffold_4824All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales59057Open in IMG/M
SRS043701_LANL_scaffold_8747All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales28908Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C1832884C1832884__gene_136562F060985MSNIDEKAKNNFTIEMRIFENYEKVKHEIIKVIDFLRHAETNLGMCRIFDNQNHEFWHSVIKPWFKPERFGITHLWFSSGFSHIGYGEYHTIKGNRWLKTPIDKIDRENRIFGYWFPPYKKYIPHRIKVLKLALKDLERIKEEYGKD
C1848454C1848454__gene_140857F059982MTMRHMTRLLCLMLLFSLITFSCPLAEETDAPTGTPAPTPMLTAVPESALAPFNVVLPEDAHVEMAEGRITLVRGDSRVVAMVISRVPDADPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFDDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENVSMDGAAVYVAEDAAATASPEA
SRS043701_LANL_scaffold_10863SRS043701_LANL_scaffold_10863__gene_20961F089053CRAKRALFGFDVRSGCVVDECRPAKGCIRKWLYIFANGQLKLSTAEEVNVKNMEVSL
SRS043701_LANL_scaffold_12981SRS043701_LANL_scaffold_12981__gene_25391F056623MKSTEEMLQAAVQQLQTAEQARLTSETGPARRQLLHLLYAREAAALQSMTLVRAEIPAAQKRDWKPLIFPFAAAVLNLGALALWLTLPNCVTPENSGWMGRFALCIVAQSVSLCAAIGSIVSAMRRKPPKPVVVADETEFRRRLVEVEGRIALDAQTIEALFAQESVTIVSTGAETAAELYASLYEMAQDARLDGDANQEKALSWPLSNAKRLLNAVGCEAVDYTPETAMFYDVMDADITQQRRPAIVQKADGIVQQRGLYLRKG
SRS043701_LANL_scaffold_14322SRS043701_LANL_scaffold_14322__gene_28061F068856VKDSGASAEGNVLAAAHAVLGKERKGMKRLTSILLALLMLVGMALAEETPDAALGDWYALPSDETVLRLTLREDGTFFFGTEGISGIEGKWRKTTDGEYNLAYTNRSSSLLDVIMSMVDSQAPAPDMTMTARLTESGLDVFYGSTAEGAVVHMARDAEELRTERTPRTDTPLEAFAGTWTMETMFLGTMQLTYTPEMGERQVFCTIDGLTMFPGAGLESFPEGTSFPLTFEDGVLRTTIPLTVQMAASSALVKEIVVDYDLTFFQTADGSLYATLRLSDVPDNPTTMFLLVPMEKE
SRS043701_LANL_scaffold_14548SRS043701_LANL_scaffold_14548__gene_28543F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPAGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAEDGSICRMRRERGAGSNHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADHSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSDVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGTLLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASAAIERQYQRNMAEEGYTCFIESDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLQALLTGSQTDEAGLRQMKTDGTLPLFNALLALEVAPDASTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVDSVQIPSFCALRTSILLPVRVWNTLAENLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATLRSFQSAISISGSYFGDWSVACLCQTIYPDVPDYAISQEVAAEYAARALGDDDYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRVMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNV
SRS043701_LANL_scaffold_15949SRS043701_LANL_scaffold_15949__gene_31538F075480MNKTKQEKWQRAYGDTPDSFRQRVASSLPKGQETRHVAFPRRAMVLAAALVLVLTTAYAAVVTHTELVWNAGHPIENEADDHLGLLTGKAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLLPVRMEIRGSEQFPNIPMFCDFLTQNPDGTVSVGFQVDLSEEDVSHLKSCEVQLECRVGAFGKDGKATQWQKEILTATITFK
SRS043701_LANL_scaffold_16781SRS043701_LANL_scaffold_16781__gene_33247F073574MLLPAAVRPYAADVDDTASRVRCAIALNGSQDLRIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFDSRCDGYANVLDGKQPPAPIPLHPAICPKCRNAAFQVDLSFEYPDAEEVSAFANPGDAFTWIWISMRCTRCHAVFRGDLECD
SRS043701_LANL_scaffold_1756SRS043701_LANL_scaffold_1756__gene_2076F051935MKRILTFLLAAALLLTSLSCALADGTSPIVSDGLTELKSKRLIQIVEDEIGEGWTLYQPNGREYEENFSSLVNLREMHFLPLVAWKGNQLRLLILRRQGDLWKVSEQNDCALMRYGWTLHNFSAMGYGNSDWADIYFYFLDENQKEWELALHLSDTNVSYFDMIYVPEKYGTIIVFYYDCELKFLFDAPFYFQLNYDIKPAESYSFDVKDFDLATCPLSMQEFLVPAVVTCGEEGAGLYIMVQQDIQPILTLVDGTAIEAIPQKWERDWTIVYYQGNYLFMKTEYVTFSE
SRS043701_LANL_scaffold_1873SRS043701_LANL_scaffold_1873__gene_2312F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRYFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVLFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMEERFIASPMATARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGTAIRQVMFRMVDYALPAYAVATLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES
SRS043701_LANL_scaffold_21315SRS043701_LANL_scaffold_21315__gene_42654F041208MRKILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAISSYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVTLDSTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDITQTFFAVQTDDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGCASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDTLTENDATPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDAFADIAGTQMAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFAIPQRDMADDAYEVEVDDKDGKVTGMWGPQDGNG
SRS043701_LANL_scaffold_22457SRS043701_LANL_scaffold_22457__gene_45280F029444MDVLGQLTGFELEDLMTWTVSNLQRPFREDFSLEKSGIIAEKGSQIFGCRFVGFDG
SRS043701_LANL_scaffold_26256SRS043701_LANL_scaffold_26256__gene_54609F087213MKKFSSVLFAVLLIFSCWTAATAESTDAVSGAPLDNDIEIVYKEHFDDMVTRYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADAALDARLQAILAACGLNPEDYDISVIRNLSGMPEPITGTNWYCTLIRKGVEVAEDETNPYDMVIVLYGDEMTVGAFVLNPEV
SRS043701_LANL_scaffold_29088SRS043701_LANL_scaffold_29088__gene_62767F082714MVVGIRFAADAPVRTVLQAVLPIFSTADVDFLVREYWVCTFGNGLPERRFTAQEMRRALDALTPDEHAELFTIYVLPHNAPSTPPSSCEDFCARGFTMAFYAYDGDGYALLAQSGEQVEAVIETLQKAVEIRSVEAVECETLARWAF
SRS043701_LANL_scaffold_30097SRS043701_LANL_scaffold_30097__gene_66356F091068MNEKNQHVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFARAMAKIDRKAKRQKRRTVLRVLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMKPAKEEQTANVPRIDVPDGWEGLFFPTFLPDSLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWAEGKRMLRVTSIGLPADEAVLVAQSVKKIFAE
SRS043701_LANL_scaffold_3031SRS043701_LANL_scaffold_3031__gene_4950F047125MDVALLLMVLGVMLSGFWAADALDHMRKEIIRQEGKRRGWW
SRS043701_LANL_scaffold_31019SRS043701_LANL_scaffold_31019__gene_70876F042910VAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITTLTISAAGKISFQGLTLAPGAAIRIHHNAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEAGSAAFVSGRCKGRYC
SRS043701_LANL_scaffold_31313SRS043701_LANL_scaffold_31313__gene_73162F039147MRARRLLILLMMLLLLPQAQAEQLTLYTRPNNVDEATPFQLRPTELSICSVTRAMGGVVVLANDYNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYSMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMYVCGSVLAISVMQENGIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSTNEISRTSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEEPVVTFPAANVDIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPKVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPNTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGVYRDAVTRDGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVHDWIGYCQAENIPLRFDHPVFREMMAALEAMRTDKIEQANQQVNEEISDYRECLIWTEAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTTNADLVGKLLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVSDYEKTLTELRRQQENAPAWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFNALVSQVHQGEISLEEFVGEADKLIEGLEQ
SRS043701_LANL_scaffold_31395SRS043701_LANL_scaffold_31395__gene_74160F073573MPTIVSFYQRFPNEAPPLNLSAFDRTGYTYAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMLFQVRRMELRMGDTTYAILPDNVQQYTMRGETDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLTAAQKKAIRRFCMDCEESAITYQPAFLWYKDYCTKA
SRS043701_LANL_scaffold_31395SRS043701_LANL_scaffold_31395__gene_74161F101192MQTEPSDKEGRNMTYWGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQLAKAAPTSITPCGEDTIRRWVSERYQIALRFNRYDICLGVEEEIDG
SRS043701_LANL_scaffold_4824SRS043701_LANL_scaffold_4824__gene_8542F057385MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTTMAAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIREKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEASANLSSMTEEQLMEELTQADTTAFMTMMNEILSGAEMAEVTEQPGDCDAAKNYVKVTVPPEKLAEMTKALLEMIHSVPSIGAYMDALFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPMSFTMTRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDSVEINVSGKGVGTDSSVVDVEIKETVDGVEQGFGVVVTTATTMDGEQGVRKVDVLVRFMGLDVVTITAETRTCDAKDALDVSKAQDLGAMTDSEFQTWFVKVMNNLQNLPMTLLMSLPESMLTLLMGGSN
SRS043701_LANL_scaffold_8747SRS043701_LANL_scaffold_8747__gene_16652F077320MKRKGMHRRCKWVLLAVLLIMVGIAVWRIWQTPRPVVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLLYIRDYSASGHYQIVWEDVSAEAAEVYLNALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEEPTPTPLP

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.