NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300008725

3300008725: Human stool microbial communities from NIH, USA - visit 1, subject 763678604 reassembly



Overview

Basic Information
IMG/M Taxon OID3300008725 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053330 | Ga0115670
Sample NameHuman stool microbial communities from NIH, USA - visit 1, subject 763678604 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size211787256
Sequencing Scaffolds25
Novel Protein Genes26
Associated Families26

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales11
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides salyersiae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella → unclassified Collinsella → Collinsella sp. 4_8_47FAA1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii3
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes1
Not Available1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationNational Institutes of Health, USA
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F026489Metagenome197N
F026592Metagenome / Metatranscriptome197Y
F029444Metagenome188Y
F032286Metagenome / Metatranscriptome180Y
F039147Metagenome164N
F043945Metagenome155N
F051935Metagenome143N
F055715Metagenome138N
F055775Metagenome138N
F057385Metagenome136N
F058154Metagenome135N
F066905Metagenome126Y
F068811Metagenome124N
F073574Metagenome120N
F074898Metagenome119N
F075480Metagenome119N
F076190Metagenome118Y
F077320Metagenome117N
F078004Metagenome117N
F081453Metagenome114N
F088914Metagenome109N
F092227Metagenome107N
F097493Metagenome104Y
F099269Metagenome103N
F100397Metagenome102N
F105374Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0115670_1001486All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes20958Open in IMG/M
Ga0115670_1003427All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium9368Open in IMG/M
Ga0115670_1003477All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales9261Open in IMG/M
Ga0115670_1003814All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides → Bacteroides salyersiae8478Open in IMG/M
Ga0115670_1003966All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8102Open in IMG/M
Ga0115670_1004028All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium7969Open in IMG/M
Ga0115670_1004406All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales7292Open in IMG/M
Ga0115670_1005092All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6314Open in IMG/M
Ga0115670_1005106All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6300Open in IMG/M
Ga0115670_1005575All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales5746Open in IMG/M
Ga0115670_1006340All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales4997Open in IMG/M
Ga0115670_1006883All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae4612Open in IMG/M
Ga0115670_1010217All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes3046Open in IMG/M
Ga0115670_1012715All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella → unclassified Collinsella → Collinsella sp. 4_8_47FAA2419Open in IMG/M
Ga0115670_1013422All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae2289Open in IMG/M
Ga0115670_1013875All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2218Open in IMG/M
Ga0115670_1015795All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1938Open in IMG/M
Ga0115670_1016802All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1821Open in IMG/M
Ga0115670_1022390All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1353Open in IMG/M
Ga0115670_1023527All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes1284Open in IMG/M
Ga0115670_1026520All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1137Open in IMG/M
Ga0115670_1028226All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1066Open in IMG/M
Ga0115670_1041668Not Available713Open in IMG/M
Ga0115670_1042436All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii699Open in IMG/M
Ga0115670_1054581All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales529Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0115670_1001486Ga0115670_10014866F097493MKNGTAQFYEHGVEIDGTVYGIHTDRDTLRIKRSVVNDKFAETDGDFDMDTEIAKIKHTDITFEQPTAEQLEQIQAKTYNSMTELKQHVQSVMNGDETMSQDEINAMLMLQIAELKAGVGGE*
Ga0115670_1001591Ga0115670_10015911F078004KIIRFQAIFKDTVEFTCALEFKISSQHNITFFQITVFYVPAVSRRIRSLALLSAVSEHLSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDVHMLRHTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF*
Ga0115670_1003427Ga0115670_10034274F075480MNKTKQEKWQRAYGDTPDSFRQRVASALPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTQTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLVPIGMEAAPTLQFLKVNGQTTDTPLIGAFLTQNPDGTVSAGFQVDLTEADTSHLKSCEVQLECRVGAFGKDGKATQWQKEILIATITFK*
Ga0115670_1003477Ga0115670_10034778F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQRRLRYQPYLLPPEDASRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGTAIRQVMFRMVDYALPAYAVATLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAPPTPKPDP
Ga0115670_1003814Ga0115670_10038142F055775MAQIAQQDNLVIEVTTAAALDGATKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTTGDSPKYTIHIINANSGAVAAIALN*
Ga0115670_1003966Ga0115670_10039667F029444MEVSEQLTGFELEDLMSWTAFNLQRPFREDFSLKKSGIIAEKESQKFGRRFVGFDRPKKAAPFFNF*
Ga0115670_1004028Ga0115670_10040281F074898VSRPESGKDKKKTKKQCGTWNLSFSLTKRTKCCILYVAKAFVLYRKECFFMRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMTAGLGETPFCSLTLPSGQIDLAFSPDKELCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEVLGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRLAASVLGKIYSMEYLDGLNFSDIYHNLLASVVQTSEVARAYRSAMRGYLLSNFRLSGQIGIEKGELLFRQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQAGTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDTLGTIELTQLLDGLF*
Ga0115670_1004406Ga0115670_10044065F039147MERKRIAMRARRLLILLMMLLLLPRAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDDNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYAMSRVPNYRMPDLTHAISELTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMHVCGSVLAISVMQENGIKVVLVDLADGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRDGKIYAMPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTTNADLVGKLLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFSALVSQVHQGEISLEEFVEEADKLIEGLEQ*
Ga0115670_1005092Ga0115670_10050927F051935MKEEKRMKRLLGLLMAVMVMMGGISCAVAENTNPIVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAANLKEMRFLPVVAQKDNQLRLLILRKQGDLWKVSEQNDRALMRDGWTLQNFSAMPYGNSDWTYIYFDFVDENQKRWNLMLNLGDGYVSSFGTISHYVEGYGTTYINMNYDRGLEFLIDAPAYSRLSYEVYPVEDYSFGVEDFDLATCPLSMQEFLVSAIVTCGEEGAGLYIMVQQDVQPIVTLADGDAIEAIPQKWELDWTIVYYQGNYLFMKTENCKMEE*
Ga0115670_1005106Ga0115670_10051064F032286MVKWVCQIVTPVRHRALVFDARLFAGNAADDTLVTGGTFRFCRLICLCVKRRNIMLNDKRRSLLNSALFRADNRTEQKTISPFSLALISTFDFAALSERRSCPEDRSRRFVLLGALDAALRQRTYPVRTVMQFSRFRCDCKTILAKNIALSRISKRSENAPKNADFHAYG*
Ga0115670_1005575Ga0115670_10055754F057385MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTTMAAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIKEKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEASANLSSMTEEQLMEELTQADTTAFMTMMNEILSGAEMAEVTEQPGDCDAAKNYVKVTVPPEKLAEMTKALLEMIHSVPSIGAYMDAFFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPMSFTATRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDSVEINVSGKGIGTDSSVVDVEIKETVDGVEQGFGVVVTTATTMDGEQGVRKVDVLVRFMGLDVVTITAETRTCDAKDALDVSKAQDLGAMTDSEFQTWFVKVMNNLQNLPMTLLMSLPESMLTLLMGGSN*
Ga0115670_1006340Ga0115670_10063405F043945MAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRPPTGADSARPLAGELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLSGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELNYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAQYGLYVIAKDKTDEYQIVWFDLDTGKGEMLEFALEDDENSHLLTDMPGIFQWSGTSSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNSAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGF*
Ga0115670_1006883Ga0115670_10068834F088914VGRILPVCNGFFVFWSRKEGANYQISVKSFSNLDSYDIIQLYIMTELTDGRVSDRLFSVPYTPKENIKNPGKFIVFSAWYAAKALEMDVK*
Ga0115670_1010217Ga0115670_10102171F026592ASPQGIDAQASQGSVAPLMERSDETFSGMQLFSADRE*
Ga0115670_1012715Ga0115670_10127154F099269VGELLLLDDLGDRASGASVLAGATSDAGVLVSDGSDVLELQNASGAGVDANATSDALVGINYGMSHGSFLSVDRRYRRCAPV*
Ga0115670_1013422Ga0115670_10134223F081453MFPFFLLAGENIIEKHISANPVCGINIKAPKPTVKGAPGRKCIQ*
Ga0115670_1013875Ga0115670_10138752F073574VDDTASRVRCAIVLNGSKELHIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFYSRCDGYANVLDDRQPPAPIPLHPAICPKCRNAAFQLRLTFEYPEAEELAAFANPDDMFTWVWVTMRCTRCHAVFRGDFAAD*
Ga0115670_1015795Ga0115670_10157952F077320MKRKGMRRRRKLVLLAVLLIMVSIAVWRIWQTPRPVVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGLLLYIRDYSASGHYQIVWENVSAEAAESYLTALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEESTPTPLPEWLADW*
Ga0115670_1016802Ga0115670_10168023F076190GSITIFVKNVFTAASRVLGRVWSLVRITIPNGLKYVKIGVEKPQNNSLYPYFQREPL*
Ga0115670_1022390Ga0115670_10223903F026489VAKENQKTTSDFDALEPRKRGCSPLLTPKKWAAPKKTEDSRLFGV
Ga0115670_1023527Ga0115670_10235272F105374DGDTPELCRPAPLPCRLGGGVCFGMDGLDAAIPRGGGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRTRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVVRSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP*
Ga0115670_1026520Ga0115670_10265202F068811MDKMDQDGSEHNICSNREGLCPGKEQHGASGWKKIFRHGKEPLRNKDSASQYCNKKAAVLLILNENVSETLCIFSIDKTNCCRI*
Ga0115670_1028226Ga0115670_10282262F055715LTKANEFDKINELLIERTAKKFERASKNKLKKFLTNEKFCDKINELIRVGTAEILDN*
Ga0115670_1041668Ga0115670_10416682F100397MKKLIAILVLSLSLILCLTACGSKMPYDLSETASVELHAYNNDSTEPFAKIVVDGEDVATIVEMFNSLKLKEMKYTEPSIRGYEFWFRDENGSEIVKIELPYGPSPWLVVGGTEYPYQDVNGGVDVDYLAQLVDMTISAGPMQPEDGVDHNAP
Ga0115670_1042436Ga0115670_10424362F092227MNEQKRKRILRVGCLILAGVFLLSVLGSVILMLLV*
Ga0115670_1054581Ga0115670_10545812F066905QDEVRDCSCCNNCNQGGLFGNLFGGCNNDSTILFFIIIFLLLFTNFGCGCGR*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.