NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300029718

3300029718: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_36244



Overview

Basic Information
IMG/M Taxon OID: 3300029718
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0133139 | Gp0283873 | Ga0245237
Sample Name: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_36244
Sequencing Status: Permanent Draft
Sequencing Center: Beijing Genomics Institute (BGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 254020258
Sequencing Scaffolds: 21
Novel Protein Genes: 21
Associated Families: 20

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Oscillibacter | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 11
Not Available | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1
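
Each lineage in the Dataset Phylogeny table is a single string whose ranks are joined by " → ". The following is a minimal Python sketch, assuming that delimiter, of how such strings might be split to recover the most specific assigned taxon and tallied per taxon. The names `deepest_taxon` and `tally_deepest` are hypothetical helpers for illustration and are not part of NMPFamsDB or IMG/M.

```python
from collections import Counter

# Assumption: ranks in a lineage string are separated by " → ", as displayed on this page.
SEP = " → "


def deepest_taxon(lineage: str) -> str:
    """Return the last (most specific) name in a lineage string."""
    return lineage.split(SEP)[-1].strip()


def tally_deepest(lineages):
    """Count how many lineages end at each most-specific taxon."""
    return Counter(deepest_taxon(lineage) for lineage in lineages)


if __name__ == "__main__":
    # Three lineage strings copied from the table; "Not Available" has no ranks to split.
    examples = [
        "All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales",
        "All Organisms → cellular organisms → Bacteria",
        "Not Available",
    ]
    for lineage in examples:
        print(deepest_taxon(lineage))
```

A string without the delimiter, such as "Not Available", is returned unchanged, so unassigned scaffolds stay visible in any tally.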

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal distal gut

Location Information
Location: United Kingdom: London
Coordinates: Lat. (°) 51.5 | Long. (°) -0.12 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F013656 | Metagenome | 269 | Y
F029444 | Metagenome | 188 | Y
F041208 | Metagenome | 160 | N
F042936 | Metagenome | 157 | N
F043945 | Metagenome | 155 | N
F051934 | Metagenome | 143 | N
F055715 | Metagenome | 138 | N
F057385 | Metagenome | 136 | N
F068855 | Metagenome | 124 | N
F074899 | Metagenome / Metatranscriptome | 119 | N
F077320 | Metagenome | 117 | N
F078003 | Metagenome | 117 | N
F078822 | Metagenome | 116 | N
F081354 | Metagenome | 114 | Y
F082887 | Metagenome / Metatranscriptome | 113 | Y
F087213 | Metagenome | 110 | N
F088914 | Metagenome | 109 | N
F088921 | Metagenome | 109 | N
F092227 | Metagenome | 107 | N
F101192 | Metagenome | 102 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0245237_1000841 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Oscillibacter | 33891
Ga0245237_1004352 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 7855
Ga0245237_1005748 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 6217
Ga0245237_1008306 | Not Available | 4570
Ga0245237_1008586 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 4440
Ga0245237_1010361 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Oscillibacter | 3781
Ga0245237_1011069 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3567
Ga0245237_1011479 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 3469
Ga0245237_1011485 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Ruminococcaceae bacterium LM158 | 3467
Ga0245237_1012467 | All Organisms → cellular organisms → Bacteria | 3228
Ga0245237_1017284 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2397
Ga0245237_1018987 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2212
Ga0245237_1020996 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 2015
Ga0245237_1021515 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1970
Ga0245237_1025554 | All Organisms → cellular organisms → Bacteria | 1688
Ga0245237_1026441 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1636
Ga0245237_1035187 | Not Available | 1244
Ga0245237_1036990 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1185
Ga0245237_1055532 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 777
Ga0245237_1061761 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 692
Ga0245237_1065464 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 653

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0245237_1000841 | Ga0245237_100084136 | F029444 | MGVSAQLTGFELEDFMSWTVSNLQRPFREDFSLEISGIIAEKESQIFGRRFVGFDGPKK
Ga0245237_1004352 | Ga0245237_10043524 | F101192 | MQTEPSDKEGRNMTYRGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQIAKAAPTSITPCGEDNIRRWVSERYQIALRFNRYDICLGVEEEIDG
Ga0245237_1005748 | Ga0245237_10057484 | F029444 | MEVSEQLTGFELEDPMSWTVSNLQRPFREDFSLKKSGIITEKESQKFGRRFVGFDGPKKAAPFFNF
Ga0245237_1008306 | Ga0245237_10083061 | F078822 | FLKAFHPHAVRFPAAFLLVHRFYLAFEELLSCATAYL
Ga0245237_1008586 | Ga0245237_10085862 | F078003 | VNEHLAFAQCLQRVLNETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKQMGEQLTEAQWAALNRALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDASGAEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREEYEPNVTALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPYTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKLSGQ
Ga0245237_1010361 | Ga0245237_10103612 | F081354 | LVRENQQSSAHNFVKDFSPIFLKSSRRSPLKKGAGCRNPLENGKAEVQILFEPLQATISSMELPKNFENKKLPNTLR
Ga0245237_1011069 | Ga0245237_10110691 | F057385 | MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTAMAAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIKEKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEAAANLSSMTEEQLMEELTQADTTAFMTMMNEILSGAEMAEVTEQPGDCDAAKSYVKVTVPPEKLAEMTKALLEMIHSVPSIGAYMDALFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPMSFTATRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDSVEINVSGKGIGTDSSVVDVEIKETVDGVEQGFGVVVTTATTMD
Ga0245237_1011479 | Ga0245237_10114792 | F041208 | MRKILAMLLSVLLLLLTAAVGETPAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAIASYIRRFAELNGFVEVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENVNLHPAAAQTQRDILLLMRDNLAEKGVDITQTFFAVQADDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDTLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDAFADIAGTRTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGMWGPQDGNG
Ga0245237_1011485 | Ga0245237_10114856 | F074899 | MSGRKQWNKQAATSSFLDKKTPQSLIQQGLEGSTVVGKDEVGSSNLPSSSTNSP
Ga0245237_1012467 | Ga0245237_10124674 | F042936 | VWKTKEGIMKTSGIVDRGEDTIRNFEKVEQEGAFTPPYLGKVYFRSRLRGKG
Ga0245237_1017284 | Ga0245237_10172843 | F087213 | WTAATAESIDAVSGAPLDNDIEIVYKEHFDDMVTRYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADAALDARLQAILAACGLNPEDYDISVIRNLSGMPEPITGTNWYCTLIRKGVEVAEDETNPYDMVIVLYGDEMTVGAFVLNPE
Ga0245237_1018987 | Ga0245237_10189871 | F043945 | MKHNLRMAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLDGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVAELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAENGLYVIAKDKTDEYQIVRFDVDTGKGEMMGFALEDDENSHLLTDMPGIFQWSGTSSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNTAMYFVHDNTVWVMEYDEENS
Ga0245237_1020996 | Ga0245237_10209963 | F077320 | MKRKGMHRRRKLVLLAVSLIMVGIAVWRIWQTPRPTVMHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLSYIRDYSASGHYQIVWEDVSAEAAEAYLTALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTCAEEPTATPLPEWLADW
Ga0245237_1021515 | Ga0245237_10215153 | F088921 | LGATIRLYKIGAGKNHFLCSEKSKSTVFDLDRETKKRKYAKETCRIYIEKDQNMQEEKKEKYINGEIYGEKNQIIVANKLAMIRNKMRSK
Ga0245237_1025554 | Ga0245237_10255542 | F055715 | LTKADEFDKINELLIERTAKKFERTSKNKLKKFLTNEKFCDKINELIRVGTAKILDN
Ga0245237_1026441 | Ga0245237_10264414 | F088914 | VGRILPVCSGFFVFWSRKEGANYQISVKSFSTLDSYDIIQLYIMTELTDGRVSDRLFSVPYTPKENIKNPGKFIVFSAWYAAKALEMDVK
Ga0245237_1035187 | Ga0245237_10351871 | F082887 | SLNKKNLEYNILGGFNMEKIKTIAKYTINVLAIISALVTGINAVEGITIPYAIQIVQVIAVVQGVISTYLLGQKAISNREKK
Ga0245237_1036990 | Ga0245237_10369901 | F051934 | SDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQATDLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQETSGRLENTALITCAEEIARLSEDDRALYYYIEAFGLESGADANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTVIIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245237_1055532 | Ga0245237_10555322 | F068855 | VEAQGPQVRKILAPQWLQAEKESENANVQNAGWLHSFDTDYHYRGSDLHYHVNVSAALSEGGAAW
Ga0245237_1061761 | Ga0245237_10617611 | F013656 | MSEIMEKKIYKEVNKNETETTINVLYKEEKICIYTNKVDLQKQLNKLLGEPTKEYKIKRSIVGSSWEIDFKEKSKISQMILKANIYEL
Ga0245237_1065464 | Ga0245237_10654641 | F092227 | PSGLMKEEKRKRILRVGCLILAGIFALSMLGSVVMMLLV
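
For downstream tools that expect FASTA input, rows of the Sequences table can be converted record by record. The following is a minimal Python sketch, assuming each row is available as a (protein ID, family, sequence) tuple. The function name `to_fasta` and the `family=` header layout are hypothetical choices made for illustration, not an NMPFamsDB export format.

```python
def to_fasta(records, width=60):
    """Format (protein_id, family, sequence) tuples as FASTA text.

    The header layout ">protein_id family=FAMILY" is an assumption for illustration.
    """
    lines = []
    for protein_id, family, sequence in records:
        lines.append(f">{protein_id} family={family}")
        # Wrap the sequence into fixed-width lines.
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Two short rows copied from the Sequences table.
    rows = [
        ("Ga0245237_10083061", "F078822", "FLKAFHPHAVRFPAAFLLVHRFYLAFEELLSCATAYL"),
        ("Ga0245237_10654641", "F092227", "PSGLMKEEKRKRILRVGCLILAGIFALSMLGSVVMMLLV"),
    ]
    print(to_fasta(rows), end="")
```

Keeping the family accession in the header lets sequences from the same family, such as the two F029444 entries in this sample, be regrouped later.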




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice