NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029698

3300029698: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37194



Overview

Basic Information
IMG/M Taxon OID3300029698 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283820 | Ga0245184
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37194
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size197377191
Sequencing Scaffolds14
Novel Protein Genes19
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Phocaeicola → Phocaeicola dorei1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → unclassified Subdoligranulum → Subdoligranulum sp. APC924/741

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F041208Metagenome160N
F043413Metagenome156N
F043945Metagenome155N
F045105Metagenome153N
F047126Metagenome150N
F051934Metagenome143N
F056623Metagenome137N
F058154Metagenome135N
F070133Metagenome123N
F072366Metagenome121N
F074898Metagenome119N
F077319Metagenome117N
F077320Metagenome117N
F082714Metagenome113N
F082715Metagenome / Metatranscriptome113N
F088920Metagenome109Y
F090515Metagenome108N
F091068Metagenome108N
F099451Metagenome103N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245184_100067All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales206444Open in IMG/M
Ga0245184_100091All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales178405Open in IMG/M
Ga0245184_100106All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales160385Open in IMG/M
Ga0245184_100110All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Phocaeicola → Phocaeicola dorei157603Open in IMG/M
Ga0245184_100209All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales107677Open in IMG/M
Ga0245184_100237All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales97980Open in IMG/M
Ga0245184_100326All Organisms → cellular organisms → Bacteria79611Open in IMG/M
Ga0245184_100400All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales71281Open in IMG/M
Ga0245184_100786All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales40575Open in IMG/M
Ga0245184_100850All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales37769Open in IMG/M
Ga0245184_101581All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes19883Open in IMG/M
Ga0245184_102989All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10369Open in IMG/M
Ga0245184_127647All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales856Open in IMG/M
Ga0245184_128126All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → unclassified Subdoligranulum → Subdoligranulum sp. APC924/74837Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245184_100067Ga0245184_100067136F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALVGVNAQLTVGTNEERNLATYLGLALTAVVAYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMATARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVAMLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES
Ga0245184_100067Ga0245184_10006765F077320MKRKGMRRRCKLVLLAVLLIMVGIAVWRIWQTPRPTVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLLYIRDYSASGHYQIVWEDVSAEAAEAYLTALLDKGFTRLMGMAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEESTPTPLPEWLADW
Ga0245184_100091Ga0245184_100091110F082715MMKFMKRALSLLISAMLLLGCFALAEEAPLLPIAIVSYDLTDEATTAALSALIEPKENALTRWERVVLSDGREAWVICQFDQATMSNAWSHVIDAETQEVLQEDTTDTGFFATAQARWESAKGIYALWSIQDKMLFDRLYAMAPCYGEPVEGDLTQEEALAAALNVTGLTAADYDAVGYGYIMGSSEQNGTWQVFFVKGNEVVSTVNLDARDATILLVEPDEEGNG
Ga0245184_100091Ga0245184_100091123F077319MQENTVCAANCTVQEKRGGQKIMLIRNATLQMSATERTCMDVRVMNGCVWEMGAALVKGLYEAEIDLCGDVLMPGRMLETPISAADEKALRLLCRRLYREGVRYFVADCPADALLRVQNRPERRGALPVTVLPNPEPLRSGTGMPLTRWTAAGEFVGMMDEHSGD
Ga0245184_100091Ga0245184_100091163F051934MGLMRRLAAAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLSLTNAPEKTATRIRFFGLDSHWGIQATDLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGLRQAIADLAAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGTDANIFDALATLPEYVEANFPDGLSIEHTENGVSWQDGEETVFSYADADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTVIIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAKLVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245184_100106Ga0245184_100106156F074898MRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMTAGLGETPFCSLTLPSGQINLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEVLGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHHLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFSQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDEDFQLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDTLGTIELTQLLDDLF
Ga0245184_100110Ga0245184_100110142F099451MEIKNVGQLRKIIENLSDDYEIEMRIRRKLTDEELKNCRYPYPYDTEYLTLEFDDIGVSDKVLCLGVTSNE
Ga0245184_100209Ga0245184_10020920F091068MNEKNQQVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFARAMAKIDRKAKQQKRRTVLHMLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMEPAKEEQTANVPRIDVPDGWEGLFFPTFLPDDLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWAEGKRMLRVTSIGLPADEAVLVAQSVKKIFAE
Ga0245184_100209Ga0245184_10020922F043945MKHNLRMAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLNGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVAELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAENGLYVIAKDKTDEYQIVRFDVDKGKGEMMEFALEDDENSHLMPDMPGFFRWSAESSKYERTLYDGKVLTMVNLPNGEATSVVHNLVNTLTLQPIQDYDVTLGDYVTHDAYYAYAPNKDNTAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGFYWRGFYLIQTHDGLYLCHTGDETARAYLYGGVE
Ga0245184_100237Ga0245184_10023710F043413MTERELREKLQSAYGTMPDATRAAFEHSLTHHRVETRHPVRMSRRMRTIVTFALMLMMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVALNEAYDGDLLTLSVNDLVFDGTTVDVAMNVQPKAGKQVFMDMEVTAECAGRAYTLEIEGCGGGDFMSGMFLPDGTGSWSDSPFGFDGVLWDDDMQSPPEGTPIAWTMTFELLQPVWEVEYLSDAQYQALLAGGADALETYVQNNWRQQIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRSADTIIVQFTTDFADDYAHPELVGKRIGMGDYDLVIDNVNLSFMRANITMHYEFSAAYTEDEIRQMTNLPNAWRVYVNGETDSGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGEKWDACHPDAEKGFTLELQE
Ga0245184_100260Ga0245184_10026057F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAENGSICRMRRERGAGSDHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADRSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSEVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGTLLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASAAIERQYQRNMAEEGYTCFIESDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLQALLTGSQTDEAGLRQMETDGTLPLFNALLALEVVPDASTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVGSVQVPSFSALRASILLPVRVWNTLAEKLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATLRSFQSAISISGSYVGDWSVACLCQTIYPDVPDYAISQAVAAEYAARALGDDDYALRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRVMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNV
Ga0245184_100326Ga0245184_10032653F082714MAVGIRFAADAPVRTVLQAVLPIFSTADVDFLVREYWVCTFGNGLPERRFTAQEMRRAVDALTPDEHAELFTIYALPHDAPGTPPSSCEDFCARGFTMAFYAYDGDGYALLAQSGEQVEAAFEALTTAGLCEKVEEVEKETLAHWGF
Ga0245184_100400Ga0245184_10040053F041208MRKILAMLLSVILLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGVLPDNLDEAISSYIWRFADLNGFVKVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDITQTFFAVQADDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSADTLTENDAAPAAPKREPIPTLESYQITLSGTAYTAFAINKVEKGYDTFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGLWGPQDGNG
Ga0245184_100786Ga0245184_1007863F070133MKRLLGLLMAVMVMMGGMAGAEASTDNASMQLIRMNPLAFRKEPVKLYSVTHTPNGSFVVIYFAEGEKTELQEMWMELFDSVGTSLLSAKLGEFDPNGEKIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTTKKLKFGAAPYAQHVGDYMVEKQAHSEDETPFRTVKITHIASGKSKKLRIYDWSFCAFPDQDGNLLIAQQNEKGNLEIRSYNAAMQESIVELSGDFLQNSYISDAACIGQTVYFDIYLTNQQSEILFYDMKQQNITDSQTLHTANDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLPLQHESCLYIFTDVEQADVTTIEMDEKSHSYFVCSYSISAGE
Ga0245184_100850Ga0245184_1008507F072366MLDILAIKADVYQLERQGKRLPVYRYLREVWQKEPPSEGLTVLALQQMVDYVEYVDDLTVLGEPWEAENEYDLYQDFLLDVISWGLQKYRAKKRFLWQICYYVNAWATFYYIFGREITQENVEQWKKTLFKEAKERYPDSLLFEFILHAAQLDYVWFYRLTDEQRLQIRLEVGEWNLQKNDMDQAVQSYFDDAMTWYRDNGRKLLEAKNKTNN
Ga0245184_101581Ga0245184_1015818F056623MKSTEEMLQAAVQQLQTAEQARLTSETDPARRQLLHLLYAREAAALQSMTLVRAEIPAAQKRDWKPLIFPLAAAVLNLGALALWLTLPNCVTPENSGWMGRFALCIVAQSVSLCAAIGSIVSAMRRKPPKPVAVADEPELRRRLVEAEGRIALDVQTIEALFAQESVTIVSTGAETAAELYASLYEMAQDARLDGDANQEKALSWPLSNAKRLLNAVGCEAVDYTPETAMFYDVMDADITQQRRPAIVQKADGIVQQRGLYLRKG
Ga0245184_102989Ga0245184_1029892F090515VITCCKAGQKAALQGTAPGRVACLEFAGGEKRKTNIAADYAVKALAFTALLCRHIGSIRTFLKS
Ga0245184_127647Ga0245184_1276472F088920VSARPHYYLAFRAGLILHLILHFSQKVAIFAPKRAILLIFVPTLFFAGLSVFSPQTSPKISGQTSLH
Ga0245184_128126Ga0245184_1281263F047126LFPKVYLKTKKMTNISQTQAGFKSKSLGILCGFQGLLTQNPAALVETDDIFDVSDTPYGFSGLNTLRAAGVPNPPPPFAQRFIACFCSQTAAASQSKSAHILSPLESPCILCSLLRYFHILSKKLQKTC

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.