NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029700

3300029700: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37322



Overview

Basic Information
IMG/M Taxon OID3300029700 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283843 | Ga0245207
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37322
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size186064096
Sequencing Scaffolds14
Novel Protein Genes21
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales11
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039147Metagenome164N
F041208Metagenome160N
F042910Metagenome157N
F043413Metagenome156N
F043945Metagenome155N
F051934Metagenome143N
F055739Metagenome138N
F058154Metagenome135N
F059982Metagenome133N
F068811Metagenome124N
F073573Metagenome120N
F073574Metagenome120N
F074898Metagenome119N
F077320Metagenome117N
F078003Metagenome117N
F082715Metagenome / Metatranscriptome113N
F087213Metagenome110N
F091068Metagenome108N
F092228Metagenome107N
F095494Metagenome105N
F101193Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245207_100004All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii582928Open in IMG/M
Ga0245207_100028All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales328762Open in IMG/M
Ga0245207_100065All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales221853Open in IMG/M
Ga0245207_100145All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales141981Open in IMG/M
Ga0245207_100198All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales120562Open in IMG/M
Ga0245207_100206All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales117611Open in IMG/M
Ga0245207_100209All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales116044Open in IMG/M
Ga0245207_100220All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales112807Open in IMG/M
Ga0245207_100267All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales98661Open in IMG/M
Ga0245207_100469All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales61784Open in IMG/M
Ga0245207_100493All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia59583Open in IMG/M
Ga0245207_100517All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales56775Open in IMG/M
Ga0245207_100650All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales46319Open in IMG/M
Ga0245207_101288All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii21830Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245207_100004Ga0245207_10000429F092228MKKKCTLVLISVLTVACIVSAYLLFFYNPSFNMVYDSDTDSYFNNSYLSYNDGTLAAADYRKTKVTAYDSKNNSTVNLPSNGCLINDNLFYINGSKLCCLDTTTNTRKIIDTDCRSFVCNNEVIAYTKNDSVILKDSDTLENIGDIKFDNQIYYINISDGNLYIAERIFEYKTDEYGYSFKVGKQYIFKKYDLKSCKLLKSKNANYVNGIRYVTVCKDTFYFFCDETQTVNNVCLDKDVNYPTIQHPDVKFITSNSDCVYYISEKTESAIIRKTVESPYNGIWKLEVGSNKPVKIADKCDCDELLATKNFLYCYTINYILPRGVADLWVKGYLIDQLAIS
Ga0245207_100028Ga0245207_100028184F073573MPTIVSFYQRFPNEAPSLNLSAFDHTGYTYAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSDGMPFQVRRMELRTGDTTYAILPDNVQQYTMRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLTAAQKKAIHRFCMDCEESAITYQPAFLWYKDYCTKA
Ga0245207_100065Ga0245207_100065161F087213MKKFSSVLLAVLLIFSCWTVATAESIDAVSGAPLDNDIEIVYKEHFDDMVTHYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADTALDARLQAILAACGLNPADYDISVIRNLSGMPEPITGTNWYCTLIRKGIEVAEDETNPYDMVIVLYGDEMTVGAFVLNPEV
Ga0245207_100145Ga0245207_10014558F091068MNEKNQQVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFARAMAKIDRKEKQQKRRTVLHMLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMKPAKEEQTANVPRMDVPDGWEGLFFPTFLPDSLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWSEGKRMLRVTSIGLPADEAVLVAQSVKKIFAE
Ga0245207_100145Ga0245207_10014560F043945MKHNLRMAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLDGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVAELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAENGLYVIAKDKTDEYQIVRFDVDKGKGEMMEFALEDDENSHLMPDMPGFFRWSAESSKYERTLYDGKVLTMVNLPNGEATSVVHNLVNTLTLQPIQDYDVTLGDYVTHDAYYAYAPNKDNTAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGFYWRGFYLIQTHDGLYLCHTGDETARAYLYGGVE
Ga0245207_100198Ga0245207_1001982F051934MGLMRRLAAAALAAVLVLSTALGERVMFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVLASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQATDLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQETSGRLEKTALITCAEEIARLSEDDRALYYYIEAFGLESGTDADIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEKTVFSYADADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNADVSFSLPASYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTVIIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245207_100198Ga0245207_10019854F082715MMKFMKRALSLLISAMLLLGCFALAEEAPLLPIAIVSYDLTDEATTAALSALIEPKDNALTRWERVVLSDGREAWVICQFDQATMSNAWSRVIDAETQEVLQEDTTDTGFFATAQARWESAKGIYALWSIQDKMLFDRLYAMAPCYGEPVEGDLTQEEALAAALNVTGLTAADYDAVGYGYIMGSSEQNGTWQVFFVKGNEVVSTVNLDARDATILLVEPDEEGNG
Ga0245207_100206Ga0245207_10020668F077320MKRKGMRRRRKLVLLAVLLIMVGIAVWRIWQTPRPTVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLLYIRDYSASGHYQIVWEGVSAEAAEVYLNALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEEPTATPLP
Ga0245207_100209Ga0245207_10020912F043413MTERELREKLQFAYGTMPDATRAAFEHSLTHHRVETRHPARMSRRMRTIVTLALMLMMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVALNEAYDGDLLTLSVNDLVFDGTTVEVAMNVQPKAGKQVFMDMEVTAECAGRAYTLEIEGCGGGDFMSGMFLPDETGSWSDSPFGFDGVLWDDDMQSPPEGVPIAWTMTFELLQPVWEVEYLPDAQYQALLAGGADTLEAYVQNNWRKHIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRRADTIIVQFTTDFADDYAHPELVGKRIGMGDYDLVIDNVNLSFMRANITMHYEFPAAYTEDEICQMTNLPNAWRVYVNGETDSGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGERWDACHPDAEKGFTLELQE
Ga0245207_100209Ga0245207_10020914F055739MTEQQLREKLQFAYGNMPDATRAAFEHSLTHHRAPETHRSIGLSRTMRIVITAVLMALMLTAVGVAAARFFSVTDVHPAQDGTEGDYQAHYLALEERYDSDLLSVSVNDAVYDGSVLAFTMEMTAKTDDVLAVEVRICGECDGQMYRFDPLDVYGGEFQSLLMLPDLGGTFDGEKYAAEGILLDETGQMPPEGKPIAWTIEIDVLKAVWQTETMPDDLYEALSEEDDVAQYIREQAEQHMITLTDAGVEDYLLEMCGAGWDEIEQMSKADLLLRCGGFERAETYTVAFATEGNTQYVQPELAGLRIPLDGYTAVVDYVRASFLGGCVVLHCEAPNGTALPNGTALPDVWRIYRNDEQNPAGEAGWARASGYAGVPGGVVDVNQPSICLYFAPCADLTSLRIVPEGEAGFTLNLSGEKGTE
Ga0245207_100220Ga0245207_10022015F042910LADRLYCALNGTTLRDLDARIHLLDVEELAPAVRTVTASRIGGGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPAMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITALTISAAGKISFQGLTLAPGAAIRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC
Ga0245207_100220Ga0245207_10022085F059982MRRMTRLLCLMLLFSLITFSCPLAEETDALAGTPAPTPMLTAVPESALAPFNVVLPEDAHVEMTEGRITLVRGDSRVVAMVISRVPDEDPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFDDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENASMDGAAVYVAEDAAATASPKV
Ga0245207_100267Ga0245207_10026755F041208MRKILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEVAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAISSYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPAAAQTQRDILLLMRDNLAEKGVDITQTFFAVQADDADGETNLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARTDGTFAESWVSSDTLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDTFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGLWGPQDGNG
Ga0245207_100267Ga0245207_10026761F101193MAQYNWTRCPHCGKLCKPSRFKAATVLVPIEFVIFVVCIFFRNSMNDAIGWCAAWLLFMLLLFLPQYIYVRFFMPYETLSEDETRKFRDLQEH
Ga0245207_100267Ga0245207_1002679F078003VNEHLAFAQCLQRVLNETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKHMGEQLTEAQWAALNRALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGTEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREEYEPNVTALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPCTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVDRLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKISGQ
Ga0245207_100469Ga0245207_10046934F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDARKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVAMLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIES
Ga0245207_100493Ga0245207_10049326F073574MLLPAAVRPYAADVDGTASRVRCAIALNGSKELRIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFDSRCDGYANVLDDRQPPAPIPLHPAICPKCRNAAFQLRLCFEYPDAEEVSAFANPSDAFTWVWVTMRCTRCHAVFRGDLECD
Ga0245207_100493Ga0245207_10049330F074898MRKILSLLLILALFLPCALAETPQGIDLALTSTYGDGLSLRMTAGLGETPFCSLTLPSGQIDLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGHSPSEALGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHHLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFRQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLDNENTNVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFQLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDTLGTIELTQLLDDLF
Ga0245207_100517Ga0245207_1005179F095494VSDPKEKEVQAILQAIDYGHLPPRETQRRLLNIIQAEAARTDAPTDETKIHTCMDLLERLQGEQKPIAPARVDALRQHIAAAHQKNERKRQKRKKIMAAAACSAAAIAVAFAVSHPLLWYENWTTSDEQQHFVTSHEIAIEMLETAVADPTLPSGDTVEVQSIAALDALIGRKTGIPEMVNGQWELQHRYVNFTRSGISISLMYVNAADAQQTIVGVINLISNPQYMMLSFEQSYEGTIQQFDGLNFYITENINKPVALWQGDDKLLLFSGRTSQEEVTSLLRTIIREIGE
Ga0245207_100650Ga0245207_10065020F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDDNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYAMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMYVCGSVLAISVMQENAIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVDIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRDGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVHDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTMNADLVGKMLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFSALVSQVHQGEISLEEFVEKADKLIEGLEQ
Ga0245207_101288Ga0245207_1012885F068811MDQDGSEHNICSNREGLCPGKEQHGASGWKKIFQHGKEPLRNKDSVSQYCNKKVAVSLILNENVSETLCIFTIDKTNCCRI

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.