NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029707

3300029707: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_36698



Overview

Basic Information
IMG/M Taxon OID3300029707 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283879 | Ga0245244
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_36698
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size208103661
Sequencing Scaffolds14
Novel Protein Genes18
Associated Families17

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → cellular organisms → Bacteria2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → Streptomyces longisporoflavus1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → unclassified Subdoligranulum → Subdoligranulum sp. APC924/741

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032312Metagenome / Metatranscriptome180N
F039147Metagenome164N
F043413Metagenome156N
F044555Metagenome / Metatranscriptome154N
F045105Metagenome153N
F047126Metagenome150N
F050793Metagenome145N
F051935Metagenome143N
F058154Metagenome135N
F064817Metagenome128N
F073573Metagenome120N
F074898Metagenome119N
F075480Metagenome119N
F078003Metagenome117N
F084124Metagenome112Y
F094005Metagenome / Metatranscriptome106N
F101193Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245244_100022All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales303392Open in IMG/M
Ga0245244_100089All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales161112Open in IMG/M
Ga0245244_100150All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales125972Open in IMG/M
Ga0245244_100151All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales125959Open in IMG/M
Ga0245244_100162All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales119048Open in IMG/M
Ga0245244_100239All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales95031Open in IMG/M
Ga0245244_100289All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales81091Open in IMG/M
Ga0245244_100379All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales66590Open in IMG/M
Ga0245244_101573All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae16491Open in IMG/M
Ga0245244_101809All Organisms → cellular organisms → Bacteria14608Open in IMG/M
Ga0245244_102667All Organisms → cellular organisms → Bacteria10450Open in IMG/M
Ga0245244_103863All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii7515Open in IMG/M
Ga0245244_118272All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → Streptomyces longisporoflavus1746Open in IMG/M
Ga0245244_130664All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → unclassified Subdoligranulum → Subdoligranulum sp. APC924/741004Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245244_100022Ga0245244_100022223F073573MPTIVSFYQRFPNEAPPLNLSAFDRTGYTYAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMLFQVRRMELRTGDTTYAILPDNVQQYTKRGDTGGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLTAAQKKAIHRFCMDCGESAITYQPAFLWYKDYCTKA
Ga0245244_100089Ga0245244_10008951F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPNNVDEATPFQLRPTELSICSVTRAMGGVVVLANDNNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQQSCEYSMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMYVCGSVLAISVMQENGIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYALSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRNGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTMNADLVGKMLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFNALVSQVHQGEISLEEFVEEADKLIEGLEQ
Ga0245244_100089Ga0245244_10008964F051935MKRLLGLLLAMMVMMEGISCAVAENANPLVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAENLKEMRFLPVVAQKGNQLRLLILRRQGDPWKVSEQNNRALMRDGWTLQNFSAIGYGNSDSADVYFYFSDENQQSWELVMELGDANVSYFSLIYNAEGYGITEIIMSYDCGMKFQVDAPGYLQLSYEVDPVEEYSCRVEDFDLATCPLSMQELLVPAVVSCGEAGAELYIALQQNIQPIFVLADGEAIEAIPQRWQRDWVIVCYRGNYLFMKTENCKMEE
Ga0245244_100150Ga0245244_1001503F074898MRKILSLLLMLALFLPCALAETPQGIDLALTSTYGDGLSLWMTAGLGETPFCSLTLPSGQINLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEVLGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHHLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFSQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPEAIGTIELTQLLDGLF
Ga0245244_100151Ga0245244_10015136F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRIQAPAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGRPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVATLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIEN
Ga0245244_100162Ga0245244_10016263F050793MADIQAIQTRIKKHNLLTTALLIPLFAGCALRESNETLGGIVFVLSIALMVLGFVSGRGDEKRLPAYWAKYQQIHRMDNLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLADVREYASQEGDEPMAVDFETMTDEEFQQFLDRSRI
Ga0245244_100239Ga0245244_10023928F075480MNKTKQEKWQRAYGDTPDSFRQRVASALPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTHTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGVYSPENGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLVPVRMEIRGSEQFPNIPMFCDFLTQNPDGTVSVGFQVDLSEADVSHLKSCEVQLECRVGAFGKDGKATQWQKEILTATITFK
Ga0245244_100289Ga0245244_10028971F101193MARKCTEYHYFQHTYWDAMARYNWTRCPHCGKLCKPSRFKAATVLMPIELVIFVVCIFFRNSMNDAIGWFAAWLLFVLLLFLPHYIYVRFFMPYETLSEDETRKFRDLQEH
Ga0245244_100289Ga0245244_1002899F078003VNEHLAFAQCLQRVLNETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLENFHKQMGEQLTEAQWAALNRALEMDTVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGAEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREAYEPNVTALAEIHQARMTNMMRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPCTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKISGQ
Ga0245244_100379Ga0245244_10037946F043413MTERELREKLQSAYGTMPDATRAAFEHSLTHHRVETRHPVRMSRRMRTIVTLALMLMMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVALNEAYDGDLLTLSVNDLVFDGTTVEVAMNVQPKAGKQVFMDMEVTAECEGRAYTLEIEGCGGGDFMSGMFLPDGTGSWSDSPFGFDGVLWDDDMQSPPEGAPIAWTMTFELLQPVWEVEYLPDAQYQALLAGGADTLEAYVQNNWRKHIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRRADTIIVQFTTDFADKYAHPELVGKRIDMGDYDLVIDNVNLSFMRANITMHYEFSAAYTEDEIRQMTNLPNAWRVYVNGQTEAGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGEKWDACHPDAEKGFTLELQE
Ga0245244_100419Ga0245244_10041937F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAEDGSICRMRRERGAGSNHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADHSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSDVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGTLLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASAAIERQYQRNMAEEGYTCFIESDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLQALLTGSQTDEAGLRQMKTDGTLPLFNALLALEVAPDASTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVDSVQIPSFCALRTSILLPVRVWNTLAENLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATLRSFQSAISISGSYFGDWSVACLCQTIYPDVPDYAISQEVAAEYAARALGDDDYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRVMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNI
Ga0245244_101573Ga0245244_1015737F084124LLFGGLNTTALAFARATKGNKHHFHSQRNALTMFLAFLFIMAVACFNKLSLSTQTVWLFAPDKLLNWKILWALPKPAIFEKIE
Ga0245244_101809Ga0245244_1018091F094005MKKEVIKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLDGQFTRLDGKGNAIILPCTTENEQKYNSFVAYHQSKKSLEAIKKWLDDNGKHKDDETLEKVITLDKKLKKLIEKLNE
Ga0245244_102667Ga0245244_10266713F064817MNIKNLFNRFRKREPELSYSLNLIYLEDTKVVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKSRYWVYDKEPYLLYYKVPLIVNLSRKLSGKSDMVITKEKYQAAKDLVPAHEVSDRFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQHNWLKEFELLVFKRNYDTDILFLTHSLTYILDQTEEEGRRICIQNIIERNINQENQDENETV
Ga0245244_102667Ga0245244_10266721F044555MKTTNPSSRITISQNGNQILTCKVYKEPNYILSMSNEEILELISGLDYIGNLPTVPDLEKPIEIQVSTTRQIPLEQNKEVQTKIKEIIYNNLYDTLVDELKDTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHHKR
Ga0245244_103863Ga0245244_1038632F084124VFGGLNKQAKAFARATKENKHHFHSQRNALTMFLAFLFIMAVACFNKLSFSTQTAWLFVPDKLLYRKILWALPKPAIFEKIE
Ga0245244_118272Ga0245244_1182722F032312MARIKDYDEDLSAPKLLRERARDSKGRFIKKDLPPYLGSEQVLKPKNYYHFDSHGNYKGSSMNFDALVCLGFTWFKLLGVALMILLWPIVFIYALNDGIEGYPFKKYAIPYIFILVVWFIIFLYGLVS
Ga0245244_130664Ga0245244_1306643F047126VYLKTKKMTNISQTQAGFKSKSLGILCGFQGFLTQNPAALVETDDIFDVSDTPHGFSGLNTLRAAGVPNPPPPFAQRFIACFCSQTAAASQSKSAYILSSLESPCILCSLLRYFHILSKKLQKTC

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.