NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029839

3300029839: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35390



Overview

Basic Information
IMG/M Taxon OID3300029839 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283902 | Ga0245267
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35390
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size107751436
Sequencing Scaffolds12
Novel Protein Genes14
Associated Families14

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
Not Available1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Unclassified

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039147Metagenome164N
F039198Metagenome164Y
F041208Metagenome160N
F043413Metagenome156N
F045105Metagenome153N
F050793Metagenome145N
F055739Metagenome138N
F078822Metagenome116N
F081354Metagenome114Y
F084124Metagenome112Y
F087336Metagenome110N
F090513Metagenome108N
F096287Metagenome105N
F101192Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245267_100059All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales134062Open in IMG/M
Ga0245267_100162All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales74969Open in IMG/M
Ga0245267_100212All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales63143Open in IMG/M
Ga0245267_100221All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes62125Open in IMG/M
Ga0245267_100225All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii61256Open in IMG/M
Ga0245267_100636All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales27201Open in IMG/M
Ga0245267_100817All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis21508Open in IMG/M
Ga0245267_101109All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales15524Open in IMG/M
Ga0245267_101759All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8956Open in IMG/M
Ga0245267_102060All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium7418Open in IMG/M
Ga0245267_102360All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae6435Open in IMG/M
Ga0245267_105920Not Available2174Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245267_100059Ga0245267_100059123F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDDNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYSMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMENLSCLNISYRDLETDKVYTYPASLTRMHVCGSVLAISVMQESGIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPKVILAQETLTKSAVNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVADYVSRLNGIYRDAVTRDSKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVHDWIGYCQAENIPLRFDHPVFREMMAALEAMRTDKIEQANQQVNEEIGDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTMNADLVGKMLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFSALVSQVHQGEISLEEFVEKADKLIEGLEQ
Ga0245267_100162Ga0245267_10016210F096287MQRIQKRRAGQRQTQNALIGLLSAVSMTRIGLTQLLPLCGSAAWWLSAACMLPGLCVHGALRLLLRYTHTRTLTDCARKLLGNFGSILIMLTLTLPLLLDGIASLTALITFFTEGIGARGSQFTLTLLTASVMLIALNRDGLPRGVYLLRHVLLVAAAIIAINALLDAHPDGIVPLLGEGVPPLLSGIRSAWGMSWMLLLLLEFPAEEGARRTPAMLAGLLPCPVILLLLSLTIPPELTVPGRSLASRLALPTLFLQPVVRTLAQCLLMMTLFLSIAGSAQLAARFLTSSCQKPKKWVPYALIGLLTLTQLFDISRLWRVLTALTAWSLVPGVLLLLVLTIARLCRREKA
Ga0245267_100212Ga0245267_10021210F043413MTERELREKLQSAYGTMPDATRAAFEHSLTHHRVETRHPARMSRRMRTIVTLALMLIMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVALNEAYDGDLLTLSVNDLVFDGTTVDVAMNVQPKAGKQVFMDMEVTAECEGRAYTLEIEGCGGGDFMSGMFLPDETGSWSDSPFGFDGVLWDDDMQSPPEGAPIAWTMTFELLQPVWGVEYLPDAQYQALLSGGADALEAYVQNNWRKHIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRRADTIIVQFTTDFADDYAHPELAGKRIGMGDYDLVIDNVNLSFMRANITMHYEFSAAYTEDEIRQMTNLPNAWRVYVNGQTEAGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGEKWDACHPDAEKGFTLELQE
Ga0245267_100212Ga0245267_1002128F055739MTEQQLREKLQFAYGTMPDATRAAFEHSLTHHRAPETHRSIGLSRMMRIVITAVLMALMLTAVGVAAARFFSVTDVHPAQDGTEGDYQAHYLALEERYDSDLLSVSVNDAVYDGSVLAFTMEMAAKTDDVLAVEVRICGECDGKMYRFDPLDVYGGEFQSLLMLPDLGGTFDGEKYAAEGILLDENGQMPPEGKPIAWTIEIDVLKAVWQTETMPDDLYEALSEEDDVAQYIREQAEQRVITLTDAGVEDYLLEMCGVGWNEIEQMSKADLLLRCGGFERAETYTVAFATEGNTQYVHPELAGLRIPLDGYTAVVDYVRASFLGGCVVLHCEAPNGTALPNGTALPDVWRIYRNDEQNPAGEAGWARASGYAGVPGGVVDVNQPSICLYFAPCADLTSLRIVPEGEAGFTLNLSGEKGTE
Ga0245267_100221Ga0245267_1002212F039198MKLTIAEIQGTKYVQIYLTEDELQKKETKDLAQKYKQEKYSVAIFVTGKENYPEILEKIITKQVELNKNVC
Ga0245267_100225Ga0245267_10022553F084124LNATAKAFARATKENTLHFHSQRNALTMFLAFLFIMAVACFNKLSFSTQTAWLFAPDKLLYLKIFMGAAQTRNF
Ga0245267_100636Ga0245267_10063628F050793MRKTGTIAPNDAEGEKTMRKYEVDVPENVEMADIQAIQTRIKKHNLLTVALLIPLFAGYALRESSETLGGIVFVLSMVLMVLGFISGRGDEKRLPAYWAKYQQIHRMDNLLRANLVGVWLCAEARVAVYREGSGYRVQLDVLDETTGAWENDEQDEVFPTLADVREYATQEGYEPMDVDFEQMTDEEFQQFLDRSRI
Ga0245267_100817Ga0245267_1008175F087336MHCLLRGMVAELEQVSKAFRAAGNQRRAAAKERIKDDAIGHGRVSDRILAEIEDNHMRERDTKIGLAEQRQVAFLGIAFQILPLKSKQKRAP
Ga0245267_100829Ga0245267_10082915F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMTQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAEDGSICRMRRERGAGSNHTADEVTRGFRRIFGYDMRAWTPLQLRVYILALSRADRSSMQTVHELFLSVGRDGFPDVPAEALTREEAISAALAIQGGSSSDVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGALLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASDAIQRQYQRNMAEEGYTCFIENDFEDTAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLQALLTSGLTDEAGLRQMETDGTLPLFNALLALEVVPDASTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLLDVRDSDLSVVGSVQIPSFSALRASILLPVRVWNTLAENLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATLRSFQSAISISGSYVGDWSVACLCQTIYPDIPDYAISQEVAAEYAARALGDDDYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRAMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNV
Ga0245267_101109Ga0245267_1011092F101192MTYWGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQIAKAAPTSITPCGEDTICRWVSGRYQIALRFNRYDVCLGVEEEIDG
Ga0245267_101759Ga0245267_1017591F081354LVRENQQSSVHNFVKDFSPIFLKTSRRSPLKKGAVTEIPLEYGKTEVQIPFEPLQAAISSMELPKNFENKKLPNPFCLPLQGHKLSPPARERNNPLPHF
Ga0245267_102060Ga0245267_1020605F041208MRKILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARSAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGTLPDNLDEAISSYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVMLDGTPYRFTMRLDMMAFTSVENVNLHPAAAQTQRDILLLMRDNLAEKGVDVAQTFFAVQADDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDTLTENDAAPAAPKREPIPTLESYQITLSGTAYTAFAINKVEKGYDAFADIAGTRMTVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGMWGPQDGNG
Ga0245267_102360Ga0245267_1023607F078822QRSYFLSSTFFDIFLSLPNAFLKAFHPHAIRFPAAFLLVHRFYLAFEELLSCATAYL
Ga0245267_105920Ga0245267_1059201F090513MAKYVRKMEWKLLHIKHILYMQPFGALKIAPQSVGNKNGTKAVLQGWAAAFVPYMLSFTYGNTA

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.