NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029732

3300029732: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37878



Overview

Basic Information
IMG/M Taxon OID3300029732 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283901 | Ga0245266
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37878
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size169064199
Sequencing Scaffolds17
Novel Protein Genes27
Associated Families26

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6
Not Available3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Dorea → Dorea longicatena1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → Subdoligranulum variabile1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032312Metagenome / Metatranscriptome180N
F039147Metagenome164N
F041208Metagenome160N
F042910Metagenome157N
F042936Metagenome157N
F044555Metagenome / Metatranscriptome154N
F045105Metagenome153N
F050793Metagenome145N
F051934Metagenome143N
F056309Metagenome137N
F060985Metagenome / Metatranscriptome132N
F064817Metagenome128N
F070133Metagenome123N
F071325Metagenome122N
F074964Metagenome119N
F078003Metagenome117N
F078822Metagenome116N
F088920Metagenome109Y
F090484Metagenome108N
F090514Metagenome108N
F094005Metagenome / Metatranscriptome106N
F096287Metagenome105N
F099406Metagenome103N
F101192Metagenome102N
F101355Metagenome102N
F101357Metagenome / Metatranscriptome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245266_100015All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales371718Open in IMG/M
Ga0245266_100022All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales318533Open in IMG/M
Ga0245266_100072All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales173910Open in IMG/M
Ga0245266_100098Not Available147127Open in IMG/M
Ga0245266_100102All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes145306Open in IMG/M
Ga0245266_100129All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales132418Open in IMG/M
Ga0245266_100155All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales120094Open in IMG/M
Ga0245266_100246Not Available85105Open in IMG/M
Ga0245266_100614All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Dorea → Dorea longicatena39593Open in IMG/M
Ga0245266_101035All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae24797Open in IMG/M
Ga0245266_101521All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Subdoligranulum → Subdoligranulum variabile16894Open in IMG/M
Ga0245266_101584All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes16210Open in IMG/M
Ga0245266_102789All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales9120Open in IMG/M
Ga0245266_104007All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis6264Open in IMG/M
Ga0245266_105393All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae4541Open in IMG/M
Ga0245266_111811Not Available1868Open in IMG/M
Ga0245266_123624All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii870Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245266_100015Ga0245266_100015334F041208MRRILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMHAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAISSYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDVAQTFFAVQTDDADGETNMTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDTLTENDAAPAAPKREPIPTLESYQITLSGTAYTAFAINKVEKGYDAFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGMWGPQDGNG
Ga0245266_100015Ga0245266_10001553F101192MTYWGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQIAKAAPTSITPCGEDTIRRWVSGRYQIALRFNRYDICLGVEEEIDG
Ga0245266_100022Ga0245266_10002217F050793MRKYEVDVPENVEMADIQAIQTRIKKHNLLTVALLIPLFAGCALRESSETLGGIVFVLSIALMVLGFISGRGDEKRLPAYWAKYQQIHRMDNLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLIDVREYASQEGYEPMDVDFEQMTDEEFQQFLDRSRI
Ga0245266_100072Ga0245266_10007294F096287MQRIQKRRAGQRQTQNALIGLLSAVSMTRIGLTQLLPLCGSAAWWLSAACMLPGLCVHGALRLLLRYTHTRTLTDCARKLLGNFGSILIMLTLTLPLLLDGIASLTALITFFTEGIGARGSQFTLTLLTASVMLIALNRDGLPRGVYLLRHVLLVAAAIIAINALLDAHPDGIVPLLGEGVPPLLSGIRSAWGMSWMLLLLLEFPAEEGARRTPAMLAGLLPCPVILLLLSLTIPPELTVPGRSLASRLALPTLFLQPAVRTLAQCLLMMTLFLSIAGSAQLAARFLTSSCQKPKKWVPYALIGLLTLTQLFDISRLWRVLTAFTAWSFVPGVLLLLVLTIARLCRREKA
Ga0245266_100098Ga0245266_100098105F090484MKLQLGRNINISLRLLERWSDDSLFMELYALYCMIKISRRDSRIRFKNKKDLLHKLGIGYSKFKNMTGHPMFDELFRMTNSTFVARRYRVNGVQLTLGCGKVNIPKNRILIKIKKNEITNHEKVLDRIREAMFVNLVRNNESVLNSGETNSQAEVVDGSHSYYGLIDSTISNKTIALYLNVGLTKAKEIVGMAIQDKLVKRFENIQFITYVDNPRAYIEANEHNYPIGKLIPVYRHGAVFWQIANTWTLYKKGATNRWYFGEKDIEKGEKEKVSKKDDFNFFLKDNTHILRFLNAEEVVSEDGEILGIDRKKTKEEEARSLASVMAKEAHKDFWDGYERSTQNQIIRKYYRAIIAEDKKRRMDMFLNRLKQSYDKVSGWSKEKVATVKAGMADAEACCAEVGTSVAGVCGRVSRRMKSYNNTDTDKKAGFNEVRDMYAEFAGEMAKAVGSVSEDIYTYVKAEQFKEKIGNMDISTQSLPNISITVDNDKELDGESIFKDIPLEELSFYNDTYLYPSSQYSSL
Ga0245266_100098Ga0245266_100098126F099406MAEIGYNSKFEGQEVDSRLENVVQAAPGTGSESGKGGLIPAPPAGSQDGSKTLLSNMTWGDHVTKQYIDDAVSAAGWKKQIVSKLPTVEEAKDNVMYLVKDDVASTETKNVYNKYILVTEEGGTKVLESLGMVSTGVDSSYLDLSIFPSTSGTLDEDSYAKVLNAYNNNITLGKLSSYYFSLDYFLDNDNSELKIIAVLFNNTNSKEDVSGSYIDIEMVTYVVSQDKTYRAIANTATLSNDMLSYLKFMAKTPNVVTTLASLPIDAHNIIANVASATNLFMAVSAEDVGREWQVRVNNTTGTDITQPLPTYGLFQSMSGDSVVVPKNSFIELSIWYINDKLVIRVGEQA
Ga0245266_100098Ga0245266_10009880F074964MITKKNVNKLQNAVIKENASNLVGAVKLYNALFANGADLKAICKTLEIPAEYAVKVAALAKDKKRLVTVCSQMLPKVGDTFIKFTLYSKVYKDSKVDKEKGIEAKTADWCAENVIYGGEYKSFGFSTAETLETKKSAKWLVKETDEYKATYVAVKIKSYSIRTVAKCVSEYLAHESNQQ
Ga0245266_100102Ga0245266_10010214F042910LADRLYCALNGTTLRDLDARIHLLDVEELAPAVRTVTASRIGGGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDTAITTLTISAAGKISFQGLTLAPGAAIRIHHNAGVFAAEMVSDDSTVSILPYRTPDSTDDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC
Ga0245266_100129Ga0245266_10012929F051934MGLMRRLAAAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQSTTLGDETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGLRQAIADLTAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGADANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245266_100155Ga0245266_100155101F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDDNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYAMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNISYRDLETDKVYTYPASLTRMHVCGSVLAISVMQENGIKVVLVDLTDGAIREIAGESLEAMYEWADGELLLWRLEGSPNEISRTSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEEPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRNGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTTNADLVGKMLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFSALVSQVHQGEISLEEFVEEADKLIEGLEQ
Ga0245266_100155Ga0245266_10015589F070133MKRLLGLLMAVMVMMGGMAGAEASTDNASMQLIRMNPLAFRKEPVELYSVTHTPNGSFVVIYFTEGEKSELQEMWMELFDSVGTSLLSAKLGEFDPNGEQIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTIKKLKFGAAPYAQHVGDYMVEKQAHSEDESPFRTVKITHIASGKSKKLMIYDWSFCACSDQDGNLLIAQQNEKGNLEIRSYNAAMQESIVELSGDFLQNENVRDAACIGQTAYMRIRLTNEKSEILLYDITQQKITDSQTLLAVDDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLPLQHESCLYIFTDVEQADVTTIEMDEKSHSYFVCSYSISAGE
Ga0245266_100238Ga0245266_10023856F045105MSDKLKRVLDTNLAGLHVTDAQVNAVLRRVSQDEQRPRVIRWQAIAAVLLVLVLGAGVLLQTGILSDDSPQLTADVTRQGDFLTAAQVKKLLASAADLQFSVDEEAISALMAQVEKDGGCSLSTLLEILCPVGSSLGQWDIRAQVALSQTLERIGYQGILPGMTREPLSTEITAQDALARAVAYIRTNDDPQADFTNLDFYRVGIRFLSGVYDGTCEGAYYCVNFDALDAFGTTYEVAVNAEDGSICRMRRERGAGSNHTADEVTRGFRRIFGYDMRTWTPMQLRVYILALSRADRSSMQTVHELFLSVGRDGFPDVPAEALTREEAIDAALAIQGGSSSDVLAAEYLAGASGDFWKIAIRQPNAPSGQSIVYLELNGLTGTLLTTTYSGNLYGLPQEFFPSQLLSTLDRTDFSHGVPAVDAETAQSAASDAIQRQYQRNMAEEGYTCFIESDFEETAGGFSYYTGGGVVIFTKDAQNTSQGDIYWAALNWYGEVLDVGWNRNPLDAARFTLLMQGYLPPIYQRETVQQLHALLIGSQTDEAGLRQMETDGTLPLFNALLALEVVPDASTKSTADDVTAAALKSLNAHLCYENSSFLVYREDGELIWHFWLSTDMGYFLMDVRDSDLSVVGSVQIPSFSALRTSILLPVRVWNTLAEDLRVTIFYRDANTQPGIVYGMYANHIVQRYVDLYGANILRWDQATFRSFQSAMSISGSYFGDWSVACLCQTIYPDVPDYAISQEVAAEYAARALGDDDYSLRGGVLIDPGEGDPIWKVTLDYPDGRSFNAEVDCRTGAIRTLRQQDTRAMPFYTDYLDFPDDGEYWFRNFVLDEVIEQVRTQMTGRYGNNV
Ga0245266_100246Ga0245266_100246101F064817MNIKNLFNRFRKREPELSYSLNLIYLEDTKIVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKSRYWVYDKEPYLLYYKVPLIVNLSRKLSGKSDMVITKEKCRAAKDLVPAHEVSDGFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQHNWLKEFELLVFKRNYDTDMLFLTHSLTYILDQTEEEGRRICIQNIIERNINQENQDENETI
Ga0245266_100246Ga0245266_100246106F101357MNLNNITTVLKTGITIYQYEQWQNTGSVNLMQKESHMLSKVWLKTNIYNPDSLDKPFIQLSATFTSESDIQEYNEWLNANQYKLYPLLLDILKISLKDDFYNYSNASNIHYEGGKSPSMLTIQLFNLEF
Ga0245266_100246Ga0245266_100246112F044555MKTTNPSSRITISQNGNQILTCKVYKEPNYILSMSNEEILELISGLDYMGNLPTVPDPEKPIQIQVSTTRQIPLEQNKEVQTKIKEIIYNNLYDTLIDELKDTISRFQAQYNIQKINPYLQDILQNPEDLVPQKIK
Ga0245266_100246Ga0245266_1002465F060985MSNIDEKAKNNFTIEMRIFENYEKVKYEIIKAIDFLRHAETNLGMCRIFYNQNHEFWHSVIKPWFKPERFGITHLWFTSGFSFIGYGEYHTIRGNRWLKTPIDKIDRENRIFGYWFPPYKKYIPHRIKVLKLALKDLERIKEEYGKD
Ga0245266_100246Ga0245266_1002466F032312MARIKDYDEDLSAPKLLKERARDSKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDALVCLGFTWFKLLGVVLMMLLWPIVFIYALNDGIEGYPFKKYAIPYIFILVAWFIILLYGLVS
Ga0245266_100246Ga0245266_1002467F094005MKKEIVKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLDGQFTRLDGKGNAIILPCTTENEQKYNSFVAYHQSKKSLEAIKKWLDDNGKHKDDETTEKVITLDKKLKKLIEKLNE
Ga0245266_100614Ga0245266_1006141F056309RTPSLSVVPLLLGDSDSLSASYGMEPEQEALSIVSFKD
Ga0245266_101035Ga0245266_1010351F078822ISKVVLFVKHFFDIFLNLPNAFLKAFHPHAVRFPVAFLLVHRFYLAFEELLSCATAYL
Ga0245266_101521Ga0245266_1015215F042936MGRGGGKGRVWKTKEGIMKTSGIVDRGEDTIRNFEKVEQEGALTPPYRGKVYFRSRLRGK
Ga0245266_101584Ga0245266_10158410F078003VNEHLAFAQCLQRVLNETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKHMGEQLTEAQWAALNCALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDASGAEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFAWMGMTREEYEPNVTALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPYTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKLSGQ
Ga0245266_102789Ga0245266_1027891F071325ALLSDSSIIISKVVLFVKHFFDIFLSFPNAFLKAFLPHAVRFPAAFLLVHRFYLAFEELLSCATAYL
Ga0245266_104007Ga0245266_1040071F078822LSKRLLKAFHLHAVRFPAAFLLVHRFYLAFEELLSCATAYL
Ga0245266_105393Ga0245266_1053938F101355MIEPPFQHGIADMAFWFLQWYLPSAQPPQPKGSGAVFPYVLPRCSYFFKSFVTEMSIFICMHKCLAQMGRLRGSSCHIVVAAKRACACTLLWISDHFYKKLLPYVLFFFFKIYLKKIDFFQNIA
Ga0245266_111811Ga0245266_1118113F088920VSARPHYYLAFGAGLILHLILHFSQKAAIFAPKRAILLIFVPTLFFACLAAFSLHTSPKISGQTSLYQPWLSPYCLFKD
Ga0245266_123624Ga0245266_1236242F090514MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADVPPRLTGAKWFVILLTIVPAGSRAWAGSSLSLSPASIFF

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.