NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300029729

3300029729: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37229



Overview

Basic Information
IMG/M Taxon OID: 3300029729
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0133139 | Gp0283827 | Ga0245191
Sample Name: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_37229
Sequencing Status: Permanent Draft
Sequencing Center: Beijing Genomics Institute (BGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 160969182
Sequencing Scaffolds: 9
Novel Protein Genes: 24
Associated Families: 24

Dataset Phylogeny
Taxonomy Group | Number of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 1
Not Available | 3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes timonensis | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → unclassified Roseburia → Roseburia sp. | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → environmental samples → uncultured Clostridium sp. | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal distal gut

Location Information
Location: United Kingdom: London
Coordinates: Lat. (°) 51.5 | Long. (°) -0.12 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F032312 | Metagenome / Metatranscriptome | 180 | N
F042095 | Metagenome | 159 | N
F044555 | Metagenome / Metatranscriptome | 154 | N
F050794 | Metagenome | 145 | N
F057001 | Metagenome | 137 | Y
F058555 | Metagenome | 135 | N
F060985 | Metagenome / Metatranscriptome | 132 | N
F064725 | Metagenome | 128 | N
F064817 | Metagenome | 128 | N
F067720 | Metagenome | 125 | Y
F074964 | Metagenome | 119 | N
F076653 | Metagenome | 118 | N
F078006 | Metagenome | 117 | N
F083451 | Metagenome | 113 | N
F089005 | Metagenome | 109 | N
F089054 | Metagenome | 109 | N
F089591 | Metagenome | 109 | N
F090484 | Metagenome | 108 | N
F094005 | Metagenome / Metatranscriptome | 106 | N
F097172 | Metagenome / Metatranscriptome | 104 | Y
F097493 | Metagenome | 104 | Y
F099406 | Metagenome | 103 | N
F105374 | Metagenome | 100 | N
F106193 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0245191_100077 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae | 152053
Ga0245191_100084 | Not Available | 146390
Ga0245191_100222 | Not Available | 86481
Ga0245191_100240 | Not Available | 81924
Ga0245191_100379 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 57758
Ga0245191_101680 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes timonensis | 15835
Ga0245191_101874 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii | 14045
Ga0245191_116480 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → unclassified Roseburia → Roseburia sp. | 1222
Ga0245191_118975 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → environmental samples → uncultured Clostridium sp. | 1069

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0245191_100077 | Ga0245191_100077107 | F078006 | MDNVLKRAAAELKEAGCRVFAWQDDTYNRSWSKGDYIMLYYAFPDSPNIGYLSRGEYGMSVAYSRAYIPSRGSGSGCRVKEEATFDLETALDVLNGPLPMWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI
Ga0245191_100077 | Ga0245191_10007712 | F050794 | MQDLRIQRVKVLMMLYTSNYFVNVRQKQLLDHSYALSRDQAFDYMTEFNKRLSDKVGIKCTMDVLLPTDDDNANIIIEHNGIIKKLMKEAEKLELDTDAIKTMMRDLLNELKDDIDLNILIFDVSQLLIKYNLFRLEAITEQEFKNSFVRMDSRNMEIKKLTLFDIKEVITMIEDRYSYALYMTEECD
Ga0245191_100077 | Ga0245191_100077154 | F083451 | MIMDKNEREKQVLDLLMSRKDIRKLVEKSNECYSKMDFVGAMKYRQEIKDIVDRESKIMLTKSESLIGLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFVKFDNLDRLMAECKKEIDYLMKGTSKSFQISFAVRSDELREMIENMVGDNIREGYDIFKEEAEMVNETDRSKIEEFNKKLDHDQM
Ga0245191_100077 | Ga0245191_100077164 | F076653 | MTIRDKYFGWKDIFFDRFVHCCNEKSDQPQGSNIPLAKINFDNKTGYVEDGTINIAELLQYLWINNKVYECEYAPIDISSVLQTLIRLTENAKFIFDDQPGIHDMIPYRGFFLRDDFLPGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSILAKELGYDMNSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDNFLGKEYRKNLRTFIYRGIVFFLDRIWHTPLFEKMGVKMKYNAYYCYAATSGIWYSKGFKERLAKRFNRSLSGGGEPFGANLACMVCDRKDIDWEALRFWLEKYDDPTDKGMVNSPIQFMYLYLYYTFNK
Ga0245191_100077 | Ga0245191_10007723 | F064725 | VIQLAREGRLDFFCITDSNTYLCAKDLNMTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAVMSEAIVKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITEKLYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYDWDNYDITISGNAMYTGFDGFIYIIYRATPKDDDGLPYIPETALGYLEDYVETYIKMKIFENAAVNGLIQGAGDAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSYERMWPNAFDKYIKLV
Ga0245191_100077 | Ga0245191_10007739 | F106193 | VGCNTCKEKALRAERERIERSMMNHSSSTAVSDMEYASRSTAGCMVMQDPLQTMERDVVSIYRQVRTKGDGVGVSYLNMQKKIREWIKNLPYECPPDEEVQEMRKEILDGRSKHIKP
Ga0245191_100077 | Ga0245191_10007747 | F042095 | MAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVVGDPYNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGEDNYRIIFRSLAIQHKKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYANIQVCKMTSFPIVDIPGLEFLVVSHTLYVNDGIPVDKLSRSKKLIYIDLQNIGQRMTVIPEAITSKTEVYYLNMFNMLDLRDIESSGIRNIKNMKNLQTLELFSCYLDRYIKEFNDLPKLTSLRIHPGPSDMWNYFDINTLPFFEVDKINPNITNFDFLNDWVSGERRTGWNDDNMSGRGLDHLTGFFVYHSNSIRVDKLPDYIYEMRSITWFVMDYSTHSQKRSDDFVNSFYDLVVGWNQITMASVAKDGERNQFYGLAVSMYGSQYPDENQRPSGTEQAPEGFVKGSSNGFPATPMEKIYVLKNNYAQRWTIKPE
Ga0245191_100077 | Ga0245191_10007753 | F089591 | MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLIRYYLNRKDDETIRSIQTLLVYWKSLNKINHIDDPESTSFGLPDDYLWFSNIKGAFSYNGCEVGDFVMWEAKNENVHELLGDDNNRPSFDYRETFYTIGDGKVVVYEDGFLTDEVRMTYYRNPVRVDLAGYINAAGGRSTDIDPELPDPLVEEILDMVAKQFNLNENELSRYRMDKDNVASFK
Ga0245191_100077 | Ga0245191_10007766 | F058555 | METKIALLQKMKSNFDKILTEAYIPKDIQAKKDELGCLRLPAGSLVCPVDYKPVTNKDGKKVTAVKYSNKKDNIRGSGMVIEKKCKQVTAYLSIINVQKHVFLRNRMRDGYRDRIEINTDDFIDILSDGIAYFCYRHVIENCHEDIDYQLKTLKAYAEGEIRIALSDIMIYSYKAKKNEDTKEIFVGKKRSVYKCLDKNLSSDERRNMANKSRKLDRVRILSKIIFRARTRNVHHIYKVTKRKTIKFNVAYLLNELNKKLAGIGMHEISQSTIYRYISMFLGMCKKSISDLYDEVKKNNGIANAKDRKNVTIGHLRLSYRGKIMHIIIAEDFIKDVFLGVKGPEMSKAG
Ga0245191_100077 | Ga0245191_10007793 | F057001 | MISKEINKVQNEVKKSNEKTLTGAVKAWCNLFKSGKEINDILKENDIKVSKEVVPALVALAKDKEVVIQLCKEILPRVNNTFCAYKEVEREYYDKNEQDKNKKLKMSEIEDIAILGSSHKRFGYNEPIEYDFGIYYETFNGTDKRIVKCAVPIKRYTFSLIAKCVTYYLTHPKNDR
Ga0245191_100084 | Ga0245191_100084147 | F099406 | MAEVGYNSKFEGQEVDSRLENVVQAAPGTGSESGKGGLIPAPPAGSQDGSKTLLSNMTWGDHVTKQYIDDAVSAAGWKKQIVSKLPTVEEAKDNVMYLVKDDVASTETKNVYNEYILVTEEGGGKVLESLGMVSTGVDSGYLDLSIFSGDSGSLDESSFAKVLDAYNNNITLGKLDGDYYYLNYFLEGNDFENNFKLKIVFASFANTDSAVGASEYDIEIQVGTFVVIQDKTYEAMNNLVTLSNTILSYLNFMAMPPRVVTTLANLPKGAHNIIANVASATNLSMTVSSEYVGREWQVRVNNTTGTDITQPLPTSGQFQSMSGDSIVIPKNSFIELSIWYINDKLVIRVGEQA
Ga0245191_100084 | Ga0245191_100084168 | F090484 | MDIAKNPSDFFLLQIMKLQLGRNINISLRLLEQWSDDSLFMELYALYCMIKISRRDSRIRFKNQKDLLHKLGIGYSKFKNMTGHPMFNELFRMTDSTFVARRYRVNGVQLTLGCGKVNLPKNRILIKIKKNEITNHEKVLDRIREAMFVNLVRNNESVLNSGETNSQADVVDGSHSYYGLIDSTISNKTIALYLNVGLTKAKEIVGMAIKDKLVKRFENVQFITYVDNPRAYIEANEHNYPIGKLIPVYRHGAVFWQIANTWTLYKKGATNRWYFGEKDIEKGEKEKVSKKDDFNFFLKDNTHILRFLNAEEVVSEDGELLGIDRKKTKEEEARSLASVMAKEAHKDFWDGYERSTQNQIIRKYYRAIIAEDKKRRMDMFLNCLKQSYDKVSGWSKEKVATVKAGMAYAEACCAEVGTSVAGVCGRVSRRMKSYNNTAPDKKAGFNEVRDMYAEFAGEMAKAVGSVSEDIYTYVKAEQFKEKIENMDISIQSLPNISITVGNDKELELDGESVFKDIPLEELSFSSDTYLYPSSQYSSL
Ga0245191_100084 | Ga0245191_100084191 | F074964 | MITKKNVNKLQNAVIKENAANLVSAVKLYNALFANGADLKAICKALEIPAEYAVKVASLAKDKKRLVAVCSQMLPKVGDTFVKFTLYSKIYKDSKVNKEKGIKNKEVKNIAYGEEYKPFGFASPEPLEGKISAKWLTRETDEYRVTYVATRIASYSIRTIAKCVSEYLAHESNQQ
Ga0245191_100222 | Ga0245191_10022290 | F089054 | MLTSGKFLVSFEVPGQLPGTTEGFCEEMNVVYRTEELNTYLRYPKQEINPWHKHSTYIRLKLREILKVNLTDITLIDIISLP
Ga0245191_100240 | Ga0245191_10024041 | F094005 | MKKEVIKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLDGQFTRLDGKGNAIILPCTTENEQKYNAFVAYHQSKKSLEAIKKWLDDNGKHKDEETFEKVITLDKKLKKLIEKLNE
Ga0245191_100240 | Ga0245191_10024042 | F032312 | MARIKDYDEDLSAPKLLKERARDNKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDAMVCLGFTWFKLLGVALMMLLWPIVFIYALNDGIKRYPFKKYAIPYIFILVVWFIVFLYGLVS
Ga0245191_100240 | Ga0245191_10024043 | F060985 | MSNIDEEAKNNFTIEMRIFENYEKVKYEIIKVIDFLRHAETNLGMCRIFYNQNHEFWHSVIKPWFQPERFGITHLWFLSGFSHIGYGEYHIIRSNRWLKTPIDKIDRENRIFGYWFPPYKKYIPHRIKILNARDVFILWLLCVLCLGMPLTCV
Ga0245191_100240 | Ga0245191_10024047 | F044555 | MSTTNPSSRITISQNGNQILSCKVYKEPNYILSMSNEEILELISGLDYIGNIPTVPDLEKPIGIQVSTIRQIPLEQNKEVQTKIKEIIYNNLYDTLIDELKGTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHHKR
Ga0245191_100240 | Ga0245191_10024051 | F064817 | MNIKNLFNRFRKSKESELSYSLNLIYLEDTRVVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKSRYWVYDKEPYLLYYKVPLIVNLSRKLSGKSDMVITKEKYQAAKDLVPAHEVSDRFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQHNWLKEFELLVFKRNYDTDMLFLNHSLTYILDQTEEEGRRICIQNIIERNINQENQDENETI
Ga0245191_100379 | Ga0245191_1003793 | F067720 | MASTTYRHLGDVTGMFAAQEQFRDITKMVCARFRGLTKTYHLGNVNKLVTFCHRFAVIGNMARNAGQLPQPFWLGATRGGGSHSLSASVARA
Ga0245191_101680 | Ga0245191_1016804 | F105374 | MKKRVVLLVALCIWKVVAAQTPYGEMPERFRPDTLPCRLGGGVCFGMDGLDAAIPRGGGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRTRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVARSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP
Ga0245191_101874 | Ga0245191_10187416 | F089005 | LTKSKQRSIIALALLRLATSSEESKQALKVRRTLKIEQRENSKETRNDFEESSKNYSEM
Ga0245191_116480 | Ga0245191_1164802 | F097172 | MKKTMLKKLMCAVLTAVCVATAVVPAMADDVITAEAATKKVTSAYKYHLEGYDKNGYPVSGFSKTSFYKDLNSLPAVKTGKTTINVPAVTSSVKSVSKEKGNPEYESYVKFKAQKTGKYVFTIDNLQGTDDKSLKCLNIYICRIAKNGKKYFLEDDLYPDTVGNYGDLYENNYLARLRTILDNYKEEHPEYADVIEETYEYQKDFVNKYPXXXXXXXXXXXXXXRPMYSSLTIGVCKKLSRLTLLHTAAMNRVVCGVETT
Ga0245191_118975 | Ga0245191_1189751 | F097493 | MYKFYMKNGQAQFYERGVEIDGTVYGIHTDRDILRIKRSVVNNKFAETDDNFDMDTEIAKIQHTSITFKQPTSEQLSQIQAKAFGSMSDMKQYVQSIMNGELTQDEINAML
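
For downstream analysis, the records in the Sequences table above may be easier to handle in FASTA format. The following is a minimal sketch and not part of NMPFamsDB: the function name `to_fasta`, the `family=` header tag, and the 60-character line width are assumptions made for illustration. The example record is the F089005 entry listed above.

```python
def to_fasta(protein_id: str, family: str, sequence: str, width: int = 60) -> str:
    """Format one protein record as a FASTA entry.

    The header layout (">protein_id family=Fxxxxxx") is a hypothetical
    convention chosen for this sketch, not one defined by NMPFamsDB.
    """
    header = f">{protein_id} family={family}"
    # Wrap the sequence into fixed-width lines by plain slicing.
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([header, *chunks]) + "\n"


# Example record copied from the Sequences table (protein Ga0245191_10187416).
example_sequence = "LTKSKQRSIIALALLRLATSSEESKQALKVRRTLKIEQRENSKETRNDFEESSKNYSEM"
example_record = to_fasta("Ga0245191_10187416", "F089005", example_sequence, width=30)
print(example_record, end="")
```

Slicing is used rather than a text-wrapping helper because protein sequences contain no whitespace, so fixed-width chunks are all that is needed.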

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice