NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029576

3300029576: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35512



Overview

Basic Information
IMG/M Taxon OID3300029576 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283736 | Ga0245100
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - YSZC12003_35512
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size186663653
Sequencing Scaffolds14
Novel Protein Genes33
Associated Families33

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Tannerellaceae → Parabacteroides1
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → unclassified Clostridia → Clostridia bacterium1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal distal gut

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F032312Metagenome / Metatranscriptome180N
F042095Metagenome159N
F044554Metagenome154N
F050794Metagenome145N
F055715Metagenome138N
F057001Metagenome137Y
F058555Metagenome135N
F064725Metagenome128N
F064817Metagenome128N
F074899Metagenome / Metatranscriptome119N
F075481Metagenome119N
F076653Metagenome118N
F077313Metagenome117N
F078005Metagenome117N
F078006Metagenome117N
F080673Metagenome115N
F081354Metagenome114Y
F081453Metagenome114N
F083451Metagenome113N
F083452Metagenome113N
F085718Metagenome111N
F087336Metagenome110N
F088920Metagenome109Y
F089590Metagenome109N
F089591Metagenome109N
F090513Metagenome108N
F090514Metagenome108N
F094005Metagenome / Metatranscriptome106N
F099451Metagenome103N
F101355Metagenome102N
F102167Metagenome102N
F105375Metagenome100N
F106193Metagenome100N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245100_100149All Organisms → Viruses → Duplodnaviria → Heunggongvirae142308Open in IMG/M
Ga0245100_100177All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Tannerellaceae → Parabacteroides123136Open in IMG/M
Ga0245100_100378Not Available65594Open in IMG/M
Ga0245100_100605All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae38019Open in IMG/M
Ga0245100_100636All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii35791Open in IMG/M
Ga0245100_100850All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae24407Open in IMG/M
Ga0245100_101523All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes12404Open in IMG/M
Ga0245100_101891All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis9689Open in IMG/M
Ga0245100_105158All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales3805Open in IMG/M
Ga0245100_105975All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis3402Open in IMG/M
Ga0245100_112660All Organisms → cellular organisms → Bacteria1911Open in IMG/M
Ga0245100_112919All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1883Open in IMG/M
Ga0245100_132921All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes873Open in IMG/M
Ga0245100_135937All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → unclassified Clostridia → Clostridia bacterium806Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245100_100149Ga0245100_10014912F102167VVDLIDHGIIDLDIMNGKMMGEYVAMLMIRSREKITGSYTMTTFSFHEKDMDKLRMLIGNAIMAVGYRNNPLNGDGNTAIK
Ga0245100_100149Ga0245100_100149130F076653MTIRDKYFGWKDIFFDRFVHCCNEKSDQPQGSNIPLAKINFDNKTGYVEDGTINIAELLQYLWINNKVYGCEYAPIDISSALQTLIRLTENAKHMFEDQPGVYDMIPYRGFFLRDDFLSGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSILAKELGYDMNSYVSDNGYTLYNPYLSWINHYYHYLPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDSFLGKKYRKNLRTFIYRGIVFFLDRIWHTSLFEKMGVKMKYNAYYCYAATSGIWYNKGFKKRLAKRFNESLRGGGDLFGANLACMVCDRRDIDWEALRLWLDKYDEPTDKGMVNSPIQFMYLYLYYTFNNNLK
Ga0245100_100149Ga0245100_100149140F083451MDKKEKQILDLLMSRKDIRKLVEKSNECYSRMDFVGAMRYRQEIKDIVDRESKIMLTKSESLVGLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFVKFDNLDRLMAECKKEIDYLMKGTSKSFQISFAVRSDELREMIENMVGDNIREGYDMFKEEAEMVNETDRDKIEEFNKKLGHD
Ga0245100_100149Ga0245100_100149159F078005MKKSEFVKELEKIIDMIKTEDNGFEYGGKVIFYKEDDSNYEVSVMNIEMNLEVEADVMAGMDDIDFTCLMSEVYKQKAIKAIMMEKDDDEDN
Ga0245100_100149Ga0245100_10014928F058555MFGTKITLLQKMKSNFDKILTEKYIPRNIQTKKDELGCVKLPAGSLICPVDFKPVTNKEGKKVTAIKYSLKYEEYHGSGIRISDECKMAMIYLIIINVLKHVFLRKRMQDGNRDQIEINTNDFIDILSDGCAYFCYRHVLRDSHEDINYQLISLKAWAEGEIRIALSDIVKYKHKASKVPRIKDMFVKKGESIYTCIDKNLDSASRRRMANKSRKLDRVRILSKIIFRARTRNVHHIYKVTKRKTVKFNVAYLLNELNNKLIGIGMREISQSTIYRYISMFLDMCKKSISDLYDEVKKNNGVANTKDRKNVTIGCLRLLYKGKYMHILISTEYIRDVFLGEKSSEMSKAG
Ga0245100_100149Ga0245100_1001493F057001MPIQVPRARANLMTSKDLNKVQSEVKKASEKTLTGAVKAWCQLFKSGKEVNEILKDNDIKVDKAVVPALVALAKDKEVVIQLCKEILPRVNDTFCAYKEVEREYYDKQDQANNSKLPLDKVNDIAVLGDTHKRFGYCEPIAYNDTGSVPYYEVFNGSDKRIVKVAIPIKRYTYNLIAKCITYYLTHPKNGK
Ga0245100_100149Ga0245100_10014941F089591MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLVKYYLNRKDDETIRSIQSLLVYWMSLDNAGRMDDPESTSFNLPDDYLWFSNIKGVFSYKGCEATDFVMWEAKNENIHELLGDENNRPSYDYRETFYSIGNGKVVVYESGFRTEEVKMTYYRRPVRVDLSGYINAAGIQSTDIDPELPDYLVEEILDMVAKQFNLNENELYRYRMDKDNVASFK
Ga0245100_100149Ga0245100_10014948F042095MAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVVDNPYNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGQDNYRIIFRSLAIQHRKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYANIQTCKMTAFPIVDIPGLEFLVVSHTMYVNDGIPVDKLSRSNKLIYIELSNVGQRMTKMPEAITSKTEVYYLSMFNMLDLRDIESSGIRNIKNMKNLQTLELSSCYLDRYIKEFNDLPKLTSLRMHPGPSDMWNYFDINTLPSFEVDKINPNINDFYFLDDWVSGERRTGWNDDNMSGRGLEHLTSFIAAHSNSLRMDKLPDYIYEMRSITWFIVNCSTHSQKRSDDFVDSFYKLVTEWDQITMTSVASDNKRNQFYGLQVSMYLAMYPNENQRPSGTEQAPEGFVKGSSNGSPATPMEKIYVLKNNYAQRWTIKPA
Ga0245100_100149Ga0245100_10014950F089590MTLVMDNKGMLDKIGALWNIAIAYGTSCWAYFQPVHHLLEVLLVVLLANFIARLIQSARRWKVRRSRKRRFSLYRWFREVRLVGILKEFFLSCFIVMTLCVIYKTLSIEEDDASAILVVTKYGVYAALVAYVMLFLNTIGEAFPDTYIVKVFKSIFNRVNILKLFGSAKSLPDDAFDDIKKIADEEVKDKS
Ga0245100_100149Ga0245100_10014951F075481MKRLRISLKAVFCLGLSLSLSSCGSRRQVSETSIDSRLISRIETMIDEVMDRKIVEIKTSDLNADIVITEREFDTDKDVDPATGERPVSSQTDTHIVIGRRDSTVTADSLGIDKTITGIEDIDKKTDIKHKDIDDKEESRWPMAIIFMSILGILVVLFVLLKRFGLIK
Ga0245100_100149Ga0245100_10014957F106193VGCNTCKEKALKAERERIERSMMNRVSSTVISDREYASRSTAGCMVMLDPLKTMERDVVSIYKQTRTIGDVGIVYLNMQKKIREWIKNLPYGCPPDEEVQEMRKEILDGRSIYIKP
Ga0245100_100149Ga0245100_10014975F064725VIQLAREGRLDFFCITDSNTYLCAKDLNMTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAIMSEAVIKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITEKIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYAWDNYDITISGNTMYTGFDGFIYIVYRATPKDEDGLPYIPETDLGYLEDYVETYIKMKIFENAAVNGLIQGAGEAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSHERMWPNAFDKYIKLI
Ga0245100_100149Ga0245100_10014984F077313MGKYVIKRKIPKYQDAGEVDPVMPGNIVGLQGLGVEPLVSSTRIGFDIQQPDINTIDTSDLNAIVDSNKKVDESGSTDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGVKTANIVGTVMSGIGGVLGLARNVFSGMASEQGTRTNIRLAQEREARQRRQSQMRYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVTQPGEAPMEAMGQKHADGGTPVSLEEGTKVITDDTTIESDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALDKLKKNDKIDDENTRRLNASVLSMAINDSNETVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDIAEAKKELLKKVAGIRQKMEKGGSSLFDYLLTFRPVENKYNNKDNTFGYQRQGQDGSYGGINTDERLEYYKTFMPLAYDAYMSAPKATAAKALQDAIYNTTGGWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKFGQYSSSRPMIGLDVVTEDQHKALNDAGITHFSQLFSDKNKDIVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGSMDIGDVEDPDVKLDMPELIDPNTLPKTNTSAGKSNSGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVEANRAFQSQLDQMGNAPDSQRGALSSNLQAIMSSNIGKYINDVEQGNVAQRTWADNINARTWADTYDKNIAQRQAYQQRILQGLAINDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG
Ga0245100_100149Ga0245100_10014987F050794MAPVGTLYIMQLDAFLHRKIMQDLRLQRVKVLMMLYTSHYFVNNRQRQLLDHTYALSRSQAFDYMTEFNKRLSDKIGIECTMDILLPTDDDNANIIIEYNGIIKKLMREAEKLELDTDAIKDMMRDLLNELKDDVDLNILIFDVTQLLIKYNLFRLDAITEQEFKDSFVRMDSRNMEIKKLTLSDIKKVVMMMEDRYSYISSI
Ga0245100_100149Ga0245100_1001499F083452MSGRIKIKSKNKDKKPKIDVFKVIEDRFKNMNELRDLIDMDPKKGLVRIRDGAGFREVERGGCLHRNYLNLLEEELGAKLSIDLIDKYIKRK
Ga0245100_100177Ga0245100_100177104F080673MDEIMKLQDEALLYLRDNITREEAYYILTKENEMTEVLIAKKKDGSKRIKILDMEYTIERDDMLLLFDTDGIIDECLLTSSHIGINMYFRRQDIRDILSKKLEVMEYRYIKIQVDNIPVVEKRRVILDLTGHRVDRNDRDKIDFMFVYYMARLCV
Ga0245100_100177Ga0245100_10017797F085718MVIEFDFEIYKNGDYDKVYLRNGREARVLCDNGKGDRPIVVMIENNNVDDYIILRYNETGRRSISSQSGLDLMLSIKEREPELWVVVISYMDNKDKRQKMVLPNFFSRNIRGNIYLQGSSKSSVSYYVDKLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDETAV
Ga0245100_100177Ga0245100_10017799F078006MEDNILKRAAAELKEAGCRVFAWKDDTYNRGWSKGDYIMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSCGSGSGCCVKEEATFDLETALDVLNEPLPRWCKSYGVYPEQYKDIDRWYNSDNYNKKIFKEI
Ga0245100_100378Ga0245100_10037811F094005MKKEVIKLKEGNSIIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLEGQFTRLDGKGNAIILPCTTENEQKYNSFVAYHQSKKSLEAIKKWLDDNGKHKDDETLEKVITLDKKLKKLVEKLNE
Ga0245100_100378Ga0245100_10037812F032312MARIKDYDEDLSAPKLLRERARDSKGRFIKKDLPPYLGSEQVLKPKNYYHFDSHGNYKGSSMNFDAMVLLGFTWFKLLGVALMMLLWPIVFIYALNDGIERYPFKKYAIPYIFILVVWFIVFLYGLVS
Ga0245100_100378Ga0245100_10037823F105375MKNNETFQTTQHLDKLVTNLGLQIQELFSLDLEEILDYSNNLMNLLVNAYVENQCLALSAMISKQDGFAIYSFLFQTPDTFNGAADALVNFAMNFTDGEANIKSINGISSNIMQITFTV
Ga0245100_100378Ga0245100_10037825F064817MNIKNLFNRFRKREPELSYSLNLIYLEDTKVVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKNRYWVYDKEPYLLYYKVPLIVNTRRKLSGKSDMVITKEKYQAAKDLVPAHEVSDRFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQHNWLKEFELLVFKRNYDTDMLFLNYSLTYILDQTEEEGRRICIQNIIERNINQENQDENETI
Ga0245100_100605Ga0245100_10060530F087336MVAELEQVPKAFRAVGNQRRAAAKERIKDDAIGHGRVSDRILAEIEDNHMRERNTKIGLAEQRQVAFLEIAFQILPLKSKQKRAP
Ga0245100_100636Ga0245100_10063623F090514MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADIPPCLTGAKWFVILLTIVPAGSRTRAGSSLPLSPASNIF
Ga0245100_100850Ga0245100_10085013F044554VLAGRFIPVLCASIARLFPCRTEIARCLTLDFAISRYLFLSFFFSFRTNFAQALFSSLLFVSDTRAKSILFLLFENEIAHLQGQYRFNSHRYCFSAFLVL
Ga0245100_101523Ga0245100_1015234F074899MSGRKQWNKQAATSSFLDKKPPQSLVQQGLEGSTAVGKDEVGSSNLPSSSNINRLNRLI
Ga0245100_101891Ga0245100_1018916F090513MAKYVRKMEWKLLHIKHILYMRPFGALKIALQSVGNKNGTKAVLQGWAAAFVPYMLSFT
Ga0245100_105158Ga0245100_1051588F099451MEIKNVGQLRKIIENLSDDYEIEMRIRRKLTDEELKNCRYPYPYDTEYLTLEFDDIGVFDKVLCLGVTSNE
Ga0245100_105975Ga0245100_1059754F101355MIEPPFQHGIADMAFWFIQWYLPSAQSPRPKGAGAVFPYVLPRCSYFFKSFVTEMSIFICMHKCLAQTGRLRGSSCHIVVAAKRACACTLLWIPDHFYKKLLPYVLFPFFKIYLKKIDFFQNIA
Ga0245100_112660Ga0245100_1126602F055715LTKANEFDKIIELLIERTAKKFERASKNKLKKFLTNEKFCDKINELIRVGTAKILDN
Ga0245100_112919Ga0245100_1129193F088920VSANLHHYPAFEAGLILHLILHLILHFSPKAAIFAPKRAFSFLFVPTLFFVGLSVLSPQTSLKISGQASLY
Ga0245100_132921Ga0245100_1329212F081453MFPFFLFAGENIIEKHISANPVCGTNIRAPELAVKGAPGRKCVQ
Ga0245100_135937Ga0245100_1359372F081354VKDFSSIYPESRRRSPLKKGAVTEIPLEYGKTEVQIPFEPLQAAISSMELPKNFENKKLPNTLRRLAFLPFHL

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.