NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029843

3300029843: Human fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - Stool sample from british twins



Overview

Basic Information
IMG/M Taxon OID3300029843 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0133139 | Gp0283909 | Ga0245274
Sample NameHuman fecal microbial communities from twins in the TwinsUK registry in London, United Kingdom - Stool sample from british twins
Sequencing StatusPermanent Draft
Sequencing CenterBeijing Genomics Institute (BGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size176466355
Sequencing Scaffolds17
Novel Protein Genes19
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia1
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales12
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human Fecal → Human Fecal Microbial Communities From Twins In The Twinsuk Registry In London, United Kingdom

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Unclassified

Location Information
LocationUnited Kingdom: London
CoordinatesLat. (o)51.5Long. (o)-0.12Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039198Metagenome164Y
F041208Metagenome160N
F042910Metagenome157N
F043413Metagenome156N
F043945Metagenome155N
F050793Metagenome145N
F051934Metagenome143N
F055775Metagenome138N
F056623Metagenome137N
F056682Metagenome137Y
F058154Metagenome135N
F072366Metagenome121N
F073573Metagenome120N
F075480Metagenome119N
F077320Metagenome117N
F078003Metagenome117N
F082714Metagenome113N
F091068Metagenome108N
F101192Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0245274_100135All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia104600Open in IMG/M
Ga0245274_100402Not Available44346Open in IMG/M
Ga0245274_100503All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales37158Open in IMG/M
Ga0245274_100656All Organisms → cellular organisms → Bacteria30645Open in IMG/M
Ga0245274_100681All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales29260Open in IMG/M
Ga0245274_101337All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales17246Open in IMG/M
Ga0245274_101408All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales16574Open in IMG/M
Ga0245274_101476All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales15880Open in IMG/M
Ga0245274_101748All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales13635Open in IMG/M
Ga0245274_101994All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales11946Open in IMG/M
Ga0245274_102043All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales11668Open in IMG/M
Ga0245274_102047All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales11614Open in IMG/M
Ga0245274_103047All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales7713Open in IMG/M
Ga0245274_103050All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales7711Open in IMG/M
Ga0245274_104079All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae5769Open in IMG/M
Ga0245274_105105All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium4669Open in IMG/M
Ga0245274_118617All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1358Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0245274_100135Ga0245274_100135102F039198EIQSTKYVQIYLTEEELQKEDTKDLIKKYKQEKYSMAIFVTGKENYPEVLKKIVMKQVELNNNVC
Ga0245274_100402Ga0245274_10040253F055775MAQIAQQDNLVIEVTTTTAALDGDTKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTTGDSPKYTIDIINANSGAVEAIALN
Ga0245274_100503Ga0245274_10050332F058154MKNRTFQRLFPLLRMGSILLAAVLFCAALIGVNAQLTVGTNEERNLATYLGLALTAVVTYLLQDVPKMLVGRHFGWKLVRYEAFGRVYQADGTGAFVRVPVSAQSRLRYQPYLLPPEDAPRHDVTLYTLCPLLYGGALAVVFGVLTLLTLGQPVSLVLCELAMGGVVLVMTCILPMDERFIASPMAIARMLRDPEQLAEWLYFARMKANGGVEVTDVSIENIPQPATVKTCQDAICVFQRAQIALMVQCDAQKAYGLMKQVLESEARLSYTAWKLLLSDGVVAELLAGEPGEFTALYRGKAGAAIRQVMFRMVDYALPAYAVAMLVTHEAREAEVVRNTLAPVREKMPELDALMKAIEEKAARIEN
Ga0245274_100656Ga0245274_10065621F072366MLDILAIKADVYHLERQGKRLPVYRYLREVWQKEPPSEGLTVLALQQMVDYVEYVDDLTVLGEPWEAENEYDLYQDFLLDVISWGLQKYRTKKRFLWQICYYVNAWATFYYIFGREITQENVEQWKKTLFKEAKERYPDSLLFEFILHAAQLDYVWFYRLTDEQRLQIRLEVGEWNLQKNDMDQAVQSDFDDALTWYRDNGRKLLEAKKQDE
Ga0245274_100681Ga0245274_1006813F082714MVVGIRFAADAPVRTVLQAFLPIFSTVDVDFLVREYWVCTFGNGLPEQRFTSQEMQAAVNALTPDEHAELFTIYVLPHDAPGTPPSSCEDFCARGFTMAFYAYDGDGYALLAQSEEQVEAAFEALTTAGLCEKVEEVEKETLAHWGF
Ga0245274_101337Ga0245274_10133712F073573MPSIVSFYQRFPNEAPSLNLSAFDRTGYTYAENFRCNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMLFQVRRMELRTGDTTYAILPDNVQQYTMRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLTAAQKKAIHRFCMDCEESAITYQPAFLWYKDYCTKA
Ga0245274_101337Ga0245274_10133715F101192MQTEPSDKEGRNMTYRGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQIAKAAPTSFTPCGEDTIRRWVSGRYQIALRFNRYDICLGVEEEIDG
Ga0245274_101408Ga0245274_10140819F077320MKRKGMRRRCKLVLLAVLLIMVGIAVWRIWQTPRPTVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLLYIRDYSASGHYQIVWEGVSAEAAESYLTALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEEPTATPLPEWFADW
Ga0245274_101476Ga0245274_10147612F051934MGLMRRLAAAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQSTTLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQETSGRLENTALITCAEEIARLSEDDRALYYYIEAFGLESGADANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSHAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTDDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
Ga0245274_101748Ga0245274_10174812F041208MRKILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAISSYIRRFADLNGFVKVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPAAAQTQRDILLLMRDNLAEKGVDITQTFFAVQADDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYAMVDGKYLVPDGELAVGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDMLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINQVEKGYDAFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGMWGPQDGNG
Ga0245274_101994Ga0245274_1019948F078003VNEHLAFAQCLQRVLSETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKHMGEQLTEAQWAALNRALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGAEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREAYEPNVTALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPCTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKLSGQ
Ga0245274_102043Ga0245274_1020431F043945MKHNLRMAFLSLLLVLTLGVMPVFAESADHAMDWTHGIGSRYRTDVTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDVIWSEALKQSGYYGQVMLLSGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELDYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAENGLYVIAKDKTDEYQIVWFDLDTGKGEMLEFALEDDENSHLLTDMPGFFRWSAESSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNSAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGFYWCGFYLIQTHDGLYLCHTGDETARAYLYGGVE
Ga0245274_102043Ga0245274_1020433F091068MNEKNQQVSDEQIDALIRSGLWQDEQPLTADEEKLADAAFAQAMAKIDRKAKQQKRRTVLHMLDRVVRVAACLIVAVGIAFPIALANSEAFREQILQLVLSINPETGMAHVGMKPAKEERTANVPRMDVPDGWEGLFFPTFLPDSLPLVRCETTRGDQVHSEAYYADETRSLRFEEQDNLDGWNVRAADARVLTIYLHSVPGYLIDRQTEDTHEVSIIWSEGRRMLRVTSIGLPADEAVLVAQSVKKIFAE
Ga0245274_102047Ga0245274_1020477F043413MTERELREKLQSAYGTMPDATRAAFEHSLTHHRVETRHPVRMSRRMRTIVTFALMLMMLTAVGVAAAKLASVTDYPPPSGLTPEYMSHLVALNEAYDGDLLTLSVNDLVFDGTTVDVAMNVQPKAGKQVFMDMEVTAECAGRAYTLEIEGCGGGDFMSGMFLPDGTGSWSDSPFGFDGVLWDDDMQSPPEGTPIAWTMTFELLQPVWEVEYLSDAQYQALLAGGADALETYVQNNWRQQIITVTYGSLVEYQYAAEAAMLADGTITEPLSGSRTEQLLSCGGFRRADTIIVQFTTDFADDYAHPELVGKRIGMGDYDLVIDNVNLSFMRANITMHYEFSAAYTEDEICQMTNLPNAWRVYVNGETDSGYDAYANFQQVTNGAYGVDTPEQLTVGFDFYPAETDITRLTFVPIRNMGEKWDACHPDAEKGFTLELQE
Ga0245274_103047Ga0245274_1030479F042910LADRLYCALNGTTLRDLDARIHLLDVEELAPTVRTVTASRIGGGLHLLRRQREQLSLRVRFLIEEYDIATRHQLLHLVAAWAEAGGTLTLHEDGKRVLRVVCTAFPAMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDTAITTLTVSAAGKISFQGLTLAPGAAIRIHHDAGVFAAEMVSDDSTVSILPYRTPESADDLLLRPGVLNEIRVEADSAAFVSGRCKGRYC
Ga0245274_103050Ga0245274_1030502F056623MKSTEEMLQAAVQQLQTAEQARLTSETDPARRQLLHLLYAREAAALQSMTLVRAEIPATQKRDWKPLILPLAAAVLNLGALALWLTLPSCVTPENSGWMGRFALCIVAQSVSLCATIGSIVSAMRRKPPKPVAVADEPELRRRLVEAEGRIALDAQTIEALFAQESVTIISTGAETAAELYASLYEMAQDARLDGDANQEKALSWPLSNAKRLLNAVGCEAVDYAPETAMFYDVMDADITQQRRPAIVQKADGIVQQRGLYLRKG
Ga0245274_104079Ga0245274_1040797F056682AGKAANQPIRQGEIYPASFGAFPSKNRSTFPIQKLGKIYENQEVL
Ga0245274_105105Ga0245274_1051052F075480MNKTKQEKWQRAYGDTPDSFRQRVAASLPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTHTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLVPIGMEATPTLQFLKVNGQTTDTPLIGAFLTQNPDGTVSVGFQVDLSEEDVSHLKSCEVQLECRVGAFGKDGKATQWQKEILTATITFK
Ga0245274_118617Ga0245274_1186171F050793MRKQVLVPENVEMADIHAIQTRVKKHNLLTVVLLIPLFAGCALRESHETLGGIVFVLSMVLMVLGFVSGRGDEKRLPAYWAKYQQIHRMDSLLRANLVGVWLCAEARVAVYREGSGYRVQLGVLDEKTGAWENDEQDEVFPTLADVREYAAQEGYEPMDVDFEQMTDEEFQQFLDRSRI

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.