NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007184

3300007184: Human stool microbial communities from NIH, USA - visit 2, subject 763536994



Overview

Basic Information
IMG/M Taxon OID3300007184 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052641 | Ga0103258
Sample NameHuman stool microbial communities from NIH, USA - visit 2, subject 763536994
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size156800711
Sequencing Scaffolds14
Novel Protein Genes31
Associated Families27

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → Viruses → Duplodnaviria → Heunggongvirae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis1
All Organisms → cellular organisms → Bacteria3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → unclassified Clostridium → Clostridium sp.1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriaceae → Eubacterium → Eubacterium ramulus1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Oscillospiraceae bacterium1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F026592Metagenome / Metatranscriptome197Y
F032286Metagenome / Metatranscriptome180Y
F042095Metagenome159N
F050794Metagenome145N
F058555Metagenome135N
F064725Metagenome128N
F067844Metagenome125N
F075481Metagenome119N
F076653Metagenome118N
F077313Metagenome117N
F078005Metagenome117N
F078006Metagenome117N
F080673Metagenome115N
F081453Metagenome114N
F083451Metagenome113N
F083452Metagenome113N
F085718Metagenome111N
F087335Metagenome110N
F088921Metagenome109N
F089590Metagenome109N
F089591Metagenome109N
F089592Metagenome109N
F090514Metagenome108N
F092228Metagenome107N
F093883Metagenome106N
F099451Metagenome103N
F102167Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0103258_100021All Organisms → Viruses → Duplodnaviria → Heunggongvirae153747Open in IMG/M
Ga0103258_100059All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus → Ruminococcus bromii98941Open in IMG/M
Ga0103258_100211All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii61667Open in IMG/M
Ga0103258_101615All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Bacteroidaceae → Bacteroides17614Open in IMG/M
Ga0103258_102891All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii10443Open in IMG/M
Ga0103258_106770All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae3919Open in IMG/M
Ga0103258_107021All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis3747Open in IMG/M
Ga0103258_110690All Organisms → cellular organisms → Bacteria2253Open in IMG/M
Ga0103258_111603All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2038Open in IMG/M
Ga0103258_113417All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → unclassified Clostridium → Clostridium sp.1709Open in IMG/M
Ga0103258_119383All Organisms → cellular organisms → Bacteria1099Open in IMG/M
Ga0103258_121882All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriaceae → Eubacterium → Eubacterium ramulus957Open in IMG/M
Ga0103258_134607All Organisms → cellular organisms → Bacteria583Open in IMG/M
Ga0103258_137145All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → unclassified Oscillospiraceae → Oscillospiraceae bacterium538Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0103258_100021Ga0103258_100021114F085718MVIEFDFEIYKNGDYDKVYLRNGKEARVLCDNGKGNSPMVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREPELWVVVISYMDNKDKRQKMVLPNFFSRNIRGNIYLQGSSKSSVSYYVDKLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDETAV*
Ga0103258_100021Ga0103258_100021116F078006MIRGRFPAPWHNLNVSSMEDNILKRAAAELKEAGCRVFAWQDDTYNRGWSKGDYIMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSRGSGSGCGIKEEATFDLATALDVMNGPLPRWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI*
Ga0103258_100021Ga0103258_10002112F077313MGKYVIKRKIPKYQDAGEVDPVMPGNIVGLQGLGVEPLVSSTRIGFDIQQPDINTIDTSDLNAIVDSNKKVDESGSTDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGVKTANVVGTVMSGIGGVLGLARNVFSGMASEQGTRTNIRLAQEREARQRRQSQMRYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVTQPGEAPMEAMGQKHADGGTPVSLEEGTKVITDDTTIESDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALDKLKKNDKIDDENTRRLNASVLSMAINDSNETVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEENIAEAKKELLKKVAGIRQKMEKGGSSLFDYLLTFRPVENKYNNKDNTFGYQRQGQDGSYGGINTDERLEYYKTFMPLAYDAYMSAPKATAAKALQDAIYNTTGGWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKFGQYSSSRPMIGLDVVTEDQHKALNDAGITHFSQLFSDKNKDIVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGPMDIGDVEDPDVKLDMPELIDPNTLPKTNTNAGKSNGGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVEANRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGKYINEVEQGNVAQRTWADNVNSQSWANTYDKNIAQRQAYQQRILQGLAINDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG*
Ga0103258_100021Ga0103258_100021121F080673MDEIIKLQDEILSYLRNNITKDEAYYILTTDKDMIEVLISDKKDGSKRIKILDMEYTIEKDDMLLLFDTDGVIDECLLVASYIGVNMYFRRQDVNAILYNINREKVMKYPYIAIQLDNIQTIEKRRVVFEITGHRMDDNKERIDFMFIYFMARLCV*
Ga0103258_100021Ga0103258_100021135F083452MSGRIKIKPKNKDKKPKIDVFKVIEDRFKNMNELRDMIDMDPKKGLVRIRDGAGFREVERGGCLHRNYLNLLEEELGAKLSIDLIERYIKR*
Ga0103258_100021Ga0103258_100021138F102167MDINQIKKYLPAGWDVVDLIDHGIIDLDIMNEKMMGEYVAVLMIKSYDKITESHNLTTFSFHDKDISGLRRLVSNAIMAVGLRNNPLTGDGNTAIK*
Ga0103258_100021Ga0103258_10002115F050794MAPVGALYIMQLDSFLHWKIMQDLRIQRVKVLMMLYTSHYFVNNRQRQLLDHTYALSRSQAFDYMTEFNERLSDKIGIECTMDILLPTDDDNANIIIEYNGIIKKLMREAEKLELDTDAIKDMMRDLLNELKDDVDLNILIFDVTQLLIKYNLFRLDAITEQEFKDSFVRMDSRNMEIKKLTLSDIKKVVMMMEDRYSYISSI*
Ga0103258_100021Ga0103258_100021155F058555MGTNIVLLQKMKSNFDKILTEAYIPKDIQAKKDELGCLRLPAGSLVCPVDYKPVTNKDGKKVTAVKYSNKKDNIRGSGMVIEKKCKQVVAYLTIINVQKHVFLRNRMRDGYRDRIEINTDDFIDILSDGIAYFCYRHVIENCHEDIDYQLKTLKAYAEGEIRIALSDIMIYSYKAKKNEDTKEIFVGKKRSVYKCLDKNLSSDERRNMANKSRKLDRVRILSKIIFRARTRNVHHIYKVTKRKTIKFNVAYLLNELNKNLIGIGMQEISQSTIYRYISMFLDMCKKSISDLYEEVVKNNGVVNTKDNNNVTIGHIRASYKGSVLYILISTDYIINVFLGKKSAEMSKAG*
Ga0103258_100021Ga0103258_100021161F093883MEENKDIKKEIRDYLKEEADTHIRHWIAIKRESKRLYSDIEDRTKKIALKSSSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFIIQLATSFGMDLDLDTTYASTAKSIIEDRTSGFVFYDDKERLRYADKELEDMFHDMSVKEVSKIGVVQSYELLMKQYNEFKDMKANATGKTKADE*
Ga0103258_100021Ga0103258_100021167F089592MKEILKSRKVLAEVNGFNIMSDTLYEVVGKHDGSAPQAFQDANIAKAPFPENATHVCCPWDDFSKAYNTGFYPRSRCYNGLDKNEIDRLVKQRVDNIMKPFEEMSQMDLSQTNLEFWDDAKDKIFMGKVYNTANTVDLFYLYLAVFSGMLTPQEMDGDPVFMNSMFCFVEKDNMKDFVQQREINKMNISYKFISALKKGGDDRQAVIDLLLYIGIVTRPDFTEDEYYTGSLSNWMNEKKTNVDYLLDIWDRSLEGDFKEVLEFYRIVNVLQRNGRINMTPSGLQYNGQIIGHDVRTSAEFLATKKDFINIKANVLDEYEEIISMSNIDDKSKTKKVKDIKKKDDVEEGDKVKEE*
Ga0103258_100021Ga0103258_100021168F089591MTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLIRYYLNRKDDETIRSIQTLLVYWESLNKGNHIDDPESTSFGLPDDYLWFSNIKGSFSYNGCEVGDFVMWEAKNENVHELLGDDNNRPSFDYRETFYTIGDGKVVVYEDGFRTDEVRMTYYRNPVRVDLTGYINAAGMQSTDIDPELPDPLVEEILDMVAKQFSLNENELNRYSMDKDNVASFK*
Ga0103258_100021Ga0103258_100021175F042095MAKTLYKYEASSNKFVWFTTWDRALRNFYTDDYNYVPDPVVNNPFNTFVEFRSRKPGIANVDWGDGIKEQFPMTKVQGEDNYRIIFRSLAIQHRKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYANIEICKMTAFPIVDIPGLEFLVVSHTMYVNDGIPVDKLSRSNKLIYIELSNVGQRMTEMPEAITSKTEVYYLSMFNMLDLRDIESSGIRNIKNMKNLQTLELSSCYLDRYIKEFNDLPKLTSLNITQGPSDMWNYFDINTLPFFEVDKINPNITDFIFLDDWKNGERRTGWNDDNMSGRGLEHLTSFIATNSNSLRMDKLPDYIYEMRAITRFNVNTSTHSQKRSDDFVNSFYDLVVGWDQITMTSVAKDGKRNQFYSLSVSMYNAISPTENQRPSGTEQAPEGFVKGQSNGSPATPMEKIYVLKNNYAQRWTIKPA*
Ga0103258_100021Ga0103258_100021177F089590MKDKDMIERVGALWNIALAYGASCWAYFQPVHHLLTVLLIVLIANFLARLAQSVRGWKLRRSRRRRFSFKRWLREVRLTDILKEFALSCFIVMTLCVIYKTLYPIEEEASMILTVTKYGVYIALVGYVMLFLNTIGDTFADAYLVKVFKAVFKRINVFKMFSFSKNIPDETFDNIKKIADDEVKDKS*
Ga0103258_100021Ga0103258_100021178F075481MMRLRISLRAIFCLGLSLSLSSCGSRRQVSETSIDSRLISRIETMIDEVMDRKIVEIKTSDLNADIVITERKFDTDKDVDPATGERPVSSQTDTHIVIGRRDSTVTADSVGVNKTRNDIKDLDNKIDIKSKDVDDKKESRWPIVWIVAGILMILLVLVYILKRIKIL*
Ga0103258_100021Ga0103258_1000213F064725VIQLAREGRLDFFCITDSNTYLCAKDLNMTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAIMSEAVIKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITEKIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYAWDNYDITISGNTMYTGFDGFIYIVYRATPKDEDGLPYIPETDLGYLEDYVETYIKMKIFENAAVNGLIQGAGEAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSHERMWPNAFDKYIKLI*
Ga0103258_100021Ga0103258_10002164F076653MTIEDKYLGWKDLFFDRFVHCYDDIDQPPGSNIPLAKINFDNNTGYVEDGTINIAELLQYLRIHNKVYGHDYNPLEIFFVLQTLERLVEGAKEIFKDQPGVQDMPTYKGFFIRDDFSRGKDYALDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSPLAKKLGYEINSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDNFMGKRYRKNLRTFIYRGIVFFLDRIWHTPLFEKMGVKMKYNAYYCYAATSGIWYNKGFKKRLAKRFNESLRGGGDLFGANLACMVCDRRDIDWEALRLWLDKYDEPTDKGMVNSPIQFMYLYLYYAFNK*
Ga0103258_100021Ga0103258_10002174F083451MDKSEREKQILDLLMSRKDIKKLVEKSNECYSRMDFVGAMRYRQEIKSILDRESKIMLTKSESLVDLMNDADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFIKFDNLDRLMMECKKEIDYLMKGTSKSFQISFAVRSDEMREMIENMVGDNIREGYDMFKEEAEMVNETDRSKIEEFNKKLDHDQM*
Ga0103258_100021Ga0103258_10002195F078005MKKSEFVKKLEKIIDMVKTEDDGFEYGGKVIFYKEDDSNYEVSVMNIEMNLEVEANVMAGMDDMDFTCLMSEVYKQKAAKAIMMEKDDDEDN*
Ga0103258_100059Ga0103258_10005964F092228MKKKCTLVLISVLTVACIVSAYLLFFYNPSFNMVYDSDTDSYFNNSYLSYNDGTLAAADYRKTKVTAYDSKNNSTVNLPSNGCLINDNLFYINGNKLCCLDTTTNTRKIIDTDCRSFVCNNEVIAYTKNDSVILKNSDTLENIGDIKFDNQIYYINISDGNLYIAERIFEDKTDEYGYSFKVGKQYIFKKYDLKSCKLLKSKNANYVNGIRYVTVCQDTFYFFCDETQTVNNVCLDKDVNYPTIQHPDVKFITSNNDCVYYISEKTESAIILKTVESPYNGIWKLEVGSNKPVKIADKCDCDELLATKNFLYCYTINYILPRGVANSWVKGYLIDQLAIS*
Ga0103258_100211Ga0103258_10021152F087335MIELYFNDANLLPENREEEQYAEAARNNILAGKTADTEMCILPCFYDAPLHKGLLYHLNDGTTLRIRPVLNEAGEPELLIRCVTEETVEEMYQRVSNYDKEKEL*
Ga0103258_101615Ga0103258_1016155F099451MEIKNVGQLRKIIENLSDDYEVEMRVRRKLSEEELKQCRYPYPYDTEYLTLEFDDIGVSDKVLCLGVTSKIN*
Ga0103258_102891Ga0103258_1028918F090514MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADVLPRLTGANWFVILLTIVPAGSRTRAGSSLPLIPASNIFLKQTPRRGLPLAWNAGAGSFLFCG*
Ga0103258_106770Ga0103258_1067701F026592VFGFSNLLATNSISCCLRTASPQGIDAQASQGSVAPLTERSDETFSVKQFFSADRE*
Ga0103258_107021Ga0103258_1070211F081453MFPFFLLAGENIIEKHISANPVCGTNIKAPEPAVKGAPGRKFM*
Ga0103258_110690Ga0103258_1106904F081453FFLLAGDNILEKHISANPVCRTNIRAPELAVKGALGRKCVQ*
Ga0103258_111603Ga0103258_1116034F026592NRLRTASPQGIAALAAQGGVATLTERSDATFSVMQFSSADGE*
Ga0103258_113417Ga0103258_1134172F032286MVKWVCQIVTPIRHRALVFNTRLFAGTAADDALVTGSTLSFCRLMCLCVKRRNIMLNDKRRSLLNSALFRADYRTEQKTISPFSLALILTFDFAALSERRSCPEDRSRRFVLLGALDAALRQRTYPVRTVMQFSRFRCDCKTILAKNIALSRISKRSENALKNADFHAHGQDRERAEI*
Ga0103258_119383Ga0103258_1193832F026592ASTQGIAAQAAQGGVATLTEGRDETCSVGQFFSADRE*
Ga0103258_121882Ga0103258_1218823F099451MGSNKIKTIGHLRKIIEDLSDDYVIDIRIRRKLSDNEIKELKVYPYPYETQHSELEFDDIGVSDKVLCLGVELEKE*
Ga0103258_134607Ga0103258_1346072F067844MNTLAAETASAWYLILNRKLQTLPIYITQLSSHLPRPLTMGTQQVSAGAAVITPQRRS
Ga0103258_137145Ga0103258_1371451F088921LYKIWAGENHFLCSEKSKSTVFDLDRETKKRKYAKGTCRIYIEKDQNMQEEKKEKYINGEIYGEKNQIIVANKLAMIRDKMRSK*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.