NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000624

7000000624: Human stool microbial communities from NIH, USA - visit 2, subject 159571453



Overview

Basic Information
IMG/M Taxon OID7000000624 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053199 | Ga0030563
Sample NameHuman stool microbial communities from NIH, USA - visit 2, subject 159571453
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size201626213
Sequencing Scaffolds13
Novel Protein Genes24
Associated Families23

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6
All Organisms → cellular organisms → Bacteria4
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Erysipelotrichia → Erysipelotrichales → Erysipelotrichaceae → Holdemania → Holdemania massiliensis1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationNational Institutes of Health, USA
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F042095Metagenome159N
F042910Metagenome157N
F047755Metagenome149Y
F058555Metagenome135N
F059982Metagenome133N
F064725Metagenome128N
F068856Metagenome124N
F073573Metagenome120N
F075480Metagenome119N
F075481Metagenome119N
F076653Metagenome118N
F077313Metagenome117N
F078004Metagenome117N
F078005Metagenome117N
F078006Metagenome117N
F080673Metagenome115N
F083452Metagenome113N
F085718Metagenome111N
F087213Metagenome110N
F087336Metagenome110N
F093883Metagenome106N
F101191Metagenome102N
F102166Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3086825All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales672Open in IMG/M
C3110754All Organisms → cellular organisms → Bacteria768Open in IMG/M
C3148798All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1011Open in IMG/M
C3194670All Organisms → cellular organisms → Bacteria1691Open in IMG/M
C3206497All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2084Open in IMG/M
C3210574All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2265Open in IMG/M
C3215078All Organisms → cellular organisms → Bacteria2523Open in IMG/M
C3223416All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales3237Open in IMG/M
SRS024435_LANL_scaffold_20298All Organisms → cellular organisms → Bacteria60235Open in IMG/M
SRS024435_LANL_scaffold_23226Not Available35169Open in IMG/M
SRS024435_LANL_scaffold_28092All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Erysipelotrichia → Erysipelotrichales → Erysipelotrichaceae → Holdemania → Holdemania massiliensis1913Open in IMG/M
SRS024435_LANL_scaffold_31519All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1552Open in IMG/M
SRS024435_LANL_scaffold_8137All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales → unclassified Flavobacteriales → Flavobacteriales bacterium58624Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3086825C3086825__gene_213343F087213TAESTDAVSGAPLDNDIEIVYKEHFDDMVTRYADELTEAELLVSADVYDAIGMIRFDSEDSLAARQRIYGIASADDLHAILTGYFDPNDAAYTADTALDAHLQAILAACGLNPEDYDISVIRNLSGMPEPITGTNWYCTLIRKGIEVAEDETNPYDMVIVLYGDEMTVGAFVLNPEV
C3110754C3110754__gene_225586F087336RGMVAELEQVPKAFRAAGNQRRAAAKERIKDDAIGHGRVSDRILAEIEDNHMRERDTKIGLAEQRQVAFLGIAFQILPPTSKQKRAPQEVNSQSALSESNQLINP
C3146335C3146335__gene_244639F078004LPPNPITCFIIRILQGFVKPDLMVLFPVCFLFFCKSSTGRLSAVAGSATLDIHMIRHALVIAVINTFYRLTVDADRMAWMRQGIAERFSSLSLLRKAFTAGSVTVAGVLATHHDVSLATQTVLIIGTIFHNAF
C3148798C3148798__gene_245958F101191MLLCGFDLDKALSSRDPLDCMAEAVADDTKRIGTTCGGGIFRAVPREFDPESGSHRLPFAGSIRLIDWRVTLSGTMLDVTPENLARLLPSDTEITERVTTLTPKQTRKPLSRLCWIGTTSRGLLVIELRNPLCISGASLTSVPDGAGRLPFTFLAQNDRPGDVNLPARLYWWKEETHDAA
C3194670C3194670__gene_273477F047755DIKDKLRLAICMSQSKWSGLIYNTKENYKKFDSMLKNIDEEYMTTLIDFSKYKLIMFAMAKLMEMETTEQNKVALYLFNYIEK
C3206497C3206497__gene_281558F042910LRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITALTISAAGKISFQGLTIAPGAAIRIHHDAGVFAAEMVSDDSTVSILPYRTPESADDLLLRPGVLNEIRVEAGSAAFVSGRCKGRYC
C3210574C3210574__gene_284599F073573MLRGQRWSHLTKNTPKEVFTMPTIVSFYQRFPNEAPPLNLSAFDHTGYTYAENFRRNERYYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMPFQVRRMELRTGDTTYAILPDNVQQYTMRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEAR
C3215078C3215078__gene_287984F059982MTMRRMTRLLCLMLLFSLITFSCPLAEETDAPTGTPAPTPMLTAVPESALAPFNVVLPEDAHVEMAEGRITLVRGDSRVVAMVISRVPDEDPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFGDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENASMDGAAVYVAEDAAATASPEV
C3223416C3223416__gene_294916F102166MEETWRISPEMRAFLMKKLFSLLLVLALALVPTLSLADDDAACQNLYNMLLDELKSVDLEMTADEESYRIYLGYALDKNSLGDADVIFDAYSDAVTINVSYSNPLDEALVPQVISFFNRVNSTLYVGKLMVIKSDNMWYAAYEIFLSVDPENITDWDRNNVLAYTALALDTMEEMVDYITEIANGESADNVFAMWQADIGAV
SRS024435_LANL_scaffold_20298SRS024435_LANL_scaffold_20298__gene_48269F075481MKRLRISLKAVFCLGLSLSLSSCGSRRQVSETSIDSRLISRIETMIDEVMDRKIVEIKTSDLNADIVITKRKFDTDKDVDPATGERPVSSQTDTHIVIGRRDSTVTADSVGVNKKRNDIKDLDNKIDIKSKDVDDKKESKWPTVWIVCGILMILLVLVYILKRIRIL
SRS024435_LANL_scaffold_20298SRS024435_LANL_scaffold_20298__gene_48273F042095MAKTLYKYEALSNKFMWFTTWDRALRNYYTDDYNYVPDPVVGNPFNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGHDSYHIIFRSLAIQHKKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYVNINTCKMTAFPIVDIPDLESLIVSHTRYVNDGIPVDRLSRSKKLIYINLSDIGQRMTEMPEAITSKTEVDHLNMFNILDLRDIESSGIRNIKNMKNLQALDLSSCYLDRYIKEFNDLPKLTSLNITSGPLDMCNYFDINTLPSFEVDKINPNIAYFSFLEDWKNGERRTGWNDDNMSGRGLDHIANFNVSHSNGIRVDKLPDYIYEMRSITRFGMNCSTHSQKRSDDFVNSFYDLVVGWDQITMASVAKDGERNQFYGLAVSMYSSQYPDENQRPSGTEQASEGFVKGQSNGSPATPMEKIYVLKNNYAQIWTIKPE
SRS024435_LANL_scaffold_20298SRS024435_LANL_scaffold_20298__gene_48287F093883MEEDKDIKKEIRDYLKEEADTHIRHWIAIKRESKRLYSDIEDRTKKIALKSPSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFIIQLATSFGMDLDLDTTYASTAKSIIEDRTSGFVFYDDKERLRYADKELEDMFHDMSVTEVSKIGVVQSYELLMKQYNEFKDMKANATGKTKAD
SRS024435_LANL_scaffold_20298SRS024435_LANL_scaffold_20298__gene_48294F058555MKLVYSQKIKSTMGTKIGILHIMKSNFDKIITERYTPRNIQAKKDELGCVKLPAGSLICPVDFKPVTNKEGKKVTAIKYSLKHEEYHGSGIQISDECKMAMIYLIIINVFKHVFLRNRMHGGNRDQIEINTNDFIDILSDGCAYFCYRHVLRDSHEDMNYQLISLKAWAEGEIMIALSDIIKYKHKASKTPRIKDMFVKKGESVYTCLDKSLDSNTRRRMANKSRKLNRVKMLSKIIFSARNRNINKIYKVTKKRTVKFNVSYLMDRLNIKLSKEGMMLISQRTVYRMIKDVLSMCCKTISDLYDEVKKNNGIINTKDRKNVNIGHLRLSYRGTIMHIIIAEDYIRDVFLGVKGAEMSKA
SRS024435_LANL_scaffold_20298SRS024435_LANL_scaffold_20298__gene_48310F083452MSGRVKIKSKDKDKKPKIDVFKIIENRFKNMNELRDMIDMDPKKGLVRIRDGAGFREVERGGCLHRNYLNLLEEELGAKLSIDLIERYIKR
SRS024435_LANL_scaffold_23226SRS024435_LANL_scaffold_23226__gene_55085F077313MGKYVIKRKIPKYQEAGEVTPIMPGNVVGLQGIGVEPLVSSTRIGFDIQQPDINTIDTSDLNAIVDSNKKVDESGSTDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGAKTANVVGTVMSGIGGVLGLARNVFSGMASEQGTRTNIRLAQEREARQRRQSQMRYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVTQPGEAPMEAMGQKHADGGTPVSLEEGTKVITDDTTIESDFAKYIRDTYGIKATPKDTYATLMDRYKAKIGLKSAYDDQKKALEKLKKNDKIDDENTKRLNASVLSKAINDSNDTVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDIAEAKKELLKKVAGIRQKMEIGGTSLFGRKLTFRPIENRFNNDPNYFGYQRQGTDGSYGGINTDERLNYYKTFNPVAYDAYMGASEGTRARALQDAIYGQTSSWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKLGLFTTSRPMIGLDVVTEEQHKALNDAGITHFSQLFSDKNKDVVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGSMDIGDVENPDVKLDMPELIDSNTLPKTNTNAGKSNGGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVEANRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGKYINEVEQGNVAQRTWADNVNARTWTDTYDKNIAQRQGYQSRILQALANTDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG
SRS024435_LANL_scaffold_23226SRS024435_LANL_scaffold_23226__gene_55094F064725MGLQAVEVFFVIQLAREGRLDFFCITDSNTYLCAKDLNMTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAVMSEAVVKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITEKIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKYAWDNYDITISGNTMYTGFDGFIYIIYRATPKDDDGLPYIPETALGYLEDYVETYIKMKIFENAAVNGLIQGAGDAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSHERMWPNAFNKYIKLI
SRS024435_LANL_scaffold_28092SRS024435_LANL_scaffold_28092__gene_67829F075480GDTPDSFRQRVAASLPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTHTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVAVEADMEHEVRLTTPVDAKLDPSGISWAEWAEQNGKTLVPIGMEAAPTLQFLKVNGQTTDTPLIGAFLTQNPDGTVSAGFQVDLTEADTSHLKSCEVQLECRVGAFGKDGKATQWQKEILTATITFK
SRS024435_LANL_scaffold_31519SRS024435_LANL_scaffold_31519__gene_78032F068856LVGMALAEETPDTALGDWYALNTENEAICLTLREDGTFCYDSREGTWCKTTDGEYWLTYNIHDLPEVMERMVNSQAAEQDLTALLTETGLDVYYGSTAKGVVAHMVRDAEELQNVRTPKTDTPLEAFAGTWTMETVFAGAMEMTYTLDKGERLAFCTIDGLTMLPGAALGNFTEGTSYPMTLEDGKLHTTILMQMTEEETLDFDLTFFQTADGSLYATLRLNDVPDNPTTMFLLVPMEKE
SRS024435_LANL_scaffold_7741SRS024435_LANL_scaffold_7741__gene_19379F078004SRRIRSLALLSAVSEHLSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDVHMLRHTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF
SRS024435_LANL_scaffold_8137SRS024435_LANL_scaffold_8137__gene_20442F080673MDEIIKLQDEILSYLRNNITKDEAYYILTTDKDMIEVLISDKKDGSKRIKILDMEYTIEKDDMLLLFDTDGVIDECLLVASYIGVNMYFRRQDVNAILYNINREKVMKYPYIAIQLDNIQTIEKRRVIFDITGHRMDDNKERIDFMFIYFMARLCV
SRS024435_LANL_scaffold_8137SRS024435_LANL_scaffold_8137__gene_20447F078006MEDNILKRAAAELKEAGCRVFAWQDDFYNRSWDKGDYTMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSCGSGSGCCVKEEATFDLATALDVLNGPLPRWCRSYGVYPKQYDNIDKWYNSDNYNKKLFKEI
SRS024435_LANL_scaffold_8137SRS024435_LANL_scaffold_8137__gene_20448F085718MVIEFDFEIYKNGDYDKVYLRNGKEARVLCDNGKGNSPIVVMIEDDKADDYIILRYNETGRRNINGQSGLDLMLSVKEREPELWVVVISYMDNKDKRQKMVLPNFFSRNIRGNIYLQGSSKSSVSYYVDKLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDETAV
SRS024435_LANL_scaffold_8137SRS024435_LANL_scaffold_8137__gene_20463F078005MEKSEFVKNLEKIIDMVKTEDDGFEYGGKVIFYKEDDSNYEVSVMNIEMNLGVEANVMAGMDDMDFTCLMCEVYKQKAVKAIMMEKDDDEDN
SRS024435_LANL_scaffold_8137SRS024435_LANL_scaffold_8137__gene_20491F076653MTIRDKYFGWKDIFFDRFVHCCNEKSGQPQGSNIPLAKINFDNKTGYVEDGTINIAELLQYLWINNKVYGCEYAPIDISSVLQTLIRLTENAKFIFDDQPGIHDMIPYRGFFLRDDFLPGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSPLAKKLGYEINSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDNFMGKRYRKNLRTFIYRGIVFFLDRIWHTPLFEKMGVKMKYNAYYCYAATSGIWYNKGFKKRLAKRFNESLRGGGDLFGANLACMVCDRRDIDWEALRLWLDKYDEPTDKGMVNSPIQFMYLYLYYAFNK

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.