NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000736

7000000736: Human stool microbial communities from NIH, USA - visit 1, subject 763678604



Overview

Basic Information
IMG/M Taxon OID7000000736 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0053330 | Ga0030476
Sample NameHuman stool microbial communities from NIH, USA - visit 1, subject 763678604
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size195916482
Sequencing Scaffolds18
Novel Protein Genes19
Associated Families19

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10
Not Available1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → Viruses → Predicted Viral2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct3pM21
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationNational Institutes of Health, USA
CoordinatesLat. (o)N/ALong. (o)N/AAlt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039147Metagenome164N
F043945Metagenome155N
F044554Metagenome154N
F051934Metagenome143N
F051935Metagenome143N
F055739Metagenome138N
F055775Metagenome138N
F056682Metagenome137Y
F070133Metagenome123N
F073573Metagenome120N
F073574Metagenome120N
F075480Metagenome119N
F077319Metagenome117N
F077320Metagenome117N
F078003Metagenome117N
F078004Metagenome117N
F089054Metagenome109N
F095494Metagenome105N
F096287Metagenome105N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C3394277All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales598Open in IMG/M
C3395925Not Available603Open in IMG/M
C3498179All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1312Open in IMG/M
C3512731All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1619Open in IMG/M
C3555201All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6784Open in IMG/M
SRS014235_WUGC_scaffold_40175All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6168Open in IMG/M
SRS014235_WUGC_scaffold_42668All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2343Open in IMG/M
SRS014235_WUGC_scaffold_47231All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae8582Open in IMG/M
SRS014235_WUGC_scaffold_52680All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales4016Open in IMG/M
SRS014235_WUGC_scaffold_61618All Organisms → Viruses → Predicted Viral3799Open in IMG/M
SRS014235_WUGC_scaffold_66688All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales3992Open in IMG/M
SRS014235_WUGC_scaffold_67642All Organisms → Viruses → Predicted Viral2881Open in IMG/M
SRS014235_WUGC_scaffold_67678All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes7615Open in IMG/M
SRS014235_WUGC_scaffold_69784All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Paenibacillaceae7118Open in IMG/M
SRS014235_WUGC_scaffold_70483All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales4405Open in IMG/M
SRS014235_WUGC_scaffold_71218All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Myoviridae sp. ct3pM22353Open in IMG/M
SRS014235_WUGC_scaffold_71737All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales2253Open in IMG/M
SRS014235_WUGC_scaffold_73314All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia8463Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C3394277C3394277__gene_243814F073574MFHAASRRRAPLCRDVDDTASRVRCAIVLNGSKELHIQLCGRLKRGLFGRNQLLADGDVLCVALHQPDGDVPLFYSRCDGYANVLDDRQPPAPIPLHPAICPKCRNAAFQLRLTFEYPEAEELAAFANPDDMFTWVWVTMRCTRCHAVFRGDFAAD
C3395925C3395925__gene_244489F044554FHGGLPPPCIFFHTQAYVLAGRFIPVLCASIARLFPCRTEIARCLTLDFAISRYLFLSFPFSFKTNFAQALFSSLLFVSDTRAKSILFLLFENEIAHLQGQYRFDSHRYCFSAFLVL
C3498179C3498179__gene_284403F051934AAALAAVLVLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVASDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQSTTLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGLRQAIADLTAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGTDANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
C3512731C3512731__gene_289981F073573MPTIVSFYQRFPNEAPPLNLSAFDHTGYTYAENFRRNERCYEAEQCEVTSDDGSVVLSLDVRIRPRGGEIEALPRVMLSVYSEGMLFQVRRMELRTGDTTYAILPDNVQQYTKRGDTDGGFLETMAIPLGKVGMKMLLEASDAAEARWVIQGARTAQTLPLP
C3555201C3555201__gene_308428F077319MLIRNATLQMNATECACMDVRVMNGCVWEMGAALVKGLYESETDLCGDVLMPGRMLETPISAADEKALRLLCRRLYREGVRYFVADCPADALLRVQNRPERRGALPVTALPNPEPLRSGTGMPLTRWTAAGEFVGMMDEHSGD
SRS014235_WUGC_scaffold_40175SRS014235_WUGC_scaffold_40175__gene_66921F070133MQLIRMNPLAFRKEPVELYSVTHTPNGSFVVIYFTEGEKSELQEMWMELFDSVGTSLLSAKLGEFDPNGEKIPHGQIILKKDRFICEYYPDITSMEVCTQTVYRYTGKRIQKPTIKKLKFGAAPYAQHVGDYMVEKQAHSEDETPFRTVKITHIASGKSKKLMIYDWSFCAFPDQDGNLLIAQQNEKGNLEIRNCNAAMQESIVELSGDFLQNENIRDAACIGQTAYMRIRLTNEKSEILLYDITQQKITDSQTLLAVDDNSYIAEIKAAGAVLLSVDGYWNRELQRQKYQINLLNEHFETSRLPLQHESCLYIFTDVEQADVTTIEMDEKSHSYFVCSYSISAGE
SRS014235_WUGC_scaffold_42668SRS014235_WUGC_scaffold_42668__gene_71346F056682VVEKPHKQNNVKMQSRAEKVANQPIGQDKMYPASFGTFPLKNRSTFPIQELGKNREKQEV
SRS014235_WUGC_scaffold_47231SRS014235_WUGC_scaffold_47231__gene_80502F077320MKRKGMRRRRKLVLLAVLLIMVSIAVWRIWQTPRPVVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGLLLYIRDYSASGHYQIVWENVSAEAAESYLTALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEESTPTPLPEWLADW
SRS014235_WUGC_scaffold_52680SRS014235_WUGC_scaffold_52680__gene_91926F078003VNEHLAFAQCLQRVLSETELTATEVARRLEMRSRNSIFRILKGKTSPQLNRRFLESFHKHMGEQLTEAQWAALNRALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGTEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREEYEPNVAALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPYTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTESLNQLEELVRRCEKMIREGQKGNAGNEQEKISGQ
SRS014235_WUGC_scaffold_61618SRS014235_WUGC_scaffold_61618__gene_111859F055775MAQIAQQDNLVLEVTTAAAAALDGATKKKLIECIEGGTITDVILVTKEVEKKISHARVVSWLVDTTGDSPKYTIHIINANSGAVAAIALN
SRS014235_WUGC_scaffold_66688SRS014235_WUGC_scaffold_66688__gene_125069F096287RTLTDCARKLLGNFGIILIMLTLTLPLLLDGIASLTALITFFTEGIGARGSQFTLTLLTASVMLIALNRDGLPRGVYLLRHVLLVAAAIIAINALLDAHPDGIVPLLGEGVPPLLSGIRSAWGMSWVLLLLLEFPAEEGTRRTPAMFVALLPCPVILLLLSLTIPSELTVPGRSLASRLALPTLFLQPAVRTLAQCLLMMTLFLSIAGSAQLAARFLTSSCQKPKKWVPYALIGLLTLTQLFDISRLWRVLTDFTAWSLVPGVLLLLVLTIARLCRREKA
SRS014235_WUGC_scaffold_67642SRS014235_WUGC_scaffold_67642__gene_127666F055739MTEQQLREKLQFAYGNMPDATRAAFEHSLTHHRAPETHRNIGLSRMMRIVITAVLMALMLTAVGVAAARFFSVTDVHPAQDGTEGEYQAHYLALEERYDSDLLSVSVNDAVYDGSVLAFTMEMAAKTDDVLAVEVRVRGECDGKMYRFDPLDVYGGEFQSLLMLPDLGGTFDGEKYAAEGILLDENGQMPPEGKTIAWTIEIDVLKAVWQTETMPDDLYEALSEEDDVAQYIREQAEQRVITLTDAGVEDYLLEMCGAGWDEIERISKADLLLRCGGFERAETYTVAFATEGNTQYVHPELSGLRIPLDGYTAVVDYVRASFLGGCVVLHCEAPNGTALPDVWRIYRNDEQNPAGEAGWARASGYAGVPGGVVDVNQPSICLYFAPCADLTSLRIVPEGEAGFTLNLSGEKGTE
SRS014235_WUGC_scaffold_67678SRS014235_WUGC_scaffold_67678__gene_127806F051935VSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAANLKEMRFLPVVAQKDNQLRLLILRKQGDLWKVSEQNDRALMRDGWTLQNFSAMPYGNSDWTYIYFDFVDENQKRWNLMLNLGDGYVSSFGTISHYVEGYGTTYINMNYDRGLEFLIDAPAYSRLSYEVYPVEDYSFGVEDFDLATCPLSMQEFLVSAIVTCGEEGAGLYIMVQQDVQPIVTLADGDAIEAIPQKWELDWTIVYYQGNYLFMKTENCKMEE
SRS014235_WUGC_scaffold_69784SRS014235_WUGC_scaffold_69784__gene_134468F039147MERKRIAMRARRLLILLMMLLLLPRAQAERLTLYTRPGQVDEATPFQLRPTELSICSVTRAMGGVVVLANDDNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYAMSRVPNYRMPDLTHAISELTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMHVCGSVLAISVMQENGIKVVLVDLADGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYTLSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPEVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRDGKIYAMPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQANQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTTNADLVGKLLAQVIADQEATAKCVLLADYDEPIEDSYYLIMVNDYEKTLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFSALVSQVHQGEISLEEFVEEADKLIEGLEQ
SRS014235_WUGC_scaffold_70118SRS014235_WUGC_scaffold_70118__gene_135731F078004VPAVSRRIRSLALLSAVSEHLSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDVHMLRHTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF
SRS014235_WUGC_scaffold_70483SRS014235_WUGC_scaffold_70483__gene_137022F075480MNKTKQEKWQRAYGDTPDSFRQRVASALPKGEESRHVAFPRRAMVLAAALVLVLTTAYAAVVTQTELVWNAGHPIENEADDRLGLLTGKAGTSGDSLTIGGVTFTVQDGIYSPETGQLFASAVISADESVQLVGVESDMEWEVRAVTPVSEKLDPSGISWAEWAEQNGKTLVPIGMEAAPTLQFLKVNGQTTDTPLIGAFLTQNPDGTVSAGFQVDLTEADTSHLKSCEVQLECRVGAFGKDGKATQWQKEILIATITFK
SRS014235_WUGC_scaffold_71218SRS014235_WUGC_scaffold_71218__gene_140094F089054MLTKGKFLVSFEVPGHTKDYTEGFTEEMVIPYRTEELNPYLRYPHQEINKNHLHSKFIRQRLREILQSDITIIDIIPLP
SRS014235_WUGC_scaffold_71737SRS014235_WUGC_scaffold_71737__gene_142319F043945VTSALPFEDELFFISGSQLLRVDDPEYEPEVVCNLEDIIWSEALKQSGYYGQVMLLSGGAELVLFSANEGRLYSFVPPTDDTMVRMQMVTELNYSTVPQMGWKKFPIAKDLYDVTVEQAVYDAQYGLYVIAKDKTDEYQIVWFDLDTGKGEMLEFALEDDENSHLLTDMPGIFQWSGTSSKYERTLYDGKVLTTVDLSSGEATKAVHNWVNTLTLQPIPDYDVTLGDYVTHDAYYAYAPNKDNSAMYFVHDNTVWVMEYDEENSQFSEPRGIDKLPFFAEDTMCGF
SRS014235_WUGC_scaffold_73314SRS014235_WUGC_scaffold_73314__gene_151503F095494MQYHIVKGGFFVSDPKEKEVQAILQAIDYGHLPPRETQRRLLNIIQAEAARTDAPTDETKIHTCMDLLERLQGEQKPIAPARVDALRQHIAAAHQKNERKRQKRKKIMAAAACSAAAIAVAFAVSHPLLWYANWTTSDEQQHFVTSHEIAIEMLETAVADPMLPSGDTVEVQSIAALDALIGRKTGIPEMVNGQWELQHRYVNFTRSGISISLMYVNAADAQQTIVGVINLISNPQYMMLSFEQSYEGTIQQFDGLNFYITENINKPVALWQGDDKLLLFSGRTSQEEVTSLLRTIIREIGE

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.