NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300008520

3300008520: Human stool microbial communities from NIH, USA - visit 1, subject 764487809 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300008520
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052925 | Ga0111044
Sample Name: Human stool microbial communities from NIH, USA - visit 1, subject 764487809 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 203612297
Sequencing Scaffolds: 17
Novel Protein Genes: 19
Associated Families: 18

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Lacrimispora → Lacrimispora saccharolytica | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → unclassified Clostridium → Clostridium sp. | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F013656 | Metagenome | 269 | Y
F026489 | Metagenome | 197 | N
F026592 | Metagenome / Metatranscriptome | 197 | Y
F029444 | Metagenome | 188 | Y
F051210 | Metagenome / Metatranscriptome | 144 | Y
F051936 | Metagenome | 143 | N
F052660 | Metagenome | 142 | N
F055775 | Metagenome | 138 | N
F067844 | Metagenome | 125 | N
F067845 | Metagenome | 125 | N
F068811 | Metagenome | 124 | N
F071326 | Metagenome / Metatranscriptome | 122 | Y
F073656 | Metagenome | 120 | N
F076190 | Metagenome | 118 | Y
F078004 | Metagenome | 117 | N
F089054 | Metagenome | 109 | N
F090515 | Metagenome | 108 | N
F105374 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0111044_100008 | Not Available | 399527
Ga0111044_100140 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 123747
Ga0111044_100289 | Not Available | 86290
Ga0111044_100618 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Roseburia → Roseburia faecis | 53193
Ga0111044_100790 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 44440
Ga0111044_100810 | Not Available | 42598
Ga0111044_101224 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae | 30269
Ga0111044_102632 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 13717
Ga0111044_103462 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 9979
Ga0111044_103535 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 9704
Ga0111044_103553 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 9656
Ga0111044_108648 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Lachnospiraceae → Lacrimispora → Lacrimispora saccharolytica | 3345
Ga0111044_109994 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2789
Ga0111044_111455 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Clostridium → unclassified Clostridium → Clostridium sp. | 2347
Ga0111044_113630 | Not Available | 1881
Ga0111044_116745 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 1429
Ga0111044_124131 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 864

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0111044_100008 | Ga0111044_10000847 | F051210 | MNTQYLQMMQTPSMMEALINSSVSAEDANLRSREYAKMFSRNDEMKDLFGLGNAGNLLQKTFSGYAETPLLSTQYFNASVASYVSSFAGYMSIERDFDQPNGLFYWFDVLGVTDMRSVIPNLGPDNYQDIQAMGNFTLNITPTTNADYSSLIGRKIIPGTVRVKIATATEKFELIDNGQGAFMAVAGKISNGTINYLNGRVEFTLATALAGDAATETITIVGKEDVTGTPCNTIGASNAHANDKRFIAKMQQLGLATVPDMLVAEYNIAALGAMKKATGSDMATFLFTKLRELYTKVINYKLVSTLEEGYNGNVMADLDLTQGAMTGQFMDYRSRVDLFDAYLINVESALATKAVKGVDVTTYVAGNMASNQFQKGGMIGKWERNTKMTYINDLLGWYNGIPVLRSTDIAEAPGEGTFYAIHKTKDGQMAPLARGIYMPLTDTPTIGNYNNPTQMASGIYYQEGTKYMAPELVQKVTFKFGI*
Ga0111044_100008 | Ga0111044_10000865 | F071326 | MADMISKNLDKANRLYSIGMKNIKLQLKLLGTEFVVLRPKSNSKWKNVFGGTYSSSSTLENDYDQFTTILILNQNELRDVWNRNRDNLEVYTDDGSLEVGDELQYTRGKYTFRFKISLKMGYSEVAEVFYVYTLNSIIETLDM*
Ga0111044_100140 | Ga0111044_100140113 | F013656 | MGKIYKEPNKSEMETTINVLYSENILSIYTNKVNLQKQLNKLLGAPTKEDKIKRSIAGSRWNISLDEKTKIQKIILKANIYEL*
Ga0111044_100289 | Ga0111044_1002892 | F089054 | MLTKGKFLVSFEVPGHTKEYTEGFTEEMVIPYRTEELRSYLRYPNQEINNNHLHSQYIRLQIREILQIPLRDITIIDIIPLP*
Ga0111044_100365 | Ga0111044_10036558 | F078004 | MLRHTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF*
Ga0111044_100618 | Ga0111044_1006183 | F067844 | MNTLEAETASAWYLILNRKLQTLPIYITQLSSHLPRPLTMGTQQVSAGAAVITPQHRSYRVPQGSPQVFGGP*
Ga0111044_100790 | Ga0111044_1007902 | F013656 | MKKIYKEPNKSETETTINVLYSENILSICTNKVDLQKKLNKLLGEPEKEYKIKRSIAGSTWNISLDDKTKIQKVILKANIYDM*
Ga0111044_100810 | Ga0111044_10081054 | F055775 | MAQIAQQDNLVIEVTTTAAALDGATKKKLIECIEGGTITDVILVKKEVEKKISHARVVSWLVDTTGDSPKYTIHIINANSGAVAAIALN*
Ga0111044_101224 | Ga0111044_10122413 | F052660 | MCILRVLPEKTSERIGQERAGTEWTVVKSKIRLCIRNRSYGRFLHGGILMGIALPIPSHRAKSHDFAYWWPAAAGHSRSADALPGKSNS*
Ga0111044_102632 | Ga0111044_10263212 | F105374 | LRPGFGAAENIRYLVLSKGVFAMKKRLVLLVALCIWMVVAAQTPYGEMPERFRPDTLPCRLGGGVCFGMDGLDAAIPRGGGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRTRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVARSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP*
Ga0111044_103462 | Ga0111044_1034629 | F068811 | MDQDESEHNICSNREGLCPGKKQHGASGWKKIFQHGKEPLRNKDSVSQYCNKKAAVSLILNENVSETLCIFSIDKTNCCRI*
Ga0111044_103535 | Ga0111044_1035354 | F029444 | MEVSEQLTGFELEDLMSWTVSNLQRPFREDFSLEKSGIIAEKESQIFGRRFVGFDRPKKAAPFFNF*
Ga0111044_103553 | Ga0111044_1035531 | F026489 | GGCSLLTPKENQKSASDFDALEPRKRGCSPLLTPKQRATPEKTEDSRLFGVKIF*
Ga0111044_108648 | Ga0111044_1086481 | F026592 | RLRTASPQGIAALAVQGGVATLTERSDATFAGKQFSSADRE*
Ga0111044_109994 | Ga0111044_1099945 | F090515 | GRVSCLEFAGGEKRKTNIAADYAVKVPAFTALLCRHIGSIRTFLKS*
Ga0111044_111455 | Ga0111044_1114553 | F076190 | SVKNVFTAASRVSGRVWSLVRITVPNDLKFVRIRVEKPQNNSLYPYFQREPL*
Ga0111044_113630 | Ga0111044_1136304 | F051936 | KQEGTGQVLFFLLALRGKAFGFSGLSETAVMTPVIINFSLLLIIR*
Ga0111044_116745 | Ga0111044_1167451 | F073656 | MTMEQDQEQMQGALYVAVDDGNKIIAMERSRRSDEGFRALLDEFTDYAANCGAIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGTVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWAFADPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRSGFDNIERKYASDLSDEGRAAQRGGENKTRDAG*
Ga0111044_124131 | Ga0111044_1241312 | F067845 | MKHWKNLLVCLLAGVLALGVLTACSGLGSVNTGTDAEKAAELAQQLGVAHTQELDNTAKAVAEWFVQEPDSLRVSGLDLVYTVALDADSNMSHTDDLNDFLYWSGCYGVPDDVTVALLLDDSAAMTARLYAPQADSAAAELLDDAAGHSELGAAFIDYNGTAYVVAVFR*
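
In the rows above, each Protein ID appears to be its Scaffold ID followed by a numeric suffix (for example, Ga0111044_100008 and Ga0111044_10000847). The Python sketch below shows one way to split that suffix off and to emit a row as a FASTA record. The function names `split_protein_id` and `to_fasta` are hypothetical and not part of NMPFamsDB or IMG/M. Reading the suffix as a gene index, and the trailing `*` as a terminator marker to strip, are assumptions inferred from the table rather than documented behavior.

```python
def split_protein_id(scaffold_id: str, protein_id: str) -> int:
    """Return the numeric suffix that follows the scaffold ID in a protein ID.

    Assumption: the protein ID is the scaffold ID plus a decimal suffix,
    as the rows of the Sequences table suggest.
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id} does not start with {scaffold_id}")
    suffix = protein_id[len(scaffold_id):]
    if not suffix.isdigit():
        raise ValueError(f"unexpected suffix {suffix!r} in {protein_id}")
    return int(suffix)


def to_fasta(protein_id: str, family: str, sequence: str, width: int = 60) -> str:
    """Format one table row as a FASTA record wrapped at `width` residues.

    Assumption: a trailing '*' is a terminator marker and is dropped.
    """
    residues = sequence.rstrip("*")
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return f">{protein_id} family={family}\n" + "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Row for family F026592, copied from the Sequences table.
    scaffold = "Ga0111044_108648"
    protein = "Ga0111044_1086481"
    seq = "RLRTASPQGIAALAVQGGVATLTERSDATFAGKQFSSADRE*"
    print(split_protein_id(scaffold, protein))
    print(to_fasta(protein, "F026592", seq), end="")
```

Passing the scaffold ID explicitly avoids guessing where the scaffold part ends, since both parts are runs of digits.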

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice