NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300007802

3300007802: Human stool microbial communities from NIH, USA - visit 2, subject 159268001 reassembly



Overview

Basic Information
IMG/M Taxon OID3300007802 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052825 | Ga0105667
Sample NameHuman stool microbial communities from NIH, USA - visit 2, subject 159268001 reassembly
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size161361095
Sequencing Scaffolds5
Novel Protein Genes7
Associated Families6

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella1
All Organisms → cellular organisms → Bacteria1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F026592Metagenome / Metatranscriptome197Y
F029444Metagenome188Y
F074899Metagenome / Metatranscriptome119N
F078004Metagenome117N
F099269Metagenome103N
F101356Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0105667_100639All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae35705Open in IMG/M
Ga0105667_102897All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales10166Open in IMG/M
Ga0105667_103502All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia8165Open in IMG/M
Ga0105667_111909All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Coriobacteriia → Coriobacteriales → Coriobacteriaceae → Collinsella1926Open in IMG/M
Ga0105667_125644All Organisms → cellular organisms → Bacteria805Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0105667_100639Ga0105667_10063916F074899MSGWKQWSKQAVTSSFLDKKPPKSLVQQGLEGSTPVGKDEVGSSNLPSSSNKAL*
Ga0105667_102897Ga0105667_1028977F029444MEVSEQLTGFELEYLMSWTVSNLQRPFREVFSLEKSGIIAEKESQIFGRRFVGFDGPKKAAPFFNF*
Ga0105667_103405Ga0105667_10340510F078004LSSRNTDFLFRDLLFRKSSTGGLSAVAGSAALDVHMLRRTLIITIINALYRLTVDTDGMAWMRQRITERLSSLSLLRKALAAGAVTITGMLTSHHDVSLAAQTVLVIGTIFHNTF*
Ga0105667_103502Ga0105667_1035021F074899MSGRKQWNKQTATSSFLDKKPPQSLIQQGLEGSTVVGKDEVGSSNLPSSSKKHRKLRFLV
Ga0105667_111909Ga0105667_1119093F099269VGELLLLDDLGDRASGASVLASAAGDAGVLVSDGSDVLELQNASGAGVDANATSDALVGINYGMSHGSFLSVDRRYRRCAPV*
Ga0105667_112764Ga0105667_1127642F101356VGELMPVVTPEKDGLSNSKFATTKIKSEGKRSVLLYRSSSSQWAPFAIRVSCISTGEPLSDFCVYIAGNTMELQDSTKVYVKYLYGQPNSDTYLKMKYETDHRISIYLTSDNSLGDRTIVRELIVRDSMYDMATQDDEITGLADCTIVQ*
Ga0105667_125644Ga0105667_1256441F026592QTVCQNRLRTASPQGIAAQASQGGVATLTERSDATFSVMQYPPADGE*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.