| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300008547 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052942 | Ga0111055 |
| Sample Name | Human stool microbial communities from NIH, USA - visit 1, subject 159733294 reassembly |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 157217036 |
| Sequencing Scaffolds | 12 |
| Novel Protein Genes | 17 |
| Associated Families | 16 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| Not Available | 3 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 2 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F026592 | Metagenome / Metatranscriptome | 197 | Y |
| F032312 | Metagenome / Metatranscriptome | 180 | N |
| F044554 | Metagenome | 154 | N |
| F059106 | Metagenome | 134 | N |
| F060985 | Metagenome / Metatranscriptome | 132 | N |
| F064817 | Metagenome | 128 | N |
| F067844 | Metagenome | 125 | N |
| F071325 | Metagenome | 122 | N |
| F080163 | Metagenome | 115 | N |
| F087334 | Metagenome | 110 | N |
| F087336 | Metagenome | 110 | N |
| F089054 | Metagenome | 109 | N |
| F090514 | Metagenome | 108 | N |
| F101355 | Metagenome | 102 | N |
| F101357 | Metagenome / Metatranscriptome | 102 | N |
| F105375 | Metagenome | 100 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0111055_100149 | Not Available | 85034 | Open in IMG/M |
| Ga0111055_100244 | Not Available | 66120 | Open in IMG/M |
| Ga0111055_101010 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 21994 | Open in IMG/M |
| Ga0111055_101293 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 17706 | Open in IMG/M |
| Ga0111055_103265 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 7266 | Open in IMG/M |
| Ga0111055_103637 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 6521 | Open in IMG/M |
| Ga0111055_106309 | Not Available | 3679 | Open in IMG/M |
| Ga0111055_106540 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Gemmiger → Gemmiger formicilis | 3536 | Open in IMG/M |
| Ga0111055_127108 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 810 | Open in IMG/M |
| Ga0111055_128348 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 775 | Open in IMG/M |
| Ga0111055_133524 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes | 658 | Open in IMG/M |
| Ga0111055_135988 | All Organisms → cellular organisms → Bacteria | 613 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0111055_100149 | Ga0111055_10014978 | F089054 | MLTKGKFLVSFEVPGHTKDYTEGFTEEMVIPYRTEELNPYLRYPNQEINKNHLHSEFIRLRLREILQIDLSDITIIDIIPLP* |
| Ga0111055_100244 | Ga0111055_1002443 | F105375 | MKNNETFQTTQHLDKLVTNLGLQIQELFSLDLEGILDYSNNLMNLLVNAYVENQCLAVSAMISKQDGFAIYSFLFQTPDTSNGAADALVNFAMNFTDGEANIKSINRISSNIMQITFTV* |
| Ga0111055_100244 | Ga0111055_1002445 | F064817 | MNIKNLFNRFRKSKESELSYSLNLIYLEDTRVVFNQNIQCAKDLENYLSAYMRLFGMYSDKPYVLIYQEYKSRYWVYDKEPYLLYYKVPLIVNTSRKLSGKSDRVITKEKYQAVKALVPAHEVSDRFKIPEYITGVFTDIWYKCQGYMDTDHVGLEEILELMQYNWLKEFELLVFKENYDTDMLFLSHSLTYILDQTEEEGRRICIQNIIERNINQENQDENETI* |
| Ga0111055_100244 | Ga0111055_10024481 | F032312 | MPKIKDYDEDLSAPKLLRERARDSKGRFIKKDLPSYLGSEQVLKPKNYYHFDSHGNYKGSSMNFDALVCLGFTWFKLLGVALMMLLWPIVFIYALNDGIEGYPFKKYAIPYIFILVAWFIIFLYGLVS* |
| Ga0111055_100244 | Ga0111055_10024482 | F060985 | MSNIDEKAKNNFTIEMRIFENYEKVKHEIIKAIDFLEHSGTAMGMCNIFDNQNHEFWHSVIKPWFQPERFGITHLWFPSGFSFIGYGEYHTIRGNRWLKTPIDKIDKEGRISGYWFPYYKKYIPHRIKVLKLALKDLVRIKILNARDVFILWLLCVLCLGMPLTCV* |
| Ga0111055_100244 | Ga0111055_10024483 | F080163 | MKTIKLLQESFETKERFQQEISIKYSYNRDTVESIDFRINQRNIRYFYEAMQNFENSLVNEFKEKKNNFCDAKQFLESIDDFDKIIFVIITYMKTYFDFCKDYSKISLHVHLVQFDFTTNVLIQGFFYNTHRDLSFSTKLESEILNSEIELLQEKLDLIREEICELMGIDPNLQKKGHEENYIFNLNIDSDNQIGFLLQATEL* |
| Ga0111055_100244 | Ga0111055_10024490 | F101357 | MNLNNITTALKTGITIYQYEQWQNTGSVNLMQKESHMLSKVWLKTNIHNPDSLDKPFIQLSATFTSEFDIQEYNEWLNANQYKLYPLLLDILKISLKDDFYNYSNASNIHYEGGKFPSMLTVQLFNLEF* |
| Ga0111055_101010 | Ga0111055_10101023 | F090514 | MNLRIHALKKASRQRPGVKIAAARFFSILYYPFCAKRSSFFQQNVQPRGNFFANRPIFVYFADVLPRLTGAKWFVILLTIVPAGSRTRAGSSLPLIPASNIFLKQTPRRGLPLAWNAGAGSFLFCG* |
| Ga0111055_101293 | Ga0111055_10129314 | F087336 | MVAELEQVPKAFRAAGNQRRAAAKERIKDDAIGHGRVSDRILAEIEDNHMRERDTKIGLAEQRQVAFLGIAFQILPLKSKQKRAP* |
| Ga0111055_103265 | Ga0111055_1032651 | F044554 | RHPGDPPECGGSAQGKCCAAGGGSVSWRSASPCIFFHTQAYVLAGRFIPVLCASIARLFPCRTEIARCLTLDFAISRYLFLSFSFSFRTNFDQALFSSLLFVSDTRAKSILFLLFENEIAHLQGQYRFDSHRYCFSAFLVL* |
| Ga0111055_103637 | Ga0111055_1036372 | F087334 | MVSRYPFVGSAAHLFPKNAIKMLSFSTSGKGSILLYPFRSSPLLAITFLYHPKDFFLYRAAALFEYREKHQ* |
| Ga0111055_106309 | Ga0111055_1063091 | F071325 | ALLSDSSIIISKVVLFVKHFFDIFLSFPNAFLKAFHPHAVRFPAAFLLVYRFYLAFEELLSCATAYL* |
| Ga0111055_106540 | Ga0111055_1065404 | F101355 | MAFWFIQWYLPSAQPPQPKGAGAVFPYVLPRCSYFFKSFVTEMSIFICMHKCLAQTGRLRGSSCHIVVAAKRACACTLLWISDHFYKKLLPYVLFFFFKIYLKKIDFFQNIA* |
| Ga0111055_127108 | Ga0111055_1271082 | F067844 | NTLAAETASAWYLILNRKLQTLPIYITQLSSHLPRPLTMGTQQVSAGAAVITPQHRSYRVPQGSPQVFGGP* |
| Ga0111055_128348 | Ga0111055_1283481 | F059106 | RVILLQKFVWIVDLKVREHLTEYLKKNIKYHRVTTGVHA* |
| Ga0111055_133524 | Ga0111055_1335242 | F026592 | PQGIAALAAQGGVATLTERSDATFSVKQFSSADGE* |
| Ga0111055_135988 | Ga0111055_1359882 | F026592 | LAANSILCCLRTASPQGIAALAAQGGVATLTERSDETFSVMQFLSADRE* |
| ⦗Top⦘ |