NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007650

3300007650: Human stool microbial communities from NIH, USA - visit 1, subject 604812005 reassembly



Overview

Basic Information
IMG/M Taxon OID: 3300007650
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0063646 | Gp0052761 | Ga0105532
Sample Name: Human stool microbial communities from NIH, USA - visit 1, subject 604812005 reassembly
Sequencing Status: Permanent Draft
Sequencing Center: Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 129959930
Sequencing Scaffolds: 19
Novel Protein Genes: 23
Associated Families: 23

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 2
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 11
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:74 | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
Type: Host-Associated
Taxonomy: Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Host-associated → Animal → Animal surface

Location Information
Location: USA: Maryland: National Institute of Health
Coordinates: Lat. (°) 39.0042816 | Long. (°) -77.1012173 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F026592 | Metagenome / Metatranscriptome | 197 | Y
F032312 | Metagenome / Metatranscriptome | 180 | N
F041208 | Metagenome | 160 | N
F042910 | Metagenome | 157 | N
F044555 | Metagenome / Metatranscriptome | 154 | N
F051934 | Metagenome | 143 | N
F051935 | Metagenome | 143 | N
F068856 | Metagenome | 124 | N
F072366 | Metagenome | 121 | N
F073656 | Metagenome | 120 | N
F074898 | Metagenome | 119 | N
F074899 | Metagenome / Metatranscriptome | 119 | N
F076064 | Metagenome | 118 | N
F077319 | Metagenome | 117 | N
F089005 | Metagenome | 109 | N
F094005 | Metagenome / Metatranscriptome | 106 | N
F095494 | Metagenome | 105 | N
F100397 | Metagenome | 102 | N
F101192 | Metagenome | 102 | N
F101193 | Metagenome | 102 | N
F101357 | Metagenome / Metatranscriptome | 102 | N
F105374 | Metagenome | 100 | N
F105375 | Metagenome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0105532_100096 | Not Available | 84161
Ga0105532_100460 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 37292
Ga0105532_101816 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes → Alistipes onderdonkii | 10436
Ga0105532_103227 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales → Rikenellaceae → Alistipes | 5465
Ga0105532_104007 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → Faecalibacterium prausnitzii | 4501
Ga0105532_109398 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2140
Ga0105532_109907 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 2047
Ga0105532_110554 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1936
Ga0105532_111103 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Faecalibacterium → environmental samples → Faecalibacterium sp. CAG:74 | 1855
Ga0105532_111564 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1787
Ga0105532_112877 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1631
Ga0105532_113043 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae | 1612
Ga0105532_115065 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1421
Ga0105532_115956 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1347
Ga0105532_119980 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 1095
Ga0105532_123342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 939
Ga0105532_128005 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 774
Ga0105532_132126 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae → Ruminococcus | 668
Ga0105532_134519 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 618
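
As a cross-check, the per-lineage scaffold counts reported under Dataset Phylogeny can be recomputed from the scaffold list above. The following is a minimal Python sketch, not part of NMPFamsDB: the function name `tally_by_taxon` is hypothetical, and each lineage is abbreviated to its deepest assigned taxon for brevity.

```python
from collections import Counter

# (scaffold ID, deepest assigned taxon, length) transcribed from the scaffold list.
# Abbreviating each lineage to its last element is a simplification for this sketch.
SCAFFOLDS = [
    ("Ga0105532_100096", "Not Available", 84161),
    ("Ga0105532_100460", "Alistipes", 37292),
    ("Ga0105532_101816", "Alistipes onderdonkii", 10436),
    ("Ga0105532_103227", "Alistipes", 5465),
    ("Ga0105532_104007", "Faecalibacterium prausnitzii", 4501),
    ("Ga0105532_109398", "Eubacteriales", 2140),
    ("Ga0105532_109907", "Eubacteriales", 2047),
    ("Ga0105532_110554", "Eubacteriales", 1936),
    ("Ga0105532_111103", "Faecalibacterium sp. CAG:74", 1855),
    ("Ga0105532_111564", "Eubacteriales", 1787),
    ("Ga0105532_112877", "Eubacteriales", 1631),
    ("Ga0105532_113043", "Oscillospiraceae", 1612),
    ("Ga0105532_115065", "Eubacteriales", 1421),
    ("Ga0105532_115956", "Eubacteriales", 1347),
    ("Ga0105532_119980", "Eubacteriales", 1095),
    ("Ga0105532_123342", "Eubacteriales", 939),
    ("Ga0105532_128005", "Eubacteriales", 774),
    ("Ga0105532_132126", "Ruminococcus", 668),
    ("Ga0105532_134519", "Eubacteriales", 618),
]


def tally_by_taxon(rows):
    """Count scaffolds per deepest assigned taxon (hypothetical helper)."""
    return Counter(taxon for _, taxon, _ in rows)


counts = tally_by_taxon(SCAFFOLDS)
total_length = sum(length for _, _, length in SCAFFOLDS)

# Print taxa from most to least frequent, then the combined scaffold length.
for taxon, n in counts.most_common():
    print(f"{taxon}\t{n}")
print(f"Combined length of listed scaffolds: {total_length}")
```

The tally covers only the 19 scaffolds listed on this page, so the combined length is expected to be far smaller than the Total Genome Size reported for the whole sample.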

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0105532_100096 | Ga0105532_100096108 | F094005 | MKKEIVKLKEGNSVIYQDKTLMEKANVVSIDKKNGTAILSNKVIITRTTNLDGQFTRLDGKGKGNAIILPCTTENEQKYNAFVAYHQSKKSLEAIKKWLDDNGKHKDEETFEKVITLDKKLKKLIEKLNE*
Ga0105532_100096 | Ga0105532_100096109 | F032312 | MARIKDYDEDLSAPKLLKERARDSKGRFIKKDLPPYLGAEQVLKPKNYYHFDSHGNYKGSSMNFDAMVCLGFTWFKLLGVALMMLLWPIVFIYATLNDGIEGYPFKKYAIPYIFILVVWFIIFLYELVS*
Ga0105532_100096 | Ga0105532_10009611 | F101357 | MNLNNITTVLKTGITIYQYEQWQNTGSVNLMQKESHMLSKVWLKTNIYNPDSLDKPFIQLSATFTSESDIQEYNEWLNANQYKLYPLLLDILKISLKDDFYNYSNASNIHYEGGKFPSMLTIQLFNLEF*
Ga0105532_100096 | Ga0105532_10009614 | F105375 | MKNIETFQTTQHLDNLVTNLGLQIQELFSLDLEEILDYSNNLMNLLVNAYVENQCLALSAMISKQDGFAIYSFLFQTPDTSNGAADALVSFAMNFTDGEANIKSINRISSNIMQITFTV*
Ga0105532_100096 | Ga0105532_1000965 | F044555 | MKTTNPSSRITLSQNGNQILTCKVYKEPNYILSMSNEEILEFISGLDYMGNLPTVPDLEKPIEIQVSTTRQIPLEQNKEVQTKIKEIIYNNLYDTLIDELKNTISRFQAQYNIQEINPYLQDILQNPEDLVSLSQHDK*
Ga0105532_100460 | Ga0105532_10046013 | F073656 | MTMEQDQEQMQGALYVAVDDGNKIVAMERSRRGDEGFRALLDEFTDYAANCGAIPSVLFFDIRTTDAALLPRIEAAEHSYLLDPSTATEKLDIRHAVTLKDLLYCYKYDLDPLAEGNCGNMLSTKADYRQFRNEGLPPVAREDLRRCRAVETERGTVLFTQEPDGREACERYMQHHADCFFDPDLGVETLRVYEVEADPDGFWDKVNPQVLPTAGGMMWVPEHPFVDAEVLRRGYCLKEYDMRATADNFWTFVDPQHGENLYVSNGIRDLTGLQIIMQRGYGYLMQNAERYWNREFVFRSGFDNIERKYASDLSDEGRAAKREEQYNLAAYILDRKFPIRRRPSSEIPPMQAEGIRTFRNFDAINLLFRPDKLLEAYQRRRDEPVRGTEFHLKRH*
Ga0105532_101816 | Ga0105532_1018163 | F076064 | LKKTIPGRALENLFYLQYDLPPEASFHATTEEAKRPDELYMRKLLPELTRLKLQPRHVVANDEVYYAAMKGVMLFTPEAEKLMLTEDYFSARRQIRLCAPDLKRRNETRRYPMPVLKLY*
Ga0105532_103227 | Ga0105532_1032274 | F105374 | LRPGFGAAENIRYLVLSKGVFAMKKRVTLLVALCIWKVVAAQTPYGEMPERFRPDTLPCRLGGGVCFGMDGLDAAIPRGGGASSCRDPRVVFVAGDTLITFISVAGVADTALGDPVCRFYGRNVARVVSRMRRMTGGRMGAADNDFPDDPDFAELQGVVIENQRYPWESYAAGDSAYRLPVVRSLVGGKEDPLLGSDMRRRYVRLLTEVSVELKAGGTRPFVHVVYLLPDP*
Ga0105532_104007 | Ga0105532_1040075 | F026592 | RTASPQGIAALAAQGGVATLTERSDATFSVMQFSSADGE*
Ga0105532_109398 | Ga0105532_1093982 | F101193 | MARKCTEYHYFKHTYWDAMARYNWTRCPHCGKLCKPSRFKAATVLVPIELVIFVVCIFFRNSMNDAIGWFAAWLLFVLLLFLPQYIYVRFFMPYETLSEDETRKFHELQKN*
Ga0105532_109907 | Ga0105532_1099072 | F074898 | MRKILSLLLMLALFLPCALAETPQGIDLALTSTYGDGLSLWMTAGLGETPFCSLTLPSGQINLAFSPDKGLCVQSGGQWAQLLVEGHPVDAALLTTPLKLLDGRSPSEVLGLLAEDVNKMLDAIPSLYNLPMTLIRDPAFARLFSDLYIAAQTGTISITSEELNRMAASVLGKIYSMEYLDGLNFSDIYHNLLASVVQSSEVARAYRSAMRGYLLSNFRLSGQIGIDKGELLFRQPYQEPYHLTYEQTTPNSWHYLLTTANSDTIDGVLYWRNEYDFALSGYSENQHTAFSVTCSYGSANNFVFIASVDNSFSLSIVQAGSNFSLRLNNENANVFALNANVFSLESTSDYIRVKIYYNYYTLTITPATDGLDIVAQARTFDIFTAHVRTALNGLICTGVYGDTTYRLALTETATGFSLLLSLPDGDFHLDFAALSDTSCSLTFTDAAGQQVSLTGSLCTPESPVIPDAIGTIELTQLLDGLF*
Ga0105532_110554 | Ga0105532_1105542 | F100397 | MKKLIAILVLSLSLILCLTACGSKMPYDLSETASVELHAYNNDSTEPFAKIVVDGEDVATIVEMFNSLKLKEMKYTEPSIRGYEFWFRDENGSEIVKIELPYGPSPWLVVGGTEYPYQDVNGGVDVDYLAQLVDMTISAGPMQPEDGVDHNAPVEPLFYKSAPGLSIQHDGKSVRALPGTSSWQYMNPDGTSTGIEADSMHPLDAKEYMPVLPAVEGEAWLVFDTAPDEITVKAWPVSKWGDLSAVDEAIVVPVSGDKITLLEGGHIFE
Ga0105532_111103 | Ga0105532_1111033 | F101192 | MQTEPSDKEGRNMTYRGWLLVDIALLLTALSGTQTSLCQRMRSVPVSRGLTYWEVQQLAKAAPTSITPCGEDTIRRWVSGRYQIALRFNRYDVCLGVEEEIDG*
Ga0105532_111564 | Ga0105532_1115641 | F072366 | MLDILAIKADVYHLERQGKRLPVYRYLREVWQKEPPSEGLTVLALQQMVDYVEYVDDLTVLGEPWEAENEYDLYQDFLLDVISWGLQKYRAKKRFLWQICYYVNAWATFYYIFGREITQENVEQWKKTLFEEAKERYPDSMLFEFIPHAAQLDYVWFYRLTDEQRLQIRLEVGEWNLQKNDMDQAVQSYFDDAMTWYRDNGRKLLEAKNKTNN*
Ga0105532_112877 | Ga0105532_1128771 | F041208 | MRKILAMLLSVILLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMRAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAIASYIRRFADLNGFVEVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDVAQTFFAVQADDADGETNLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDMLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDTFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGLWGPQDG
Ga0105532_113043 | Ga0105532_1130431 | F074899 | WNKQAVTSSFLDKKPPESLILQGLEGSTTLGKDEVGSSNLPSSSK*
Ga0105532_115065 | Ga0105532_1150652 | F068856 | VKDSGASAEGNVLAAAHAVLGKGRKGMKRLTSILLALLMLVGMALAEETPDAALGDWYALPSDETVLRLTLREDGTFFFGTEGISGIEGKWRKTTDGEYNLAYTNRSSSLLDVIMSMVDSQAPAPDMTMTARLTESGLDVFYGSTAEGAVVHMARDAEELRTERTPRTDTPLEAFAGTWTMETMFLGTMQLTYTPEMGERQVFCTIDGLTMFPGAGLESFPEGTSFPLTFEDGVLRTTIPLTVQMAASSALVKEIVVDYDLTFFQ
Ga0105532_115956 | Ga0105532_1159563 | F051935 | MKRLLGLLLAMMVMMGGISCAVAENTNPIVSDWLTELKSKRLIQIVEDEIGEGWTLYQPNGREEEENFSSAENLKEMRFLPVVAQKGNQLRLLILRKQGDLWKVSEQNDRALMRDGWMLQNFSAMPYVNSDSTYIYFDFVDENQTEWELVLNLSTVYVSYFRMIYTAEEYGITEIVFNYERGVDFQFDAPFYFRLSYDVDPAKSLSFGVADFELATCPLSMREFLVPAVVSCGEEGAGLYIMAKQGIQPILVLADGEMIEAIPQRWQRDWVIVCYRGNYLFMKTENCKMEE*
Ga0105532_119980 | Ga0105532_1199801 | F051934 | LKSLFRLLTVEGDVVASDGSFDARIDLSLTNAPEKTATRIRFFGLDSHWGIQTTDLGGETLMFNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGVRQAIADLAAQENDGRLENTALIACAEEIARLSEDDRALYYYIEAFGLESGTDADIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEETVFSYAEADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNAQVSFSLPVSYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTVIIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLSGPSGGTAPGGK
Ga0105532_123342 | Ga0105532_1233421 | F042910 | GARCAGGCRRPGAGAGLHLLRRQREQLSLRVRFLIEEYDIAARHQLLHLVAAWAEAGGVLTLHEDGKRVLRVVCTQYPTMSTLNWLETLSLVFTAFSCPYWEDAAETSFLMPNTSDAPSKLLAVPGDAPETPLNLLIRNIGDAAITTLTISAEGKISFQGLTLAPGAAVRIHHDAGVFAAEMVSDDSTVSILPYRTPDSADDLLLRPGVLNEIRVEASAAAFVSGRCKGRYC*
Ga0105532_128005 | Ga0105532_1280051 | F095494 | ETQRRLLNIIQAEAARTDAPTDETKIHTCMDLLERLQGEQKPIAPARVDALRQHIAAAHQKNERKRQKRKKIIAAAACSAAAIVVAFAVSHPLLWYENWTTSDEQQHFVTSHEIAIEMLETAVADPTLPSGDTVEVQSIAALDALIGRKTGIPEMVNGQWELQHRYVNFTRSGISISLMYVNAADAQQTIVGVINLISNPQYMMLSFEQSYEGTIQQFDGLNFYITENINKPVALWQGHDQTLLFFRRTSPPPAAAL
Ga0105532_132126 | Ga0105532_1321261 | F089005 | KKCLTKSKQCSIIALALLRLATSNEESKQALKVRRTLKIEQRDNSKETRNDFEESSKNYSEMYTKKHQE*
Ga0105532_134519 | Ga0105532_1345191 | F077319 | MQENTVCAANCTVQEKRGGQKIMLIRNATLQMSATERTCMDVRVMNGCVWEMGAALVKGLYESETDLCGDVLMPGRMLETPIPAADEKALRLLCRRLYREGVRYFVADCPADALLRVQNHPERRGALPVTALPTPEPLRSGTGMPLTRWTAAGGVVCGWVLYCRRGRSARGEALRRS
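
For downstream tools, a sequence row can be rewritten as a FASTA record. The sketch below is illustrative only: the function `to_fasta` and the header layout (`>protein_id family=...`) are assumptions, not an NMPFamsDB export format. It also assumes the trailing `*` seen on most sequences marks the translated stop codon, and removes it.

```python
def to_fasta(protein_id, family, sequence, width=60):
    """Format one sequence row as a FASTA record (hypothetical header layout)."""
    # Assumption: a trailing '*' denotes the stop codon and is not a residue.
    residues = sequence.rstrip("*")
    # Wrap the residues into fixed-width lines, as is customary for FASTA files.
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return f">{protein_id} family={family}\n" + "\n".join(lines) + "\n"


# Example row taken from the table above (scaffold Ga0105532_104007).
record = to_fasta(
    "Ga0105532_1040075",
    "F026592",
    "RTASPQGIAALAAQGGVATLTERSDATFSVMQFSSADGE*",
)
print(record, end="")
```

Rows without a trailing `*` pass through unchanged apart from line wrapping.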

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice