NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 7000000289

7000000289: Human stool microbial communities from NIH, USA - visit 2, subject 159551223



Overview

Basic Information
IMG/M Taxon OID7000000289 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0063646 | Gp0052812 | Ga0030561
Sample NameHuman stool microbial communities from NIH, USA - visit 2, subject 159551223
Sequencing StatusPermanent Draft
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size151857691
Sequencing Scaffolds21
Novel Protein Genes21
Associated Families21

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8
All Organisms → Viruses → Predicted Viral1
Not Available2
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium6
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameHuman Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase
TypeHost-Associated
TaxonomyHost-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Alternative Ecosystem Assignments
Environment Ontology (ENVO)Unclassified
Earth Microbiome Project Ontology (EMPO)Host-associated → Animal → Animal surface

Location Information
LocationUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F039147Metagenome164N
F041208Metagenome160N
F042936Metagenome157N
F044554Metagenome154N
F051934Metagenome143N
F055739Metagenome138N
F057385Metagenome136N
F059982Metagenome133N
F067720Metagenome125Y
F068856Metagenome124N
F074964Metagenome119N
F077319Metagenome117N
F077320Metagenome117N
F078003Metagenome117N
F082714Metagenome113N
F088914Metagenome109N
F090484Metagenome108N
F096287Metagenome105N
F099406Metagenome103N
F101191Metagenome102N
F101192Metagenome102N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
C2301003All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales777Open in IMG/M
C2309319All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales833Open in IMG/M
C2334452All Organisms → Viruses → Predicted Viral1064Open in IMG/M
C2352914All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales1343Open in IMG/M
C2369260Not Available1744Open in IMG/M
C2394235All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales3821Open in IMG/M
C2395227All Organisms → cellular organisms → Bacteria4076Open in IMG/M
C2401528All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes13064Open in IMG/M
SRS024331_LANL_scaffold_10851All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium12089Open in IMG/M
SRS024331_LANL_scaffold_1574All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales6708Open in IMG/M
SRS024331_LANL_scaffold_16768All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium8004Open in IMG/M
SRS024331_LANL_scaffold_3350All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium16265Open in IMG/M
SRS024331_LANL_scaffold_40060All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales8674Open in IMG/M
SRS024331_LANL_scaffold_43362All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales5437Open in IMG/M
SRS024331_LANL_scaffold_46646All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium9633Open in IMG/M
SRS024331_LANL_scaffold_472All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales21691Open in IMG/M
SRS024331_LANL_scaffold_48572All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Oscillospiraceae4326Open in IMG/M
SRS024331_LANL_scaffold_48785All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales5654Open in IMG/M
SRS024331_LANL_scaffold_49891All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium8966Open in IMG/M
SRS024331_LANL_scaffold_50289All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → unclassified Eubacteriales → Clostridiales bacterium7435Open in IMG/M
SRS024331_LANL_scaffold_50596Not Available22605Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
C2301003C2301003__gene_183120F068856VGMALAEETPDAALGDWYALNVDGTTECLTLREDGTFCYDSREGTWRKTMDGEYWLTYNSHDLPEVMERMVNSQAAEQDLTALLTETGLDVYYGSTAKGVVAHMVRDAEELQNVRTPKTDTPLEAFAGTWTMETVFAGAMEMTYTLDKGERLAFCTIDGLTMFPGAGLESFPEGTSFPLTFEDGVLRTTIPLTVQMAASSALVKEIVVDYDLTFFQTADGSLYATLRLSDVPDNPTTMFLLVPMEKE
C2309319C2309319__gene_186148F088914CSGFFVFWSRKEGANYQISVKSFSNLDSYDIIQLYIMTELTDGRVSDSPFSVPYTPKENIKNPGKFIVFSAWYAAKALEMDVK
C2334452C2334452__gene_195364F067720MASTTYRHLGDVTGMFAAQEQFRDITKMVCARFRGLTKTYHLGNVNKLVTFCHRFAVIGNMVRNAGQLPQPFWLGA
C2352914C2352914__gene_202460F078003ALEMDAVGAVEYKSRQALMQLVGAFSEPISPAKVCYLDALGAEKEDSFLHYLQVLFCGALKVNALLFGCCDLGLFRQLQEAIHPVAQRVIVRIDHFIYAGEDEIVSNLVGIQPMVDQPCYHAYLVDAENCPQERLAHYRTGQMTFHVMHQDGSESTVALFLLGKNEFTATVMSTQDLWMSRKVLCDRERFSPIRLLWQLNDENSDFIVYTQQYCKMEHGAAIYYIRPDVPFQYIPLEVLYPVVREGFARMGMTREEYEPNVTALAEIHQARMTNMMRRRRPTYIVLNKAAMEEFVRTGRQSDHLHFLRNFTPKERKQILDVLVQQARENPFFHLYFAQEKLPYTMGEVALYDGRALITMSSGSGYDLRTDHRESCITHPFVLRAYKQFYMNTVVARLADTQTGSLNQLEELVRRCEKMIREGQKGNAGNEQEKLSGQ
C2369260C2369260__gene_209158F044554FIPVLCASIARLFPCRTEIARCLTLDFAIGRYLFLSFSFSFRTNFAQALFSSLLFVSDTLAKSILFLLFENEIAHLQGQYRFNSHRYCFSAFLVL
C2394235C2394235__gene_220803F101192MQTEPSDKEGRNMTYWGWLLVGIALLLTALSGTQTSLCQRMRSVPVYRGLTYWEVQQLAKAAPTSITPCGEDTIRRWVSGRYQIALRFNRYDVCLGVEEEIDG
C2395227C2395227__gene_221382F082714MVVGIRFGADAPVRTVLQAVLPIFSTVDVDFLVREYWVCTFGNGLPERRFTAQEMRRAVDALTPDEHAELFTIYALPHDAPDTPPSSCEDFCARGFTMAFYAYDGDGYALLAQSEEQVEAAFEALTTAGLCEKVEEVEKETLAHWGF
C2401528C2401528__gene_224940F090484FSDGSMSKIMKLQLGRNINISLRLLEQWSDDLLFMELYALYCMIKISRRDSRIRFKNQKDLLHKLGIGYSKFKNMTGHPMFDELFRMTDSTFVARRYRVNGVQLTLGCGKVNIPKNRILIKIKKNEITNHEKVLDRIREAMFVNLVRNNESVLNSGETNSQADVVDGSHSYYGLIDSTISNKTIALYLNVGLTKAKEIVGMAIQDKLVKRFENIQFITYVDNPRAYIEANEHNYPIGKLIPVYRHGAVFWQIANTWTLYKKGATNRWYFGEKDIEKGEKEKVSKKDDFNFFLKDNTHILRFLNAEEVVSEDGEILGIDRKKTKEEEARSLASVMAKEAHKDFWEGYERSTQNQIIRKYYRAIIAEDKKRRMDMFLNRLKQSYDKVSGWSKEKVATVKAGMADAEACCAEVGTSVAGVCGRVSRRMKSYNNTAPDKKAGFNEVRDMYAEFAGEMAKAVGSVSEDIYTYVKAEQFKEKIENMDISIQSLPNINTTVDNDKELDGESVFKDIPFEELSFYNDTYLYPISQYSSL
SRS024331_LANL_scaffold_10851SRS024331_LANL_scaffold_10851__gene_16744F039147MRARRLLILLMMLLLLPQAQAERLTLYTRPNNVDEAMPFQLRPTELSICSVTRAMGGVVVLANDNNYDSLSLYFWQDGMTEMRKLGGGFYWVMSSDTMETAQESCEYAMSRVPNYRMPDLTHAISNLTSDGETLYALNRINGLIFKISETKDGLQTEDVCTMANLSCLNVSYRDLETDKVYTYPASLTRMYVCGSVLAISVMQENGIKVVLVDLTDGAIREIADESLEAMYEWADGELLLWRLEGSPNEISRSSGTYALSRYSVATGEETLLSTGVPYKKRSECGAYDPYSGSYYDVRTRQIVRTTDFVQEDPVVTFPAANVNIAVTKDSIVGVNLSSVYVRSKENGDMTVLRIQSSNGASNTALQHFAEENPKVILAQETLAKSAMNAASLAARMSASADAPDILRLGLTPDTPEADGSWPLDVLMDKGWCMDLSVYPEVSDYVSRLNGIYRDAVTRDGKIYALPIYAWSYGYFISRNVMEKLGLQESDIPTNLIDLCAFITKWNDNLTGAYAAYTPLEETESYRERVFDLMVRDWIGYCQAENIPLRFDHPVFREMMAALDAMRTDKIEQVNQQVNEEISDYRECLIWTDAQAVGNFANYADAFGSRIFLPMALTPDVTTHYGIGYMTVLVVNPRTTNADLVGKLLAQVIADQEATAKCVLLADYDEPIEDSYYLTMVSGYEETLTELRRQQENAPVWKKQGIQERINEEEASLQRYTVRERWTIAPKTIELYQQTILPMSYLRRPGILADSDAFNALVSQVHQGEISLEEFVEEADKLIEGLEQ
SRS024331_LANL_scaffold_1574SRS024331_LANL_scaffold_1574__gene_1579F059982MTMRRMTRLLCLMLLFSLITFSCPLAEGTDAPTGTPAPTPMLTAVPESALAPFNVVLPEDAHVEMAEGRITLVRGDSRVVAMVISRVPDEDPAAALPILMQDFDPKTSETMDFDAQPGFCILGGVVNDAFGDGEDKITLMVLADSGELLILSGYNLARDHHALYLFLTELLENASMDGAAVYVKEEATETASPEV
SRS024331_LANL_scaffold_16768SRS024331_LANL_scaffold_16768__gene_27324F041208MRKILAMLLSVMLLLTAAVAETSAPYAPETVLPLVANNRPYRDFARGAISSSEDAIEQAKAWLYLPLFVTNFSAKSEAVDWSAMHAEEGWYVTAYMRNTFVWLLMDEQGRVQAYDFDLLGNASLTYDGALPDNLDEAISSYIRRFADLNGFVKVADYAREEVTTFGDYAVAVTVQVTLDGTPYRFTMRLDMMAFTSVENANLHPTAAQTQRDILLLMRDNLAEKGVDVAQTFFAVQADDADGETKLTGIASFPADAASDAIREQYGELERYTLHYSGRASEAGIERVELSDWQETTATAQFPLALYALVDGKYLVPDGELAAGTAYLALDSMGLRGRNVIPLESITMLTRIRYARADGTFAESWVSSDTLTENDAAPAAPKREPIPTLESYQITLNGTAYTAFAINKVEKGYDAFADIAGTQTAVVDVLTSAAQGVIAEYGVDASDLLCRTVVEYGYRADKGCWQVDFTIPQRDMADDAYEVEVDDKDGKVTGLWGPQDGNG
SRS024331_LANL_scaffold_3350SRS024331_LANL_scaffold_3350__gene_4368F077319MQENAVCAANCTVQEKRGGQKIMLIRNATLQMSATERACMDVRVMNGCVWEMGAALVKGLYESETDLCGDVLMPGRILETPVPAADEKALRLLCRRLYREGVRYFVADCPADALLRVQNRPERRGALPVTALPNPEPLRSGTGMPLTRWTAAGEFVGMMDEHSGD
SRS024331_LANL_scaffold_40060SRS024331_LANL_scaffold_40060__gene_72978F057385MKKLLAVLLSIMMLAMPLTSMAENSVWDNAARQETTITIHDLNADLVAALGGDDTAMAAINDLLAALSLTGYQQGDEAGFDLNLSGKSVLGMASLTTAAEENQLMYVSSALLGGVIAVNSKDVEAIKEKALRATMKMSGQSDEEIDKAIEESKEQLSGNAEYTALMEAAANLSSMTEEQLMEELTQADTTAFMTMMNEILSGEEMAEVTEQPGDCDAAKSYVKVTVPPEKLAEMTKALLEMIHSVPSIGAYMDALFSAADTSWDDLLKELDEADLYADDIVYEYWMTEAGELVRMTASVKINNGGEEPLPMSFTATRNTADGVATWLVTIKSAEDTAATLTFAGDLENFTANLTAYAGEDTVEINVSGKGVGTDSSVVDVEIKETVDGVEQGFGVVVTTFATMDGEQGVRKVDVLVRFMGLDVVTITAETRTCDAKDALDVSKAQDLGAMTDSEFQTWFVKVMNNLQNLPMTLLMSLPESMLTLLMGGSN
SRS024331_LANL_scaffold_43362SRS024331_LANL_scaffold_43362__gene_80989F099406MAEVGYNSKLEGQEVDSRLENVVQAAPGTSSESGKGGLIPAPPAGSQDGSKTLLSNMTWGDYVNKKYIDDAVSAAGWKKQIVSKLPTVEEAKDNVMYLVKDDVASTETKNVYNEYILVTEEGGTKVLESLGMVSTGVDSSYLDLSIFPSTSGTLDEDSYAKVLNAYNNNITLGKLSFYYFSLDYFLDNDNSELKIIAVLFNNTNSKEDVSGSYIDIDMVTYVVSQDKTYRAIANTATLSNDMLSYLKFMAKTPNVVTTLASLPIDAHNIIANVASATNLSMAVSAEDVGREWQVRVNNTTGTDITQPLPTSGLFQSMSGNSVVVPKNSFIELSIWYINDKLVIRVGEQA
SRS024331_LANL_scaffold_46646SRS024331_LANL_scaffold_46646__gene_89645F096287MQRIQKRRAGQRQTQNALIGLLSAVSMTRVGLTQLLPLCGSAAWWLSAACMLPGLCVYGAFRLLLRRAHTRTLTDCARKLLGNFGGILIMLTLILPLLLDGAASLTALITFFTEGIGARGSQFTLTLLTASVMLIALNRDGLPRGVYLLRHVLLVAATIIAINALLDAHPDGIVPLLGEGVPPLLSGIRSAWGMSWMLLLLLEFPAEEGARRTPAMLAGLLPCPVILLLLSLSIPPELTVPGRSLASRLALPTLFLQPAVRTLAQCLLMMTLFLSIAGSAQLAARFLTSSCQKPKKWVPYALIGLLTLTQLFDISRLWRVLTALTAWSLVPGVLLLLVLTIARLCRREKA
SRS024331_LANL_scaffold_472SRS024331_LANL_scaffold_472__gene_741F077320MKRKGMHRRRKLVLLAVLLIMVGIAVWRIWQTPRPTVLHRQDAWLSSAEPEMVDTLPDNAFTAELPEEIDGRLLYIRDYSASGHYQIVWKNVSAEAAESYLTALLDKGFTRLMGTAEDIASGVLLARGDLTLSISLSGGTLNMLMTRAEEPTPTPLP
SRS024331_LANL_scaffold_48572SRS024331_LANL_scaffold_48572__gene_95825F042936MGRGGGKGRVWKTKEGIMKTSGIVDRGEDTIRNFEKVEQEGALTPPYLGKVYFRLQLRGK
SRS024331_LANL_scaffold_48785SRS024331_LANL_scaffold_48785__gene_96560F101191LLISHHPASAECLQLGEGMLLCGFDLDKALSSRDPLDCMAEAVADDTKRIGTTCGGGIFRVVPREFDPESGSHRLPFAGSIRLIDWRVTLSGTMLDVTPENLARLLPSDTEMTERVTTLTPKQARKPLSRLCWIGTTSRGLLLIELRNPLCISGASLTSVPDGAGRLPFTFLAQNDRPGDVNLPARLYWWKEETHDAA
SRS024331_LANL_scaffold_49891SRS024331_LANL_scaffold_49891__gene_101059F051934MGLMRRLAAAALAAVLLLSTALGERVTFRLSADIDPVQYPAQERKLAQGLKSLFRLLTVEGDVVVSDGSFDARIDLGLTNAPEKTATRIRFFGLDSHWGIQATDLGGETLMVNQLAWLEFAIKAYNHLDLPLQRVFLWLSPYAHTSAWAGLRQEIADLAAQETSGRLENTALITCAEEIARLSEDDRALYYYIEAFGLESGTDANIFDALATLPEYVEANFPDGLSIEHTENGVSWQNGEKTVFSYADADGTQVVSLHLPDLVDFSATLRRDALLFTGALSLQSDVLNADVSFSLPASYPVTLPFYAQIDADGMMTGDDGIHLAFEGEAQGDTITIRRIQPDHSATMMTLTVKVIQVAEGTVKYAPEDVQGTNVLSVDGPALAELMGRIGKPMVRKVTQWLAALPAELVQGAMDAMEDSGLLGTLTDAILNGEAGEY
SRS024331_LANL_scaffold_50289SRS024331_LANL_scaffold_50289__gene_102833F055739MTEQQLREKLQFAYGNMPDATRAAFEHSLTHHRAPETHRNIGLSRMMRIVITAVLMALMLTAVGVAAARFFSVTDVHPAQDGTEGDYQAHYLALEERYDSDLLSVSVNDAVYDGSVLAFTMEMAAKTDDVLAVEVRICGECDGKMYRFDPLDVYGGEFQSLLMLPDLGGTFDGEKYAAEGILFDENGQMPLEGKPIAWTIEIDVLKAVWQTETMPDDLYEALSEEDDVAQYIREQAARHIITLTDAGVEDYLLEMCGAGWDEIEQISKADLLLRCGGFERAETYTVAFATEGNTQYVHPELAGLRIPLDGYTAVVDYVRASFLGGCVVLHCEAPNGTALPNGTALPDVWRIYRNDEQNPAGEAGWVRASGYAGVPGGVVDVNQPSICLYFAPCADLTSLRIVPEGEAGFTLNLSGEKGTE
SRS024331_LANL_scaffold_50596SRS024331_LANL_scaffold_50596__gene_104688F074964MITKKNVNKLQNAVIKENASNLVGAVKLYNALFANGSDLRTICKALEIPAEYAVKVATLAKDKKKLVTVCSQMLPKVGDTFVKFALYSKVYKDNKVDKEKGVEARTAEWCADNVVYGCEYKPFGFSTAEALESDNSAKWLIKESDEYKATCVAVRVKSYSIRTVAKCVSEYLSHESNEQ

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.