NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Scaffold Ga0104787_100042

Scaffold Ga0104787_100042


Overview

Basic Information
Taxon OID3300007361 Open in IMG/M
Scaffold IDGa0104787_100042 Open in IMG/M
Source Dataset NameHuman stool microbial communities from NIH, USA - visit 2, subject 158337416 reassembly
Source Dataset CategoryMetagenome
Source Dataset Use PolicyOpen
Sequencing CenterBaylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis
Sequencing StatusPermanent Draft

Scaffold Components
Scaffold Length (bps)153624
Total Scaffold Genes207 (view)
Total Scaffold Genes with Ribosome Binding Sites (RBS)77 (37.20%)
Novel Protein Genes19 (view)
Novel Protein Genes with Ribosome Binding Sites (RBS)8 (42.11%)
Associated Families19

Taxonomy
All Organisms → Viruses → Duplodnaviria → Heunggongvirae(Source: IMG-VR)

Ecosystem & Geography

Source Dataset Ecosystem
Host-Associated → Human → Digestive System → Large Intestine → Fecal → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase

Source Dataset Sampling Location
Location NameUSA: Maryland: Natonal Institute of Health
CoordinatesLat. (o)39.0042816Long. (o)-77.1012173Alt. (m)Depth (m)
Location on Map
Zoom:    Powered by OpenStreetMap ©

Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F042095Metagenome159N
F050794Metagenome145N
F057001Metagenome137Y
F058555Metagenome135N
F064725Metagenome128N
F075481Metagenome119N
F076653Metagenome118N
F077313Metagenome117N
F078005Metagenome117N
F078006Metagenome117N
F083451Metagenome113N
F083452Metagenome113N
F085718Metagenome111N
F089590Metagenome109N
F089591Metagenome109N
F089592Metagenome109N
F093883Metagenome106N
F098313Metagenome104N
F106193Metagenome100N

Sequences

Protein IDFamilyRBSSequence
Ga0104787_100042112F078005N/AMKKNEFVKELERIIDMVKAEDDGFEYGGKVIFYKEYDDNYEISVKNIEMNLMVEANTMASMNDRTFACLMSEVYKQKFTKAVTISEDEDDEDN*
Ga0104787_100042133F083451N/AMIMDKNEREKQVLDLLMSRKDIRKLVEKSNECYSKMDFVGAMKYRQEIKDIVDRESKIMLTKSESLIGLMNNADNEYKFNMLVWLHSMMCMADVFNGILEDFKDGVRKANGNSKFVKFDNLDRLMAECKKEIDYLMKGTSKSFQISFAVRSDEMREMIENMVGDNIREGYDVFSKEAEMVNETDRDKIEEFDKSLAHE*
Ga0104787_100042142F076653AGGMTIRDKYFGWKDIFFGRFVHCCNEKSDQPQGSNIPLAKINFDNRTGYVEDGTINIAELLQYLWINNKVYRCEYAPIDISSVLQTLIRLTENAKFIFDDQPGIHDMIPYRGFFLRDDFLPGKDYSLDLDKIVSGMGGWYGEDEDPCYSMFVSQDQIWNLNPILKVLADEGSILAKELGYDMNSYVSDNGYTIYNPYLSWINHYYHYCPTFNEDKLKPWDRVEDRKNKFKMTDKVKRGANNWYYSGGTISCVDSFLGKKYRKNLRTFIYRGIVFFLDRIWHTPLFEKMGVKMKYNAYYCYAATSGIWYDKGFKERLAKRFNRSLSGGGEPFGANLACMVCDRKDIDWEALRFWLEKYDDPTDKGMVNSPIQFMYLYLYYTFNK*
Ga0104787_10004216F106193N/AMGCNTCKEKALKAERERIERSMMNRPSSTVVSDMEYASRSTAGCMVMQDPLQTMERDVVSIYKQVRTKGDGVGVSYLNMQKKIREWIKNLPYGCPPDEEVQEMRKEILDGRAEHIKP*
Ga0104787_100042174F098313AGGMREKKFDFVIYPLDLIITVGLDYKTLCDRFENMEPEHEGKWGDEDDMDKEASFVNLVRDRDDDDKFAILWNFSSDDDLIMRNICHESFHIAMSVCQFCNMSLGFKVGEDEHAAYIAGFAGDCVSEFINSKNTD*
Ga0104787_100042188F050794AGGMMLYTSNYFVKVRQKQLLDHTYSLSRDQAFDYMTEFNKRLSDKVGIKCTMDILLPTDDDNANIIIEHNGIIKKLMKEAEKLELDTDAIEAMMRDLLDELKDDIDLNILIFDVSQLLIKYNLFRLDAITEQEFKNSFVRMDSRNMEIKKLTLSDIKKVVEMIEDRYSYALYMTEEYG*
Ga0104787_100042191F077313N/AMSKYVIKRKIPKYQEAGEVGSYMLGNMDGIQGLGIEPLVNTNQGLPAPVNPLGIYSLDTPDQLRTKYANAFDQDNVFPASFKGSLQRIAENYQDNGITLNNITVNDVDKSKTGSGETDVFDFTTIPYYGADDIGSRFTQMGRGIGRMRSEGYGDLSTGAKTANTITTIASGISGIMGLARNVVSGIASEKGTRTNIRLAQEREARQRRQSQMQYKDGGGVYLGPNNRFDSGSLTGEYLYPLPKSMEDQANVEVEKGEYVEQPGEAPMEAMGQKHADGGTPVSLEQGTEVITDDTIIEPDFAKYIRDTYGIKATPKDTYATLMDRYKVKIGLKSAYDDQKKALEKLKKNDKIDDENTRRLNASVLSKAINDSNDTVNGLEGRFTDFANVIYKEQEDRKMKKDEDTYFAKGGEIDNIISRSMKEYGLTEEDIAEAKKELLKKVAGIRQKMEIGGTSLFGRKLTFRPIENRFNNDPNYFGYQRQGTDGSYGGINTDERLNYYKTFNPVAYDAYMGASEGARARALQDAIYGQTSSWMGLATAENPIIANAEALRDYTTLVSFGGEDSQGNYPEDKKAAYHDRMRDNKLGLFTTSRPMIGLDVVTEEQHKALNDAGITHFSQLFSDKNKDVVNKILGEDMLKMQALRSMKGMEGLDFILDPHKVAPGPMDIGDVEEPDVKLDMPELIDPNTLPKTNTNAGKSNSGNGGRNIVGGGLDFPEVFRMTPGAVTTEGLERHYAPTVDPVLRSADQYMVEANRAFQSQLDQMGNVPDSQRGALSSNLQAIMSSNIGRYINEVEQGNVAQRTWADNVNARTWTDTYDKNIAQRQGYQSRILQALANTDENWARYFDSVNDEIQQKWNTATTMNTLRSIFGDVKIGPNGQLIADPQGDILSYRRLYPAQEVTKGKKG*
Ga0104787_100042200F064725N/AMTIKGLLAEIKADLHKYDDSGAIDTSSVYRWAEIALKRFGGVIAVMSEAVVKTSNKQAVLPSDFFDMLDAYRCEPLVCEIPGGDKAKADLQHEIGWVERTERGFRWNSCTECCKEEFEKTITEKIYIGSHEVRFHYHHPVRLSIGRGLRRDCAADKYRDKHDWDNYDITISGNIMYTGFDGFIYIIYRATPKDDDGLPYIPETALGYLEDYVETYIKMKIFENAAVNGLVQGAGDAYKLYAQQEPGKFARAMKELKMSMITLNDYRELAEDNRRRMLSYERMWPNAFDKYIKLV*
Ga0104787_10004222F075481AGAAGMIRLRISLKAVFCLGLSLFLSSCGSRRQVSDTSIDNRLISRIETMIDEVMDRKIVEIRTSDLNADIVITERKFDTTKDVDPSTGERPVSSQTDAHIVIGRLDSTVTVDSLGIDKTITGVKDIDKKTDIEHKDVDDKKESRWPIVWIVAGILMILLVLVYILKKIKIL*
Ga0104787_10004223F089590N/AMKDKDMIERVGALWNIALAYGASCWAYFQPVHHLLTVLLIVLIANFLARLAQSVRGWKLRRSRRRRFSFKRWFREVRFTDILKEFALSCFIVMTLCVIYKTLYPIEEEASMILTVTKYGVYIALVGYVMLFLNTIGDAFTDAYLVKVFKAVFKRINVFKMFGFSKNIPDEMFDDIKKIADDKVKDKS*
Ga0104787_10004226F042095N/AMAKTLYKYEASSNKFVWFTTWDRALRNYYTDDYNYVPDPVVGNPYNTFVEFRSRKPGMANVDWGDGIKEQFPMTKVQGEDNYRIIFRSLAIQHKKNPNTTWWFRKEDGSQYVPVDNHAYADGRRDVQRAVSIDFTCDIYYANIQVCKMTSFPIVDIPGLEFLVVSHTLYVNDRIPVDKLSRSKKLIYIDLQNIGQRMTVIPEAITSKTEVYYLNMFNMLDLRDIESSGIRNIKNMKNLQTLELSSCYLDRYIKEFNDLPKLTSLRIHPGPSDMWNYFDINTLPFFEVDKINPNITNFDFLNDWVSGERRTGWNDDNMSGRGLDHLTGFFVYHSNSIRVDKLPDYIYEMRSITWFVMDYSTHSQKRSDDFVNSFYDLVVGWDQITMASVAKDGERNQFYGLAVSMYGSQYPDENQRPSGTEQAPEGFVKGSSNGSPATPMEKIYVLKNNYAQRWTIKPE*
Ga0104787_10004232F089591AGGMTMTIQEAYLRSLQKNEQNLANGGIKLDPGRFVLLFNEAQDRLIRYYLNRKDDETIRSIQTLLVYWKSLNKINHIDDPESTSFGLPDDYLWFSNIKGAFSYNGCEVGDFVMWEAKNENVHELLGDDNNRPSFDYRETFYTIGDGKVVVYEDGFRTDEVRMTYYRNPVRVDLAGYINAAGERSTDIDPELPDPLVEEILDMVAKQFNLNENELSRYRMDKDNVASFK*
Ga0104787_10004233F089592N/AMKEILKSKKVLVEVNGFNIMSDTLYEVVGKHDGSAPQAFQDANIAKAPFPENATHVCCPWDDFSEVYNTGFYPRSRCYNGMDKDEVDKLVDQRVNNIMKPFENISQKDLSQTNFEFWDDAKDKIYMGKVYNTANTVELFYLYLAVFSGMLTPQEMDGDPIFMNSMFCFIEKDNAKDFVQQREINKMNISYKFIDALKKGGKERQAVIDLLLYIGIVTRPDFTEDDYYTGSLSNWMNEKKTNIDYLLDIWDRSLEGDFKEVLEFYRIINVLQRNGRINMTPSGLQYNGQIIGPDTRTSAEFLATKKDLISVKANVLDEYEELMSISNIDDKTKKVKDVKKKEDVGEGDKVNTEE*
Ga0104787_10004240F093883N/AMEDKDIKTEIRDYLKEEADTHIRHWIAIKRESKRLYRDIEDRTKKIALKSSSLIKEEDFVVLHEMTHKIQMLNIEAVKVNSRLMFIIQLATSFGMDLDLDTTYASTAKSIIEDRTSGFVFYDDKERLRYADKELEDMFHDMSVTEVSKIGVVQSYELLMKQYNEFKDIIKNNINSI*
Ga0104787_10004246F058555N/AMGTKIRFLHIMKSNFDKILTERYIPRNIQTKKDELGCVKLPAGSLICPVDFKPITNKEGKKVTAIKYSLKHEEYHGSGIQISDECKMAMIYLIIINVSKHVFLRKRMQDGNRDQIEINTNDFIDILSDGCAYFCYRHVLRDSHEDMNYQLISLKAWAEGEIMIALSDIIKYKHKASKTPRIKDMFVKKGESVYTCLDKNLDSNTRRRMANKSRKLNRVKMLSKIIFSARNRNINKIYKVTKKRTVKFNVSYLMDRLNIKLSKEGMMLISQRTVYRMIKEVLSMCCKTISDLYDEVKKNNGIVNTKDRKNVTIGHLRLSYRGTIMHIIIAEYFIKDVFLGVKGVEMSKAG*
Ga0104787_10004263F083452GGAGGMSGRVKIKIKDKKPKIDVFKVIESRFKNMNELRDLIDMDPRKGLVRIRDGAGFREVERSGCLHRNYLNLLEEELGAKLSIDLIERYIKR*
Ga0104787_10004272F057001AGGMISKEINKVQNEIKKSNEKTLIGAVKAWCNLFKSGKEINDILKENDIKVSKEVVPALVALAKDKEVVIQLCKEILPRVNNTFCAYKEVEREYYDKNEQDKNKKLKMSEIEDIAILGSSHKRFGYNEPIEYDFGIYYETFNGADKRIIKCAVPIKRYTFNLIAKCVTYYLTHPKNDR*
Ga0104787_10004284F078006N/AMGDNILRKAADELKKAGCRVFAWQDDTYNRGWSKGDYTMLYYAFPDSPNIGYLSHGEYGMSVAYSRAYIPSCGSGSGCCVKEEATFDLEAALDVLNGPLPRWCRSYGVYPKQYDNIDKWYNSDNHNKKLFKEI*
Ga0104787_10004286F085718AGGAGMVIEFDFEIYKNGDYDKVYLRNGKEPRVLCDNGKGDRPIVVMVEDDNANDYIILRYNETGRRDINGKSSLDLMLSVKEREPELWVVVISYMDNKDKRQKMVLPNFFSRNIGGNIYLQGSSKSNVSYYVGRLEEDGCFDELCEKIRVKRDRIYNMEIISLSDDKTTV*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.