NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300005264

3300005264: Hydrothermal sediment microbial communities from Guaymas Basin, California, USA 4572. Combined assembly of Gp0115316 and Gp0146562



Overview

Basic Information
IMG/M Taxon OID: 3300005264 (Open in IMG/M)
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114726 | Gp0115316 | Ga0073581
Sample Name: Hydrothermal sediment microbial communities from Guaymas Basin, California, USA 4572. Combined assembly of Gp0115316 and Gp0146562
Sequencing Status: Permanent Draft
Sequencing Center: University of Texas, Austin
Published? N
Use Policy: Open

Dataset Contents
Total Genome Size: 291954024
Sequencing Scaffolds: 32
Novel Protein Genes: 54
Associated Families: 51

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 11
All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota | 1
All Organisms → cellular organisms → Archaea | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina | 2
All Organisms → cellular organisms → Bacteria → FCB group | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae | 2
All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → unclassified Candidatus Omnitrophica → Candidatus Omnitrophica bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1
All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Hydrothermal Sediment Microbial Communities From Guaymas Basin, California, Usa
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Unclassified → Unclassified → Sediment → Hydrothermal Sediment Microbial Communities From Guaymas Basin, California, Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO): Unclassified
Earth Microbiome Project Ontology (EMPO): Unclassified

Location Information
Location: Guaymas Basin
Coordinates: Lat. (°) 27.013056 | Long. (°) -111.519722 | Alt. (m) N/A | Depth (m) 0 to 0.12


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F005243 | Metagenome / Metatranscriptome | 407 | Y
F007460 | Metagenome | 350 | N
F008501 | Metagenome | 332 | Y
F013049 | Metagenome / Metatranscriptome | 275 | Y
F014691 | Metagenome | 260 | N
F015290 | Metagenome | 255 | N
F015411 | Metagenome | 255 | Y
F018717 | Metagenome / Metatranscriptome | 233 | Y
F019588 | Metagenome | 228 | N
F019935 | Metagenome | 226 | Y
F023017 | Metagenome | 211 | Y
F024453 | Metagenome | 205 | Y
F025034 | Metagenome | 203 | Y
F028664 | Metagenome | 191 | Y
F029623 | Metagenome | 187 | Y
F029954 | Metagenome | 186 | N
F035145 | Metagenome | 172 | Y
F036294 | Metagenome / Metatranscriptome | 170 | Y
F039146 | Metagenome / Metatranscriptome | 164 | N
F040421 | Metagenome | 161 | Y
F040567 | Metagenome / Metatranscriptome | 161 | Y
F040581 | Metagenome / Metatranscriptome | 161 | N
F042112 | Metagenome | 158 | N
F042625 | Metagenome / Metatranscriptome | 158 | Y
F044793 | Metagenome | 154 | N
F048391 | Metagenome / Metatranscriptome | 148 | Y
F049343 | Metagenome | 146 | Y
F050065 | Metagenome | 145 | Y
F050310 | Metagenome | 145 | Y
F056226 | Metagenome | 137 | N
F056961 | Metagenome | 137 | N
F060609 | Metagenome / Metatranscriptome | 132 | Y
F060924 | Metagenome / Metatranscriptome | 132 | Y
F062246 | Metagenome | 131 | Y
F063346 | Metagenome | 129 | N
F066330 | Metagenome | 126 | N
F068117 | Metagenome / Metatranscriptome | 125 | N
F068714 | Metagenome | 124 | N
F070560 | Metagenome / Metatranscriptome | 123 | N
F072848 | Metagenome / Metatranscriptome | 121 | Y
F072934 | Metagenome | 120 | N
F076639 | Metagenome | 118 | Y
F078539 | Metagenome | 116 | Y
F085719 | Metagenome / Metatranscriptome | 111 | N
F092080 | Metagenome | 107 | Y
F092871 | Metagenome | 107 | N
F094909 | Metagenome | 105 | Y
F097367 | Metagenome | 104 | Y
F098044 | Metagenome / Metatranscriptome | 104 | Y
F099975 | Metagenome / Metatranscriptome | 103 | Y
F102489 | Metagenome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length | IMG/M Link
Ga0073581_100056 | Not Available | 45550 | Open in IMG/M
Ga0073581_100619 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota | 21185 | Open in IMG/M
Ga0073581_101090 | All Organisms → cellular organisms → Archaea | 18844 | Open in IMG/M
Ga0073581_101251 | Not Available | 16236 | Open in IMG/M
Ga0073581_101395 | Not Available | 19503 | Open in IMG/M
Ga0073581_101831 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina | 14029 | Open in IMG/M
Ga0073581_101844 | All Organisms → cellular organisms → Bacteria → FCB group | 14726 | Open in IMG/M
Ga0073581_101930 | Not Available | 14750 | Open in IMG/M
Ga0073581_102055 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 71834 | Open in IMG/M
Ga0073581_102336 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales | 14890 | Open in IMG/M
Ga0073581_102347 | All Organisms → cellular organisms → Archaea | 23785 | Open in IMG/M
Ga0073581_102540 | All Organisms → cellular organisms → Bacteria | 32572 | Open in IMG/M
Ga0073581_103334 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae | 27755 | Open in IMG/M
Ga0073581_103733 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae | 17570 | Open in IMG/M
Ga0073581_105188 | Not Available | 19798 | Open in IMG/M
Ga0073581_105890 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → unclassified Candidatus Omnitrophica → Candidatus Omnitrophica bacterium | 8549 | Open in IMG/M
Ga0073581_106224 | Not Available | 9187 | Open in IMG/M
Ga0073581_109247 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7297 | Open in IMG/M
Ga0073581_110368 | Not Available | 6510 | Open in IMG/M
Ga0073581_111381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Desulfobacterales → Desulfobacteraceae → Desulfosarcina | 10648 | Open in IMG/M
Ga0073581_112568 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 5917 | Open in IMG/M
Ga0073581_112901 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 5844 | Open in IMG/M
Ga0073581_113972 | All Organisms → cellular organisms → Bacteria | 5606 | Open in IMG/M
Ga0073581_114535 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Lactobacillales → Streptococcaceae → Streptococcus | 5693 | Open in IMG/M
Ga0073581_115009 | Not Available | 5399 | Open in IMG/M
Ga0073581_115482 | All Organisms → cellular organisms → Archaea | 5311 | Open in IMG/M
Ga0073581_115551 | Not Available | 9654 | Open in IMG/M
Ga0073581_116653 | Not Available | 5105 | Open in IMG/M
Ga0073581_118754 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 13346 | Open in IMG/M
Ga0073581_120318 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 11068 | Open in IMG/M
Ga0073581_120816 | Not Available | 6420 | Open in IMG/M
Ga0073581_123682 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 9383 | Open in IMG/M

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0073581_100056 | Ga0073581_1000568 | F008501 | MIFCVQDRGGPAHCHLVLPHVTKMEKYNRMEFKLKNSSVDLTDLAIGIVVLGIVVSIGATILLNVRDTNTSGDTAYNLADSAAAGLAEYGNWFDIIVIVGVAAVILSLIFMAFGRRGGGSSVTY*
Ga0073581_100056 | Ga0073581_1000569 | F008501 | MEIKLKNSSVDLTDLAIGIVVLGIVVTVGATILLNVRDTNYVVSATCNDTSNIHTGCDSAWTLADSAASGIAEYGNWFDIIVIVGVAAVILSLIFMAFGQRGSGSGMQSY*
Ga0073581_100619 | Ga0073581_10061916 | F062246 | MVLHMERKLRSISGSLVVTIPKQVCDLYGFNDGDMLNIEPIGVGELRLRKK*
Ga0073581_101090 | Ga0073581_10109012 | F056226 | MRAYDILLFLVCLEASIGFIASIDLFSTTYVDPSAVQLTDWNVQEIQNQSSSPSLLDTAIDGLVKAIPQFLNMLLAIAIVYIPLTQTLGVPTEVALLFQGAVYLIYVWAIIQFLSGRSVKYME*
Ga0073581_101090 | Ga0073581_10109014 | F007460 | MDDEEGIGIGTAIFVVMLNIVFAIVFFFLVGSVMCGILPPLTTIVGIDETTPFGQALTVVPNLVNVFFYLPLIFIFTMFVWLFKYIVKRHKYTYYQYQEEEEM*
Ga0073581_101090 | Ga0073581_10109015 | F015290 | MLNNKGQAGEIAVFVILVFMVAFAYIFISPVVQNIKDVTPTIADATSWTNAQTQAMNWLFRAWYAFPFFAFIALLVWLIKRAIEKRSGEVV*
Ga0073581_101090 | Ga0073581_10109016 | F024453 | VKMEIPERDGIVMYFAILQNLAQRPDLTDLERQALDFASSLLYQSVYGEEISEVGEEVKSVREEGREEDMDKVRSVHELAQVRRANAKQ*
Ga0073581_101090 | Ga0073581_1010902 | F019935 | MPKVKETANRTVVHVGDLWKRYHRASPKTKKRWKFRIKDVGRLRYSELILCKPPNGDWQTYAWSFSKGQVKKGKRKLLIYDAKAFEILQKLKENDDLRGWKLAFKG*
Ga0073581_101090 | Ga0073581_10109020 | F042112 | MTFDEYSSAFPQFEEKYVSKLAYPHLLFIQIQKIMDRIDAGGDGKEELESLKALLKPSWRGEIDVKTERCRREMEREINRIARVKERVGITTYKEMKRAAIVRYVREYVQHVIEKLDEVGLLLIEERGVLRGGGLML*
Ga0073581_101090 | Ga0073581_10109024 | F023017 | MRLKKEKREEMKQKIIEFLKNADGHIASIPMLARAIQASSPTAKSIVFELEVERKVSVVMLGGECVVKLEGG*
Ga0073581_101090 | Ga0073581_10109026 | F029623 | MNTYAKYVRARFERGNLIVEISKDAFKDAETRKEATKELQQIIKTYFSIRFFIRLSIVYAFTALFCACVALLMLLK*
Ga0073581_101090 | Ga0073581_1010905 | F072934 | MVGIDIERLLADEEIREKMNEWNREKAKRRLESMKNLSEPHRRYLEKIASGEWKVW*
Ga0073581_101090 | Ga0073581_1010907 | F019588 | MGEEKRIEEEVRGVLERLKVPYSVEIVIVDDKSGKVVFRWRRGRGNLVKGMRKTIEVLDRKIGIPVKDLLK*
Ga0073581_101090 | Ga0073581_1010908 | F040421 | MRGERVRAITMLTLAFTIGFKSIILFLQHSFGFAFALLLLAWTCICFAFPDQLIGASIGQDLKLGEVIEKWVRRRELKRK*
Ga0073581_101090 | Ga0073581_1010909 | F014691 | MSRGIGFVSAILILIFVVVGVFTLSIAWSEIPVEPQANTSLANTSSIIPALTSFLPYLVLVVILAMAIGVFLSLVRVR*
Ga0073581_101251 | Ga0073581_10125113 | F085719 | MVNETWNGIKYDNYHCNDKDLLDGLDTVSFTKLSEQDMKNEIDHFLDKKQDLLHIQDLVHEARIDWYSNTYPGEYTGD*
Ga0073581_101251 | Ga0073581_10125135 | F060924 | MAQGISIGIGIPRGLTGNAGFPTAFDEIVSESGDFIIAESGAYPTYMIVE*
Ga0073581_101395 | Ga0073581_10139530 | F097367 | MERQSYYNVRYAGKLIQVIPAHTKWEAIDRIYNKNIGTHPWIKREKFTAVKSKN*
Ga0073581_101831 | Ga0073581_10183118 | F099975 | MSNFSQGKEIKELRGGVQVDAAQVIPQIDAEIAEKGHFWMHTN*
Ga0073581_101844 | Ga0073581_10184411 | F092871 | MGRKSVYLKQIEHLLSVGYEAKDIVKITNIPKGTIYRIVDQLREEAKTNFDQLMTKDYLYKYQMNLDNYSKTIIQCNIEIDTINVKYDELEKVVMDSLEVCPIDKYIARSAYMTNLINIRNNRTIEIQKLIAQRDKSSEMKAKIFNSGPVVYRINQVVESKVLQPEMLNEKPRVEFVQKDVKHEISEDDLQVLKEMEEND*
Ga0073581_101844 | Ga0073581_10184413 | F072848 | MTNLIDNEKIDIGFNELDMAMEKIINEHKLNFYEVLSVLAMMDTKVKQNNISQYLLETVTRFQELLNKEDRDGR*
Ga0073581_101930 | Ga0073581_10193023 | F040581 | MEFLENGNFNGIKFLGKRRVYNGYLNKEQSWKEYQQWAEKNIY*
Ga0073581_101930 | Ga0073581_10193027 | F025034 | MDSKKAYEIMKEEYVDREALELCNQIEKQLERIISYDDGFLNDRTFDEIKKSAIKLLKEWHL*
Ga0073581_102055 | Ga0073581_10205573 | F063346 | MNVEHRIMYSTIYNKDKATDRRLSEPTKGRRALSTARDGGQERLQHSTFDFDVKIIASEITTKSSYHVEIRYTGQEV*
Ga0073581_102336 | Ga0073581_1023363 | F063346 | MNVEHRIMYSTIYNKEKATRPPRLSEPTKGRRALSTARDGGQERLPHSTFDVERSMFDVQIVASKIATKPSCCVEMTYTCQEF*
Ga0073581_102347 | Ga0073581_10234738 | F015411 | MPRSSYGRKDGSGRGWKEGGRGRNRTNICRHPNIRKRRR*
Ga0073581_102540 | Ga0073581_10254035 | F040567 | MKKYIVVERNRKFDSFDLINEDTNGLNEIWESHIPYKLNEDEKPNVRPLIFDNRSEAVKYKNKAQSDRDRDWRENDYIYKSFGDLKPKWKIEEYTGNLFK*
Ga0073581_103334 | Ga0073581_10333415 | F048391 | MNDGKSVEPINTLDYLKELLDDGYVVKGPRKDPSRDLISFKAFLKKGKEFVPELWLSNMGYEFVEPSTFTKGHKIAYKIIDEFPDERFNSNYTLVKGEKIFPLYLKVEVPKVE*
Ga0073581_103733 | Ga0073581_1037331 | F056961 | MYSTIYNEDKAKRLPHSTFDVERSMFTRLRRRQRRPGFDVQIVASEITTKPSYHVETMYTDQ*
Ga0073581_105188 | Ga0073581_10518830 | F050310 | MASMQAFIVGIVLVGVTLVIGIFISAEIADQMDTGSAEANAANDLVTALSGGSAWITILVVVGFATIVLGMLTQGLGRSADVAGPVY*
Ga0073581_105890 | Ga0073581_10589013 | F102489 | MSSRKGYRSQAEAAEEYKKRGWGVFVPQKSKYGAQDIFGMFDLVAISPDGSEIHFVQVKSNSTRGFLKKLKEWREKHVVKKVEWRLMVRLDARKHKKKWKVYH*
Ga0073581_106224 | Ga0073581_10622413 | F018717 | MKITIEHYDEKVSIETKHDDITFADFMQLVRKVAHTVGYGTKTINEWYNG*
Ga0073581_109247 | Ga0073581_1092479 | F036294 | MDRGFSDEAIVGVKPAADEDAVTYQRVKLSENDKDVGAEGRNM*
Ga0073581_110368 | Ga0073581_11036810 | F039146 | MKNKKNCGCFSKYMKPSVKGTKGSKGRNGWDAKPVFRITNPGRR*
Ga0073581_111381 | Ga0073581_1113813 | F056961 | MNVEHRTSNVEHRIMYSTIYNKDKAKRLPHSTFDVKRSMFDVQIVASEITTKPPYHVEITYTGQEF*
Ga0073581_112568 | Ga0073581_1125685 | F094909 | MLNKKVLDWIEGYVLSRMLIFGDFVGTVKRSDLIFDLDTTSEYFRKHEGQRIQSLSLPVKSRNLSTLLGEIEDCFQQGWILMDNGEWTEAEFDRVYELSRRVITEIALLRQELNQENPEPYFPLSDPMHLK*
Ga0073581_112901 | Ga0073581_1129015 | F068714 | MKILETNKSHIIYKEDSGIVSIVPNTPYFNMKVLENKNKRIKFDLPIEGIEILETFNGMIFNISLNGKTITTRPNMADAVVESFTDTDKIYKIFDDYNNNKYNLKLLSGILEKYGERIKTRPDGFVIDDIFLVDRTGVCWLWDDKNNCRNKNHRTNLGSGAICIVVDKTQRLKLNTKNGLVEIDEMGYIILSKIEFLMQPNLNDTVFMHQVPKKIQTILQRNKKLYK*
Ga0073581_112901 | Ga0073581_1129018 | F070560 | MLMTNQTQQFKQLVSSAKPETQIPKTESMVTVSSLMSTADQVQFGKGLSSIFVDVPHHTSGESQIFSVRPQQLVPRSAVIQRDESSQLVGISAVSGG*
Ga0073581_113972 | Ga0073581_1139723 | F076639 | MKNSEIKKGQDRAPILWMHEAITYLGLDRLGLTRPDKAMYRLIKKGALHPKKIAGHFAFDKSELDIVVANGDQKRGRGRPKKIHSA*
Ga0073581_114535 | Ga0073581_1145352 | F078539 | MVRPFTDRENFIVASVIMVVSDKMKSVSRETRTNILQYIRETKYPGVTDQDWKDIANGIDAHKKDVFSVMVKAFHESSSNPSVSSNKAFATLDTDMKAEIEDIDFDELKSIVDESDDPKLREYYLTMKQLKRDFDDDRKK*
Ga0073581_114535 | Ga0073581_1145354 | F098044 | MNTTSITNKQAVIGVNSTYETVLVIEGIILNNPMMSDLQDIAKLNKTNLCKMALMEFMRNPVSSRKLKRLFIAANNDDKFRFVMQGGQL*
Ga0073581_114535 | Ga0073581_1145356 | F042625 | MTDKAEKDKLWKDLKIQWSYLKRDSGADEVKKGEAKKRINEIQEALKLDKTDWNQPRSGPPGSHLTNAGASPMPSNNALVEKILGTVLDMKRTVNEDLIALTQKINNLEEVVKKGACNCAPPSDTPLD*
Ga0073581_115009 | Ga0073581_11500912 | F068117 | MSENVKTINPANAPTVEKSKSLGIAKLASGKKEFYITGCQHNRGTPSQYTRADQIGEDGKTDYYTIKVETPIELEYKEEGKIPIDNFFVTPTIYQQIERIPNALEGINTGARLGPVKAVKRESQKTTNSYWCLAFESDPDF*
Ga0073581_115482 | Ga0073581_11548210 | F029954 | MELKVREYKFNDGERKVEYVIPPTNTPFFSVVFSKRGGVRMIAIHQRVLEYHEAVVLISLLCGVIKNERGNGYRDGSSTLYH*
Ga0073581_115482 | Ga0073581_11548215 | F066330 | MAEGEKRKVVVSVALSVDEYLRLEEAVVRKHGRKWGVFSEFVREAIKKAVEEVLQARSEK
Ga0073581_115482 | Ga0073581_1154822 | F049343 | MEWSRLRDKLRKGEWHAFSAACFIAGVVVGKEITTLGRAEPILALTAVGAVLLAYWLALLTATIIAEIKEAPKTI*
Ga0073581_115482 | Ga0073581_1154823 | F035145 | LEKLSEFMRRDYNLKTIITEECCDLVRVEVICLTARTSFPLLSLTYNRHTDNFILTVERKFITQQSTLKALIEALESVKEVMLNGMEQIER*
Ga0073581_115482 | Ga0073581_1154824 | F050065 | MNMLKATASMEQQILLAIAYLKGLLRDARRRGDKEVVRYIGEIIGVYEARLQPQNNNN*
Ga0073581_115551 | Ga0073581_1155516 | F013049 | MDQDVETQFHTIDLFIRSVLDDMDKVSKSTRKIDIMAYIDSWKTELKTVQHIINL*
Ga0073581_116653 | Ga0073581_1166532 | F005243 | MSDSKTNSNLQDKDPKMSKEEMSARRDEITQFYNDNIPHLEVQADYETLLATIEKARAERMQAQMFMAQQYAAQKGEGAPDLNTEEGKAFQEAMVKAMQDETA*
Ga0073581_118754 | Ga0073581_11875425 | F028664 | MKNQILIGGQALRELGSDRYTNDIDYLINDESSTEAFKTSKEVDYLNANGNKFFNEIYKIEKNNKIASPQSLLELKAYAFVQHCQNYNWEKVDSSEYDMKFLVRKFKLNSVSIVKNFVSNGELFEIEKIIKSVKF*
Ga0073581_120318 | Ga0073581_12031812 | F092080 | MQDKRVTIRIPFEIWKALRELQTLGKISSIQQAAVSGMDRLIESLKTGDEENRRDAAKERILNILSAAKPLGNWEDIHRERSEADADRS*
Ga0073581_120816 | Ga0073581_1208165 | F044793 | MIINTSSYTNAIPVALNDNINIPGPTVRASGTTTSLTNNKLVDTNANFVQVIDAKGNITNQGVQRGQIVYNMAAMNTTAWLGPEAAEILEVENNNTLVLSANIFPVTGAPSTTQEYKIYDANKTNPKGAIIMVGDNIAGNNTKSDVFVKTLDGEDVLVQGVAPGETLDLVVQRVMVGSAATSGAPSTLTTAEKITAFI*
Ga0073581_123682 | Ga0073581_1236823 | F060609 | VIGDAQVDSIDRRDFCKKAIKRSSVAATVGVAGYIAYKKPAIRSFFGAKDAYAASTTAAGKFSLKGDSN*

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice