NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007301

3300007301: Hydrothermal vent microbial communities from Teddy Bear hydrothermal vent, East Pacific Rise - large volume pump, sample 5



Overview

Basic Information
IMG/M Taxon OID: 3300007301
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0116043 | Gp0118654 | Ga0079920
Sample Name: Hydrothermal vent microbial communities from Teddy Bear hydrothermal vent, East Pacific Rise - large volume pump, sample 5
Sequencing Status: Permanent Draft
Sequencing Center: Bigelow Laboratory Single Cell Genomics Center (SCGC)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 249394076
Sequencing Scaffolds: 42
Novel Protein Genes: 44
Associated Families: 42

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1
Not Available | 25
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 5
All Organisms → cellular organisms → Archaea | 2
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → unclassified Nitrososphaerales → Nitrososphaerales archaeon | 1
All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED264 | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Hydrothermal Vent Microbial Communities From Teddy Bear Hydrothermal Vent, East Pacific Rise
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Hydrothermal Vents → Unclassified → Hydrothermal Fluid → Hydrothermal Vent Microbial Communities From Teddy Bear Hydrothermal Vent, East Pacific Rise

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine hydrothermal vent biome | marine hydrothermal vent | hydrothermal fluid
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline)

Location Information
Location: Teddy Bear Hydrothermal Vent, East Pacific Rise
Coordinates: Lat. (°): 9.847222 | Long. (°): -104.2975 | Alt. (m): N/A | Depth (m): N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000057 | Metagenome / Metatranscriptome | 3033 | Y
F000615 | Metagenome / Metatranscriptome | 984 | Y
F002030 | Metagenome | 601 | Y
F002348 | Metagenome / Metatranscriptome | 568 | Y
F002715 | Metagenome / Metatranscriptome | 535 | Y
F002745 | Metagenome | 533 | Y
F003285 | Metagenome / Metatranscriptome | 496 | Y
F004643 | Metagenome / Metatranscriptome | 429 | Y
F005610 | Metagenome / Metatranscriptome | 395 | Y
F006198 | Metagenome / Metatranscriptome | 379 | Y
F008155 | Metagenome / Metatranscriptome | 338 | Y
F008692 | Metagenome / Metatranscriptome | 329 | Y
F009692 | Metagenome / Metatranscriptome | 314 | Y
F010197 | Metagenome | 307 | Y
F013570 | Metagenome / Metatranscriptome | 270 | N
F016878 | Metagenome | 244 | Y
F017326 | Metagenome / Metatranscriptome | 241 | N
F020372 | Metagenome / Metatranscriptome | 224 | Y
F020602 | Metagenome / Metatranscriptome | 223 | N
F022430 | Metagenome / Metatranscriptome | 214 | Y
F022526 | Metagenome | 214 | Y
F023622 | Metagenome / Metatranscriptome | 209 | Y
F023950 | Metagenome | 208 | Y
F024888 | Metagenome | 204 | Y
F033215 | Metagenome / Metatranscriptome | 178 | Y
F043455 | Metagenome | 156 | Y
F045148 | Metagenome | 153 | Y
F045157 | Metagenome / Metatranscriptome | 153 | Y
F049035 | Metagenome | 147 | Y
F050430 | Metagenome / Metatranscriptome | 145 | N
F052659 | Metagenome / Metatranscriptome | 142 | N
F055187 | Metagenome / Metatranscriptome | 139 | N
F057444 | Metagenome | 136 | N
F061282 | Metagenome / Metatranscriptome | 132 | N
F065115 | Metagenome / Metatranscriptome | 128 | Y
F066848 | Metagenome / Metatranscriptome | 126 | N
F068126 | Metagenome | 125 | Y
F077397 | Metagenome / Metatranscriptome | 117 | Y
F079636 | Metagenome | 115 | Y
F087323 | Metagenome / Metatranscriptome | 110 | Y
F101351 | Metagenome / Metatranscriptome | 102 | N
F101844 | Metagenome / Metatranscriptome | 102 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0079920_1005854 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Candidatus Nitrosopelagicus → unclassified Candidatus Nitrosopelagicus → Candidatus Nitrosopelagicus sp. | 1280
Ga0079920_1009239 | Not Available | 1113
Ga0079920_1012066 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1025
Ga0079920_1013622 | Not Available | 986
Ga0079920_1015406 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria | 947
Ga0079920_1016025 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 937
Ga0079920_1021396 | Not Available | 856
Ga0079920_1024945 | All Organisms → cellular organisms → Archaea | 816
Ga0079920_1027230 | Not Available | 793
Ga0079920_1030324 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 767
Ga0079920_1033628 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 742
Ga0079920_1033707 | Not Available | 742
Ga0079920_1033953 | All Organisms → cellular organisms → Bacteria | 740
Ga0079920_1034320 | Not Available | 737
Ga0079920_1036369 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → unclassified Nitrososphaerales → Nitrososphaerales archaeon | 724
Ga0079920_1036918 | Not Available | 720
Ga0079920_1037250 | Not Available | 718
Ga0079920_1040097 | Not Available | 702
Ga0079920_1040470 | Not Available | 700
Ga0079920_1043388 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 683
Ga0079920_1044692 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Thaumarchaeota incertae sedis → Marine Group I → Marine Group I thaumarchaeote | 677
Ga0079920_1045329 | Not Available | 674
Ga0079920_1051617 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED264 | 646
Ga0079920_1051622 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia | 646
Ga0079920_1053788 | Not Available | 637
Ga0079920_1057799 | All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium TMED264 | 622
Ga0079920_1069650 | Not Available | 585
Ga0079920_1071222 | Not Available | 581
Ga0079920_1071311 | Not Available | 580
Ga0079920_1078642 | All Organisms → cellular organisms → Archaea → Euryarchaeota → unclassified Euryarchaeota → Euryarchaeota archaeon | 562
Ga0079920_1080610 | Not Available | 558
Ga0079920_1081935 | Not Available | 555
Ga0079920_1083647 | All Organisms → cellular organisms → Bacteria | 551
Ga0079920_1089193 | Not Available | 540
Ga0079920_1090472 | Not Available | 537
Ga0079920_1090482 | Not Available | 537
Ga0079920_1102931 | Not Available | 515
Ga0079920_1105040 | Not Available | 511
Ga0079920_1109488 | All Organisms → cellular organisms → Archaea | 504
Ga0079920_1109702 | Not Available | 504
Ga0079920_1109732 | Not Available | 504
Ga0079920_1110504 | Not Available | 503

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0079920_1005854 | Ga0079920_10058541 | F020602 | MGIKYISGFRGLDHAKQHLRRSVHLGSALTSGYTVLNPAEAFDQQACFVYLNGALLKEGTSGAGGDYVLSGSNTVTFNVAVATTDAIEVISYAFQNPTLPATMTEVDHTITSANASYHSTSF
Ga0079920_1009239 | Ga0079920_10092393 | F002348 | MKPNEKWMKYGGWLLSLLSILIGASFLWPHVHVALLGIAFIYLGIRIFNFSTFDEYKEKRMKLLYKLLK*
Ga0079920_1012066 | Ga0079920_10120663 | F010197 | YGLNLFLLPNTTHELKAFTIKHDFAQHMQKVFSRQDGILFSFEEV*
Ga0079920_1013622 | Ga0079920_10136221 | F068126 | MAVALFLSGCAKNVADKNNDLGSGDKSNLPISLTSLIEHAEYCKAIYD
Ga0079920_1015406 | Ga0079920_10154062 | F052659 | MLLSGNPGTETRSAVFIDGRLISSLPELNYQTIDKLQQGGVPDKVSWMKPSLRGRLRLVKLQAQERNLTIEDLNQSIRSKWVFLWAEV*
Ga0079920_1016025 | Ga0079920_10160252 | F017326 | MNDYEKVKIEINESVTSDNFKEHLRKNYHNSYDEYESDTRHIDLDVYYEKNSIPINELVPKGMTIKEYFGKEQRLLREENISYVKFD*
Ga0079920_1016025 | Ga0079920_10160253 | F043455 | MNMSMRKETVEEYLRRGKTITKIPQVLDTIGSIWNQQGYEVNKDRYGHKKFGIRLQDWKSMQPDIRFDTEDDDRKYWNKLNKKCD
Ga0079920_1021396 | Ga0079920_10213962 | F050430 | MDKKLKENLIVGIKIVGMLFFLIVLITTLVWPGGVDTLVN
Ga0079920_1024945 | Ga0079920_10249452 | F065115 | MLSINRIFIGFFLIGVLYAVGDINPLIGGLAVGLLFGLTDYTKESR*
Ga0079920_1027230 | Ga0079920_10272302 | F002715 | MPEEELQSVKLEVGLLKNEVEVRGRQIDTLLSKLDHTADKLQELTVEIRTLNTRQEDYLRTNTSMSNEFKILHTRIGDLHDKLSSNNRRIEERLDHLDQYKSKLMGMIIVVGGVVGTIVATAISIFLKE*
Ga0079920_1030324 | Ga0079920_10303241 | F049035 | MANWDSRELQTPPAIKKIGQNAQNVINNLDILLKIVKGGAEVAKMFLLLSNPAGAIIKLAANEIIKAANDFKEIGVFYLFINPNDEGYGNQTIRELGLAIKREGLTGLYQFKPTVFSVGGEEGGVKTHTVGTAYQRSLDIADLDSNYRDSNNKSKSDPKFIPPIPIFDDPPEWELGGYDPSTWTGHAPVTSIPLANGVFPPEMRPSKVLQIMSESFDDEGDVSTFEVQAGWKNVSRKAKI
Ga0079920_1033628 | Ga0079920_10336281 | F087323 | KIEVFTLTMQSTGGNLLQDLGDHADFNGKRERGISATFLRARSEDGTQSLFSISSKGLSIHAHVSQKKMGWIRLSL*
Ga0079920_1033707 | Ga0079920_10337072 | F022430 | MSDYIYENRQLALWAEYDDLKGRVSFEAFMERHVNVNGYIVNLLKKFGTVSAEIDRSESAKRFAEMDLVQLGD*
Ga0079920_1033953 | Ga0079920_10339531 | F003285 | MPEQESIHQLQTEIQTLKIKDEFRTKELDALMEKLSDTSSKLNALSENIGRLLAGQDLHKTSDNEVRDELKILHTRIGDLHDKCTEMIDKTETRVSSDISLLYKKVDSLEKWRWITIGIATAIAWLLTNIIPKFLSN*
Ga0079920_1034320 | Ga0079920_10343202 | F101351 | MDKPAFSETVPNELKELGYVPPIIKSYDKNGINVWLDKRKKWYDVPFWQFTMDGVQWMVLDERHGSASQFYSHYKLARGHVICTGLGFGTREQWLASKPEVTKITVLEKFKEVIDYHKDIGTKWPDKIEIINCDANDYKGSCDFLSIDHYEYDDVLRILDSIKKVCNNITCECAWFWMLEPWIRLGYI
Ga0079920_1036369 | Ga0079920_10363691 | F024888 | PDEFAQERGFDNWMEYAAWSRHTGGDYNMMEMMLKSKWKEQDPEEFARQKKIESDQRTREHSYISLPPEAWPTSFIGNKKKKWQPVHKSNFTAEELEVIYDERGVGGEPVEYRW*
Ga0079920_1036918 | Ga0079920_10369181 | F002745 | STSMFKRIWPDTIRATVFHTTDLAGLERLKKLEGGKKSISAFFSMMSRYMETGVATGDGVHVVVEMDADVLVSARDDIMSEVDKQGRRWVMMSWFEYQTRERSKFGKIEKDLNTLIANLVKKHIPKDKEIQQTKHFGKDSGAVFDIWGNMKRHLKGDGQKLRLVIKDYFDGVERILKKNSEVMGNIIYGYAKGKRMTDNSWDEQIVNNIEVKKVHVISEKDDQEGAEWRRDEIKAIGNW
Ga0079920_1037250 | Ga0079920_10372501 | F009692 | HWGSFATQIENEFHRNAITIADDTHNLLLQSSDLTGNTTAVFHLYVSVRHSTATNDAYQVANFIVRAEKAVPADCYLHTIADGGNIGTEFDAVDENAYSTYANVTDGNVGVGVTADANGWYVYLANRSSHSVVAGFKAIAITN*
Ga0079920_1040097 | Ga0079920_10400971 | F033215 | TVLERDALSVNETLVLVMVQVSATAKLPPRLRVVFEVLNKAPLANVAPIGRLRVPPVEERVPAVALIPPLRDKLALPVTIFSPAASRVTGCATLRLDILSSSVKAKLARSSVPTDREALSSIVEFAAKRRVVELSGNTPPKFCVPDTSRMPPESVSALAFVVNVVFFVLRVAPVSDKAFCSTNVVSGSEILVPDGALAPETRVIVPPEEFSDALSVNANAPERVRLPAPVSRA
Ga0079920_1040470 | Ga0079920_10404701 | F009692 | GRYRVLTVPSSSTVTLQTVDNASLAFDDNGTGITFIRVFSKSVPNLTVTNHVMLFLNGMHLVKDTDYYIDQQSVTIDSTVNLLENAVVAVRHFGSFVTQITGEVQRNGITITDDTHNLLLSSTDLTGNTTAVFQLWVSVRHVIATNDAYRTTHLFVRAEKGQASDCYLHRAGDAGNIGTDLQIVETGSYSTYANVSDGNIGVGVTADANGWYVYLSNRSSHSIVAGFKAMAI
Ga0079920_1043388 | Ga0079920_10433881 | F101844 | FYGDTASSVLLWDESDDQLEFGNAAMTISDARTSVTQSLTGLTIDMYNKLDTGSSNDRIGAAISTYGDTNAYTIRDSVGIRAMSRQSTGAEVTRLNTPLHAVLDLANTAIANNSGTTLSGAYGLAIDHDDTIATRTGQPTAFISFGEMYVGGTESIETAYLFDIFPNGKTGDATYGTAADVAFYNDGAQYATTRDTMLLNSTDGGPASDAGDHILMEPDPFDAILLE
Ga0079920_1044692 | Ga0079920_10446922 | F023622 | MKTFKEFQGKIPTQIVEQVYFKIRIPDMSTMFMKATSESAVKLDMRKKLKPDVVKEVTIERVTKAEMRKIYRAMGQGKEDEKEKESAEEK*
Ga0079920_1045329 | Ga0079920_10453292 | F045148 | MDDTTVTDWVDVTAECGQCATEDTVSVPTGQYREWKNGANISLVFGNLSPNQRDILIGADTSRPMPFYLCEVCWNITFDAN*
Ga0079920_1051617 | Ga0079920_10516172 | F013570 | MELEKIKKKREELMTNYNSLVDKRIELEKQLEITNTDILTMRGAILLCNEFVEEEEKPEPKPLFPEKEVVVNDLDKEKDGRQKNK*
Ga0079920_1051622 | Ga0079920_10516221 | F066848 | MRFSTRSIHSLRHPQDDVYRAVSTEVGPGFAMGSSDSVLRTVLKGARQIAADQRLPVDPVAKGAGSEPELNLVA*
Ga0079920_1053788 | Ga0079920_10537881 | F016878 | MKSFKGYLKERFQDWKSGSAPAWTESLSTMLFDLPRAGLKDLKIPLSSSIMNRVWPKSVRSKAFHVTDLDGVYKLTKLGKTKSISAFYNMDDFIISSGIKTNGGYVVELEGDVLAASPDDISSQPDKSGRRWITFSSLMNPSTAADPGLGGKTQLKGIETSLQNLLVKILQDNGKDID
Ga0079920_1057799 | Ga0079920_10577992 | F061282 | VLYEDSPTTAMGGGNIAVRGIPLLKKPPKGLVMKRFGGIDVFAIDPTYFQKSRLGKKKYTRYSGYVGEDEAGEYIRAFARKYPKKPIIVMDSQTGCMQYLRHGS*
Ga0079920_1069650 | Ga0079920_10696501 | F022526 | MNMKKVNTKTVRAKEYRDFSKGGTPQERHMENVERGYVTTDYMTAVAAAAVRYEVIDGKRFKVVTL*
Ga0079920_1071222 | Ga0079920_10712222 | F077397 | NTRTVANSGAFTGLEWNVLVTKIMWATIGCQVGIEWDGSTAEKYIGEFGGNGSWSLPGNEWPGIPINATGDSSEVLGDIQFSTADQSGTDSYTIIMELKKQAPGYDVPAYEENASLGYRVDYVLGNFT*
Ga0079920_1071311 | Ga0079920_10713112 | F005610 | VRSFKEYLTEADREEIRDAKKVFVALQAMYSKLPKFPLVFKNLQTSKNLDKRGGGYLQTSKLKGGKFIFVDKMVIDDSGLGSFEPDYAVVHEFAHAILAVTKGDLGHNKKHADLTYKLAQKFGLA*
Ga0079920_1078642 | Ga0079920_10786421 | F008692 | MSKKKRKQTKEEMYMPDSEEEFQADLDAQDMRYLQYCREQEEMYNDPVMNWSGLR*
Ga0079920_1080610 | Ga0079920_10806101 | F000615 | IVSGLERPENIMCRLRLFYECSDGSMGFAEHVMRYEDDIVGFIKHWKTGGRMVITEHIDLV*
Ga0079920_1081935 | Ga0079920_10819352 | F006198 | MEVELDVECDNCPATYTMVYDSDDIQSQDQEEHAFHCAFCGILMEPYYNEEEI*
Ga0079920_1083647 | Ga0079920_10836471 | F045157 | VIKAGIAKSGTSSGREGIRNKISHLKKTGHLREAQSALMDMINLKSQQKR*
Ga0079920_1089193 | Ga0079920_10891932 | F008155 | MNGRFPSCSATLVYIHAMKKQIEEEVTSKYLKRIEHLVAANNELMYELEQRELELITVESEMITVEKECYGRGV*
Ga0079920_1090472 | Ga0079920_10904722 | F023950 | MTKKTRGATVKYKMYGKFNYEKKFETVKAAKGFFWGYVVKTPNITGELIIH*
Ga0079920_1090482 | Ga0079920_10904821 | F055187 | MRVVNGNVIKSPADLVGLKPFKDAKLLRHEAIERMKRHMERKLKQSKEEARLAQPYVRDADAPSF
Ga0079920_1102931 | Ga0079920_11029311 | F079636 | VATPGETQVLGFDLTANDSLVMPVRKEYEAYDDPVLHREQKQGFYGHANLGFACLDSRMLGMGVIDRS*
Ga0079920_1105040 | Ga0079920_11050402 | F004643 | LNELAKYYDNEFWEENEWVSCFECDKIFEDLEKLYEHQDLHIEEENQKK*
Ga0079920_1105207 | Ga0079920_11052071 | F000057 | MAQKLNTEFNYRYQVIGDTPWERIKTLKGFLEGRIRALALEEVSKLKHQAKLSKLNYLKNGGEGLEHEILELKAEIIEAESHQGSLKEAFELTKDEIKILKKLIKELYAIAEPTRIKGYTDEQM
Ga0079920_1109488 | Ga0079920_11094882 | F057444 | MSTTKQADKKQPEWENMVGGKEPPEQLKEYIEKMIEFNNTAKEILKDTRDWLVTNGNSSR
Ga0079920_1109702 | Ga0079920_11097022 | F002030 | MIGFLIFGLVVLSTVCLWLLIEERKSPKFLIWFIPVLLVIVTSTYVTYTSPLGFPKFGTPEKGMYLRHYIDEPNWIYLWVLSRKNVPMSYQLVYTRKKHDALEGVKGKAEEGKFMVLGEDTSQGAGDELDGDIGGERGGGYT
Ga0079920_1109732 | Ga0079920_11097322 | F020372 | MTDKKNLATIKIYIDGKEMLYSHQNIIGAINSFLPYLTNDDLDELMLRFHTLKNHRRQKEYLAMLEAKKHSWPYPDTINK*
Ga0079920_1110504 | Ga0079920_11105041 | F006198 | MEVELDVECNNCGVNYKMIYDSSDMRYDEPAFHCAFCGILMEPYYDEFFEEDKDE*
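
Each row of the sequence table holds four fields: scaffold ID, protein ID, family, and amino-acid sequence. The following Python sketch shows one way to split a row into those fields and emit a FASTA record. It rests on assumptions drawn only from the rows listed here, not from any NMPFamsDB specification: IDs look like `Ga<digits>_<digits>`, families like `F` plus six digits, and a trailing `*` is treated as a stop-codon marker (a common convention that this page does not itself document). The names `SequenceRow`, `parse_row`, and `to_fasta` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class SequenceRow(NamedTuple):
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str   # residues only, without any trailing '*'
    has_stop: bool  # True if the row ended with '*' (assumed stop marker)


# Fields may be run together or separated by " | "; accept both.
_SEP = r"\s*\|?\s*"
_ROW_RE = re.compile(
    r"^(Ga\d+_\d+)" + _SEP      # scaffold ID
    + r"(Ga\d+_\d+)" + _SEP     # protein ID (scaffold ID plus a gene index)
    + r"(F\d{6})" + _SEP        # family ID, assumed to be F + six digits
    + r"([A-Z]+)(\*?)$"         # residues, optional trailing '*'
)


def parse_row(line: str) -> Optional[SequenceRow]:
    """Split one table row; return None for headers or malformed lines."""
    match = _ROW_RE.match(line.strip())
    if match is None:
        return None
    scaffold_id, protein_id, family, residues, stop = match.groups()
    return SequenceRow(scaffold_id, protein_id, family, residues, stop == "*")


def to_fasta(row: SequenceRow, width: int = 60) -> str:
    """Format a parsed row as a FASTA record with wrapped sequence lines."""
    header = f">{row.protein_id} family={row.family} scaffold={row.scaffold_id}"
    chunks = [row.sequence[i:i + width] for i in range(0, len(row.sequence), width)]
    return "\n".join([header, *chunks])


if __name__ == "__main__":
    example = (
        "Ga0079920_1090472 | Ga0079920_10904722 | F023950 | "
        "MTKKTRGATVKYKMYGKFNYEKKFETVKAAKGFFWGYVVKTPNITGELIIH*"
    )
    parsed = parse_row(example)
    if parsed is not None:
        print(to_fasta(parsed))
```

The parser returns `None` rather than raising on lines it cannot match, so the table header and blank lines can be passed through it and simply filtered out.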

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice