NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300006945

3300006945: Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Dewar Creek DC16 2012 metaG



Overview

Basic Information
IMG/M Taxon OID: 3300006945
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111485 | Gp0115603 | Ga0073933
Sample Name: Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Dewar Creek DC16 2012 metaG
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 338563328
Sequencing Scaffolds: 34
Novel Protein Genes: 38
Associated Families: 32

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea | 3
All Organisms → cellular organisms → Bacteria | 19
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1
All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 5
All Organisms → cellular organisms → Archaea → TACK group → Candidatus Korarchaeota → Candidatus Korarchaeota archaeon | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1
Not Available | 1
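
As a quick consistency check, the per-group scaffold counts in the Dataset Phylogeny table can be summed and compared with the Sequencing Scaffolds value (34) listed under Dataset Contents. The following is a minimal Python sketch, not part of NMPFamsDB itself: the counts are hard-coded from the table, and the names `PHYLOGENY` and `domain_totals` are illustrative assumptions.

```python
from collections import Counter

# (lineage, number of scaffolds), copied from the Dataset Phylogeny table.
# The shared prefix "All Organisms → cellular organisms → " is dropped for brevity.
PHYLOGENY = [
    ("Archaea", 3),
    ("Bacteria", 19),
    ("Bacteria → Proteobacteria", 1),
    ("Bacteria → Terrabacteria group → Actinobacteria", 1),
    ("Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon", 1),
    ("Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon", 5),
    ("Archaea → TACK group → Candidatus Korarchaeota → Candidatus Korarchaeota archaeon", 1),
    ("Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium", 1),
    ("Bacteria → Terrabacteria group", 1),
    ("Not Available", 1),
]


def domain_totals(rows):
    """Sum scaffold counts by the first rank of each lineage."""
    totals = Counter()
    for lineage, count in rows:
        domain = lineage.split(" → ")[0]
        totals[domain] += count
    return totals


totals = domain_totals(PHYLOGENY)
total_scaffolds = sum(totals.values())
print(dict(totals), total_scaffolds)
```

Grouping by the first rank only is a deliberate simplification: it treats each row as a disjoint set of scaffolds, which is what the table totals suggest.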

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Bacterial And Archaeal Communities From Various Locations To Study Microbial Dark Matter (Phase Ii)
Type: Environmental
Taxonomy: Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment → Bacterial And Archaeal Communities From Various Locations To Study Microbial Dark Matter (Phase Ii)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): aquatic biome | hot spring | sediment
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Water (non-saline)

Location Information
Location: Canada: British Columbia
Coordinates: Lat. (°) 49.9543 | Long. (°) -116.5155 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001341 | Metagenome / Metatranscriptome | 719 | Y
F001564 | Metagenome / Metatranscriptome | 670 | Y
F002294 | Metagenome / Metatranscriptome | 574 | Y
F002493 | Metagenome / Metatranscriptome | 554 | Y
F002916 | Metagenome / Metatranscriptome | 521 | Y
F003217 | Metagenome / Metatranscriptome | 500 | Y
F004454 | Metagenome / Metatranscriptome | 437 | Y
F004874 | Metagenome / Metatranscriptome | 420 | Y
F007214 | Metagenome / Metatranscriptome | 355 | Y
F007999 | Metagenome | 341 | Y
F011072 | Metagenome / Metatranscriptome | 295 | Y
F013170 | Metagenome | 273 | Y
F016227 | Metagenome / Metatranscriptome | 249 | Y
F018387 | Metagenome / Metatranscriptome | 235 | Y
F020112 | Metagenome | 226 | Y
F024517 | Metagenome / Metatranscriptome | 205 | Y
F029929 | Metagenome | 187 | Y
F031015 | Metagenome / Metatranscriptome | 183 | Y
F035241 | Metagenome | 172 | Y
F038735 | Metagenome / Metatranscriptome | 165 | Y
F039196 | Metagenome / Metatranscriptome | 164 | Y
F046219 | Metagenome | 151 | Y
F050940 | Metagenome | 144 | Y
F056992 | Metagenome / Metatranscriptome | 137 | Y
F059099 | Metagenome / Metatranscriptome | 134 | Y
F062987 | Metagenome | 130 | Y
F063724 | Metagenome / Metatranscriptome | 129 | N
F091496 | Metagenome / Metatranscriptome | 107 | Y
F093913 | Metagenome / Metatranscriptome | 106 | Y
F098955 | Metagenome / Metatranscriptome | 103 | Y
F099502 | Metagenome / Metatranscriptome | 103 | Y
F103293 | Metagenome | 101 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0073933_1000002 | All Organisms → cellular organisms → Archaea | 804628
Ga0073933_1000016 | All Organisms → cellular organisms → Bacteria | 396028
Ga0073933_1000017 | All Organisms → cellular organisms → Bacteria | 391608
Ga0073933_1000113 | All Organisms → cellular organisms → Bacteria | 150325
Ga0073933_1000118 | All Organisms → cellular organisms → Bacteria | 145637
Ga0073933_1000218 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 106960
Ga0073933_1000332 | All Organisms → cellular organisms → Bacteria | 80291
Ga0073933_1000788 | All Organisms → cellular organisms → Bacteria | 40049
Ga0073933_1002446 | All Organisms → cellular organisms → Archaea | 15350
Ga0073933_1002718 | All Organisms → cellular organisms → Bacteria | 14016
Ga0073933_1003059 | All Organisms → cellular organisms → Bacteria | 12544
Ga0073933_1003189 | All Organisms → cellular organisms → Bacteria | 12085
Ga0073933_1003308 | All Organisms → cellular organisms → Archaea | 11675
Ga0073933_1006822 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 6069
Ga0073933_1009575 | All Organisms → cellular organisms → Bacteria | 4504
Ga0073933_1010026 | All Organisms → cellular organisms → Bacteria | 4325
Ga0073933_1012479 | All Organisms → cellular organisms → Bacteria | 3558
Ga0073933_1013150 | All Organisms → cellular organisms → Bacteria | 3404
Ga0073933_1016606 | All Organisms → cellular organisms → Bacteria | 2738
Ga0073933_1026148 | All Organisms → cellular organisms → Bacteria | 1821
Ga0073933_1030623 | All Organisms → cellular organisms → Bacteria | 1588
Ga0073933_1031638 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1543
Ga0073933_1031681 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 1541
Ga0073933_1037435 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 1336
Ga0073933_1043926 | All Organisms → cellular organisms → Bacteria | 1176
Ga0073933_1047244 | All Organisms → cellular organisms → Bacteria | 1110
Ga0073933_1066948 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Korarchaeota → Candidatus Korarchaeota archaeon | 841
Ga0073933_1067681 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales → unclassified Bryobacterales → Bryobacterales bacterium | 834
Ga0073933_1075717 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 764
Ga0073933_1076089 | All Organisms → cellular organisms → Bacteria | 761
Ga0073933_1086711 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 689
Ga0073933_1094025 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 647
Ga0073933_1094833 | All Organisms → cellular organisms → Archaea → Euryarchaeota → Archaeoglobi → Archaeoglobales → unclassified Archaeoglobales → Archaeoglobales archaeon | 643
Ga0073933_1117200 | Not Available | 547

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0073933_1000002 | Ga0073933_1000002351 | F004454 | MDRSRRTVVEVRADLHQRIKQLALLNDLRIYELANAVIEEALADKEKVKALIKRLKLDTWRQHT*
Ga0073933_1000016 | Ga0073933_1000016133 | F056992 | MELRSPEELRQFVDLDRAEVVDERAKGGEVILIPLVNPFAPIPALSAVADNLSWFMEQVTGRGYQKAEEVYDVGFIVREPGHQAFGLKVNAESGMVVISRVSILEDETVFRRYVNYLRTGVFL*
Ga0073933_1000017 | Ga0073933_1000017106 | F011072 | MYYQLQWKTLPGLRGLSCSEFRAEPTAQPDPERGVAIELASEAERDALVRALEQQFAAQRFSNTAAAFEAVKSCVLDWVARRGAEGQQHRNS*
Ga0073933_1000017 | Ga0073933_1000017205 | F062987 | MKNPHAVALGKLGGIARVFRTTPEQRRRWARRGGLARARRYSREQLSQWAKLGGRPRKKEGGA*
Ga0073933_1000017 | Ga0073933_10000177 | F024517 | MYGLHTIQDVHRLVKAFAGVPDCFVPLKTRREISYPLSFRVPLSQVEEIDAAAAREKRKRRSDLLEAIWEVAWAEYKRAGSLEAYASGRRGRGYTRRVSEELQDQIFTAVELILERAPSAVIEELARKLTQWAGKYGSRR*
Ga0073933_1000017 | Ga0073933_100001792 | F002916 | MQYLPGWTVQCVNPDCAARGHWLRADEAFPEFCSNCGAPLRNVPPPLAPRLRLRPRPLVGYRPPGRPRR*
Ga0073933_1000113 | Ga0073933_100011370 | F002294 | VRPFEETVSAELAWLLRAGVPPRALRLTVRELVVTRLERGALGGREVSDAVAAAVRAACRLVRELDAPGDVVETVCRAALEAVRGHGGESARWMPEATSAVYAVLDELAREGAAEPAWRLVARRLERW*
Ga0073933_1000118 | Ga0073933_10001185 | F016227 | LNGVSLKGVCTTLVTTLLVISVAAGDEAPIVGTVKAVDGAARTLTLETSAKGKTREVVVHLAPGARIVKFVRPSEPGKTGFVEQPLALAELTVGSIVSIEARHEGHREVADLVKVVLER*
Ga0073933_1000218 | Ga0073933_100021895 | F002493 | MAEITQQTTSAETQSQDLAARVRAIRARLPGQMLSERVEMARLHYGPLYTLAQLQERIGRTLPFRFGFIRTATLEPIESYRPRIPDEALLKWDDAVQKGIFDKFWVAVPAYFRERQSDPWIVGEISGAGLFAVIARWDE*
Ga0073933_1000332 | Ga0073933_10003324 | F001341 | MTTTPEGRDPTVRTREIVFEADVQAVTPFLKLATVSRDGTGHMTFVSDEGPNLGGLGSAPTPLMYFSAALAF*
Ga0073933_1000788 | Ga0073933_100078846 | F007214 | MHGLSLARLPLSARVFCTLALVGLGLGNLAAVTQAATTVGLAPAAVQASLAPEMPMTHLGHGPTSAEQEIDLAQLGQQARVWIRTPLLIQTSHTHLFGQTLIAGLLGVIFLLAAVPERLKALIVALPFVGTILDIGGMWLTRFVSPALAWVVIAGGGLFALGYLLIAAISLHQLWLRRERVA
Ga0073933_1002446 | Ga0073933_10024464 | F029929 | VPIFQQSVDSELYEWLLKEQKRRKARSIQDVIRQVLREAMAEAERHYG*
Ga0073933_1002718 | Ga0073933_100271810 | F035241 | MARVLQGPEQAQRAYRELLGLEGSSRERAVDFMAQMARGYGLDLTPWFEAILRFHPAPTARAAAAAYLAAEGREDTVLEVLESLRPPLLTMTLLTGLLDALAPRPATPERVARLTAFAQRFNDDEHYTTLYLGDFTGRDVKRLVRRAVAESLLRQHAPEAVSWPRE*
Ga0073933_1003059 | Ga0073933_10030599 | F003217 | MSEGGFVVQKVEWFQFRTESGEPFLVMVASLPNGFFTAVPCRVPITHAAHGNMALGSSVDDALRQLQNTLAGKTVEEIFRSE*
Ga0073933_1003189 | Ga0073933_10031895 | F039196 | MSREQTFLLALGLLIAAAALVSAGGFVLLARDRRAGRAAHHPEAAEER*
Ga0073933_1003308 | Ga0073933_10033082 | F013170 | MSVHLHTTITRKTNEALEELAKTYGTKSRVLEKAVETLLRVDKVGSCDDCVIKARMSEQTNLREALDLTSIGRKTLDSLLDVAVGDKTIESLLKEQKAEAKNIIEILKGSVAWKTPSNFKEFISVLEEIKNLTRMFDIPSYSEIDSVAILRPKAFKRLPEIVAFQIATILEGIGVPFDLRMMGEDIVVKMIRSEVYPLRRKEFSEFLDQQIEKRLSNVTPGLFKNNLMLVGPAFMSWAEKHLEEPVTDLGSFIEDVRIALGADELPKEPKDFVKGLLSACVKMNWFRQAKVLAEKEENTLELVFQVTAIPVTRISVAAFSVMLATRGWKLVNYSIEHTTVNMTVKYVGAEDQSLLDQLAELSLFQTIGKQFLDVVPVPRELFNSFASKVYETDRQKFDEIYRATGVRIANAIRMLARNDPEKIRRLSQNFILKSINATQPDAEVRFVDDEHFTMIFKRVEPLIMNSQRMLIESMFRELGYDISTTVFQNLLSFKLKLLEKPVLEPVPRKRIMQTLVDEMTSCKTVEEAFALEKEQLDEMFPEDYPWTIREVGDRLIDMYRELGIEVEIEYFEGGFTLKYKTCPYYKLVKNQQKTWLCTVRKKTLDYVISRVSHGKKGKIKIIKSLLQNEHPCEYAIFLTGFLEREEKTH*
Ga0073933_1006822 | Ga0073933_10068228 | F018387 | VLGAEPPLLEQGSVQFILFGLTVLAIFVGLWITISK*
Ga0073933_1009575 | Ga0073933_10095753 | F063724 | MSSGWLQRWRDAYYTENPIVWLLAKTESELCLRQISAPETAETGSAPSRLQLLKSDPVYSTRYFLWADEKEFERTLLAPWHLPSNRWMPVSDQIYLAFASLVTMTALVIHYALAKSPVWVGAIGVGAVVLAFFLPMARPMLALHSLAQTLTADTPLHIGVSRLTPAHLVYGALLYGVGRIARASAVYGILPALLVATIAQGTVFRGLPTAILLWGYSVCVGALWILCTLATVYGQATRLYGASRDWVRWQNILALAAGLLSASLLAVSVPAPLPAPALTAWVHTTLWWWGFLPPLGIVSGLAVGFHPLWTLSHIVGTLLYIGLGATACTRLLRGVWRSQEQPLNAETEGWE*
Ga0073933_1010026 | Ga0073933_10100265 | F059099 | MAQNPEDPVVLDPQEVLRLIDEARSRLRTAIWALHGMHQDMNALTADDLADVEHMLTELLDGALTPAYESLSQIVQTNPDGTVTTH*
Ga0073933_1012479 | Ga0073933_10124795 | F093913 | MRAYVKTAGTNIVFVDIDTGERVSVSATAIQKLFGVAVQEGDVWDFRASKLDDIAENFRKWQIVQQA*
Ga0073933_1013150 | Ga0073933_10131503 | F004874 | MGFNAMRPGKVKRGDLLYLAAALIVMAALVIWAVR*
Ga0073933_1016606 | Ga0073933_10166063 | F031015 | ANTWGRIEIPASGDGYYDQYRIEVQRTGGSDYALTSKIIGVR*
Ga0073933_1026148 | Ga0073933_10261482 | F099502 | MTEKANQESALGTIIGWGSLGIFALWFAYQVGAPLLIGEQWAEKQNDAIELVKNSKPLGNETLYDMIRAYSLKAKENDFFVGEFSWSAIQKDGPEYEVTLLWTEGDQKKVALWRVNLENKEVRPQGDAASLPQRLAAGPPKKGTGS*
Ga0073933_1030623 | Ga0073933_10306233 | F001564 | VRRDELCDPGLCPDAPEGGRCDHCPLGRLDAAQSSEPGQLLRRALNLRAALKLGVKLSLDEIAADEFQAMLIIEEEQVRWEEECTKRHG*
Ga0073933_1031397 | Ga0073933_10313972 | F046219 | LDAAQSSEAGLLLRRALDLRAALRLGIRASLDEIRADEFRALIVLEEERDALDREQMNAHGR*
Ga0073933_1031638 | Ga0073933_10316383 | F020112 | MVSKAFRKCVDLIEEMLREGYRLQIPSTHVERLIKIHVGADKRTIQKYMKMLTEDLGFLENTAKNSLGIIIYRINIQTIEQHVSEHLKEKLRQLTLLDMRLKQEEEVKTEKI*
Ga0073933_1031681 | Ga0073933_10316812 | F103293 | VALNLEAVPWLEPPYEIFEFRPCEWAAFHVTAFKIGKMNIAPRWPGAPTAKLILAIRLFVDPKTKPAYPHYYDITPSRLVNQLSAVLTAGIPPGMYLRIHRDVAGPKAHFSVEWAPTV*
Ga0073933_1037435 | Ga0073933_10374352 | F103293 | MALDLEAVPWLEPPYEIFEFHPCEWAAFHVTTFKIGKMQIAPRWPGAPTAKLILAIRLFVKPETKPAYPHYYDITPSRLVNQLAALLTRGIPAGMYLRIHRDVAGPKAHFAVEWAPIV*
Ga0073933_1043926 | Ga0073933_10439262 | F091496 | VAREIDWQLFEKACDLTASALRGSMGGEGSQPPRFAAEVFREVWAALKEASADLPAKPKAGF*
Ga0073933_1047244 | Ga0073933_10472442 | F098955 | MDSSAIPVFLAGPFPVLHSAWVQEPDGEVELDVALLIGGVPTMIAATRFPLDETWDRIRRALESGDARLGVAGVPHEEESPIGTREVYPAAYVGLECANGERLVLAHIRAPRPGMEPEAFARHVLSSILKGHTPLELGEPIEE*
Ga0073933_1066948 | Ga0073933_10669481 | F103293 | MALNLEAVPWLEPPFEIYEFKPCEWAAFHVTAYKIGKMQIAPRWPGAPSAKAILAIRLFVDPKTKPAYPHYWDITPSRLVSQLAAML
Ga0073933_1067681 | Ga0073933_10676813 | F001564 | LIHWALRREELCDPGLCPDAPDDGGRCDHCPLDKLDAAQSSEAGLLLRRALDLRAALKLGIRIGLDEVPADEFRALVALEEERDALDREQMNAHGR*
Ga0073933_1075717 | Ga0073933_10757172 | F103293 | VALDFEAVPWLEPPYEIFEFRPCEWAAFHVTAYKIGKMNIAPRWPGAPSTKTVLAIRLFVKPETKPAYPYYWDITPSRLVAQLSAMLTRGIPPNMYLRIHRDVAGPKAHFTVEWAPTV*
Ga0073933_1076089 | Ga0073933_10760893 | F038735 | MEEESKAMRWFIDSMLGYWILLLAISMGTAAAVYLLWLKQAAFSG*
Ga0073933_1086711 | Ga0073933_10867111 | F103293 | VVSVALNLEAVPWLEPPYEIFEFRPCEWAAFHVTAFKIGKMQIAPRWPGAPAAKLVLAIRLFVDPKTKPAYPYYYDITPSRLVNQLAAMLTRGIPPNMYLRIHRDVAGPKAHFAVEWAPTV*
Ga0073933_1094025 | Ga0073933_10940252 | F050940 | MGVMPGNRYWKRFDHGLCNNGYQYYVGLNCLRPDETFAADERVLCSYPGFHFGSRSWCAANYPERPLEALIRIPEDAQINEPWATDGKASADRIEILQVFDVVTGEDVTDQYRKGTA*
Ga0073933_1094833 | Ga0073933_10948331 | F103293 | VALDFEAVPWLEPPYEIFEFHPCEWAAFHVTAYKIGKMNIAPRWPGAPSTKTVLAIRLFVKPETKPAYPYYWDITPSRLVAQLSAMLTAGIPPGMYLRIHRDVAGPKAHFSIEWAPIV*
Ga0073933_1117200 | Ga0073933_11172001 | F007999 | VSDAWTTRRLARAAQARANLVAALRECCELADAVETFEGEELLEVLVHLDGLRFVMAESGQILQGVIRGIEELER*
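
In the Sequences table, every Protein ID begins with its Scaffold ID followed by extra digits, and most sequences end in `*`. The Python sketch below shows one way to split a Protein ID and note the trailing `*`. It rests on two assumptions that this page does not state: that scaffold IDs always take the form `Ga` + digits + `_` + seven digits (true of every ID listed here), and that `*` marks a translated stop codon, as is conventional in protein sequence files. The names `ProteinRecord` and `make_record` are hypothetical.

```python
import re
from dataclasses import dataclass

# Assumption: a scaffold ID is "Ga" + digits + "_" + exactly seven digits,
# and the Protein ID appends one or more digits to it.
PROTEIN_ID = re.compile(r"^(Ga\d+_\d{7})(\d+)$")


@dataclass
class ProteinRecord:
    scaffold_id: str   # prefix shared with the Scaffold ID column
    gene_suffix: str   # digits that follow the scaffold ID
    family: str        # NMPFamsDB family accession, e.g. "F004874"
    residues: str      # amino-acid sequence without the trailing "*"
    has_stop: bool     # True if the listed sequence ended in "*"


def make_record(protein_id: str, family: str, sequence: str) -> ProteinRecord:
    """Build a record from the Protein ID, Family and Sequence columns."""
    match = PROTEIN_ID.match(protein_id)
    if match is None:
        raise ValueError(f"unexpected protein ID format: {protein_id!r}")
    scaffold_id, gene_suffix = match.groups()
    has_stop = sequence.endswith("*")
    residues = sequence.rstrip("*")
    return ProteinRecord(scaffold_id, gene_suffix, family, residues, has_stop)


# Example row taken from the table (family F004874).
record = make_record(
    "Ga0073933_10131503",
    "F004874",
    "MGFNAMRPGKVKRGDLLYLAAALIVMAALVIWAVR*",
)
print(record.scaffold_id, record.gene_suffix, len(record.residues), record.has_stop)
```

Sequences listed without a trailing `*`, such as the F007214 entry, would yield `has_stop == False` under this sketch. Whether that indicates a gene truncated at a scaffold edge is not stated on this page.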




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"