NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300029646

3300029646: Metatranscriptome of soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_10-13C (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID3300029646 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0128792 | Gp0224286 | Ga0206092
Sample NameMetatranscriptome of soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_10-13C (Metagenome Metatranscriptome)
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size25817522
Sequencing Scaffolds12
Novel Protein Genes13
Associated Families13

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available8
All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta → Prymnesiophyceae → Coccolithales → Coccolithaceae → Coccolithus → Coccolithus braarudii1
All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta1
All Organisms → cellular organisms → Bacteria → Acidobacteria1
All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameSystems Level Insights Into Methane Cycling In Arid And Semi-Arid Ecosystems
TypeEnvironmental
TaxonomyEnvironmental → Terrestrial → Soil → Unclassified → Desert → Soil → Systems Level Insights Into Methane Cycling In Arid And Semi-Arid Ecosystems

Alternative Ecosystem Assignments
Environment Ontology (ENVO)desert biomedesertsoil
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Soil (non-saline)

Location Information
LocationUSA: California
CoordinatesLat. (o)33.3049Long. (o)-116.2547Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000344Metagenome / Metatranscriptome1257Y
F004947Metagenome / Metatranscriptome417Y
F007408Metagenome / Metatranscriptome351Y
F011252Metagenome / Metatranscriptome293Y
F015326Metagenome / Metatranscriptome255Y
F018507Metagenome / Metatranscriptome234Y
F021791Metagenome / Metatranscriptome217Y
F052341Metagenome / Metatranscriptome142Y
F063283Metagenome / Metatranscriptome129Y
F098519Metagenome / Metatranscriptome103Y
F100141Metagenome / Metatranscriptome102Y
F102215Metagenome / Metatranscriptome101N
F102608Metagenome / Metatranscriptome101Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0206092_104815Not Available920Open in IMG/M
Ga0206092_106893Not Available770Open in IMG/M
Ga0206092_109437Not Available652Open in IMG/M
Ga0206092_109581All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta → Prymnesiophyceae → Coccolithales → Coccolithaceae → Coccolithus → Coccolithus braarudii647Open in IMG/M
Ga0206092_110643Not Available614Open in IMG/M
Ga0206092_110657Not Available613Open in IMG/M
Ga0206092_111586All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta588Open in IMG/M
Ga0206092_111791Not Available582Open in IMG/M
Ga0206092_111882Not Available580Open in IMG/M
Ga0206092_113052Not Available552Open in IMG/M
Ga0206092_113525All Organisms → cellular organisms → Bacteria → Acidobacteria542Open in IMG/M
Ga0206092_114974All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea516Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0206092_104815Ga0206092_1048151F063283TNTQWSGTGPLTNTNNSPSFYFYMTPPFVTADALIRIILSWGNLISNPANSLPDLDLVVAGPVDASSISTFGTGIVNFQNKDLHSTFKVLPYAKLVTDSAQGYGPEVMDFYGQPGALALGFSSSYAPGAAANAYEVWVDRPNSSPNQDSSFTFLFDTNSFIVVYQNDGTTGGNKQVLFDARTNVPYAYGFYNTDPWNKVPAEATLWHIIDLSQSAGGVVFNGFPGENSTDTTATPGAKYGYDGSFFETTKSIPCGHTASRGASPAYCPATLTYPQSKKK
Ga0206092_106893Ga0206092_1068932F000344MRPKHPHAAESGVGEHTARESEAPNVCVGKERVAN
Ga0206092_108683Ga0206092_1086831F021791ADYRIKPHVPPLNQIPANSVKFQYCNCTAQVEYFRVNLNYTV
Ga0206092_109437Ga0206092_1094371F052341MQIMFGGKNARVMVVMMISILSFFLIQESAAIDCPKDNSQDRIYPGCNCFFEWNDANRVGNWTFEENWMQLNEPGWVAFVSIAGDNTIHLDEARRINEIYIGPNRWDTTRVVLDEDLTVEYDDVPVINSVRGYRQPTGQIRLVIQGKGFGFVSEDIVVTATQVIDNFPDNNVENQITQVYNCEYATLVYRDAQIECNIFTASLYPYSLSVTVAANGH
Ga0206092_109581Ga0206092_1095812F098519MRCEEVSCAKCTVHCERPVCSIRCPKDFCEKNSCPPCETVCKPAQCHTTCSAPEPKCSPVCEELDCKNKCVKPTNCQKPKCELQCEKPQCEEPQCCSCSSSNVQIAVTRATCSTSPCFKEESQQPSLLEVFHTMQHKEQSGAEQCCPCK
Ga0206092_110643Ga0206092_1106431F004947IIVIIFAIAVIALAQPAPPKEPSDFSAYGFVIEWHNDFHRRFKGDIFEDFTNHRQRIDAERHASYGGVITFFRFFDKEREYEYFGQTDMCWQAHFNNTQHPVFEWLEHAHYAGNCHGRHSQGNVKGHLFREHGSESLRREICVADNSTYTPLWIEHHHGRSGRILEFLQFKSGAPDPAVFTLPGSCTKKQEEILA
Ga0206092_110657Ga0206092_1106571F102608FVVFFVSLFVSSQAQNGCQYYDSCGVCNGDDSTCCQDYLGFEPAEVDVELLKWTNQQLIGEIDQLQEHLNATRDALQDSYINEIDGGRLDLSEYIDAIHAFCAGRECSAEDCDSGRTGCGEEECKLDCECLSLSQFEFLQSAFLGAIRSVNSI
Ga0206092_111586Ga0206092_1115861F018507SAVLILFLGLVAFVYTACVSPPSDLQVCVLPSTARVPQEYANTNADRQLQSYFFFLQSIKAVPTDECSLAYIDFACSQAYPRCAGDAASGLGLPVNTCYFQCSNFVEKCRGQLIGVERPDCGTFSVSPDCTAVSVKLPQDTNGASVFSASLALFSVLALVLVL
Ga0206092_111791Ga0206092_1117911F102215RPMRVAIVILFCAIVAAAFAGVTERDDHISANFWDKYQEPAAHQIKTIVRRSVDQLVNARQTAAVNFTTWCFEYQQYIDQFKWGLAATDPETNTEYFHGIVEDSSTDPAVVVGYIYGYSLDNQRLVQMAVDYNDNGYMRLFSFQFNPRNERYSDAFTVFSNADTTLTPQFLGSPEYTSLTKCA
Ga0206092_111882Ga0206092_1118821F007408KKEDTEKFIEGASAALWKALADEAVGLWTKIYSLNTSVESVFSSQPEDVTTPLLDLLSHIFEVQVRGFNSIRVSFTKKLKEALGETKDHDGVKAVARTALRDAIFENINILAEEHWAKTHEALTKAAKAYVIHQFTVELWPSIKSGLDDLLSFLPEELKSLGVDIAGMVLKISLILINKGVAWAMGKIGIRLE
Ga0206092_113052Ga0206092_1130522F100141NFQNCSFTVGGKYYDLSSLQAQGQFNFDQAIYQQDVTDLYVDATNAATNFHWELQVCGNVKSNYASCTTPSPVNQISSSGTTCTALGDLRSAAIDITPGQDGLMMTFYHGLAVSHIQEYSTRLYLLCDPSGAASIPHVEHLKSFYAWHIYLTTKLTC
Ga0206092_113525Ga0206092_1135251F011252SCRSGRNFLSQYSFIEPNAAFPLPPTFNFDEYIETEQELWALFPETDGRRVNFVQIGLTAMIYAEAPDRGAELGPDETYILMPCADYTMRPIRMRACRRLTLSEYRMGHLTAIKLVGKVNEAGDVMREIGCQILGEEVIQEAQARLADYGDDQ
Ga0206092_114974Ga0206092_1149741F015326CYCDWGEHGGDWCMCLGFFSGNMECCEPCCGKPCNTNDGVYCCLSWFPGCCFAGPKCLAASQNQECAFVNHGVPFLLLLIFIIPVIGILGYVVLWTIETAIRFNLRKQHGIGDTSKWDICDCCGIWFIVPGPCFACQEMRSVPKDYWDWYKAFNDKKFPAETQLEPCYTLM

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.