NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027087

3300027087: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF017 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027087
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0085736 | Gp0057298 | Ga0208605
Sample Name: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF017 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 12254333
Sequencing Scaffolds: 17
Novel Protein Genes: 18
Associated Families: 18

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria | 2
Not Available | 9
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium liaoningense | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Dankookia → Dankookia rubra | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies

Alternative Ecosystem Assignments
Environment Ontology (ENVO): forest biome, solid layer, forest soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Harvard Forest LTER, Petersham, MA, USA
Coordinates: Lat. (°) 42.471116 | Long. (°) -72.17263 | Alt. (m) N/A | Depth (m) 0 to 0.1


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F003614 | Metagenome / Metatranscriptome | 477 | Y
F008837 | Metagenome / Metatranscriptome | 327 | Y
F012266 | Metagenome / Metatranscriptome | 282 | Y
F014678 | Metagenome / Metatranscriptome | 261 | Y
F015918 | Metagenome / Metatranscriptome | 251 | Y
F016708 | Metagenome / Metatranscriptome | 245 | N
F020059 | Metagenome / Metatranscriptome | 226 | Y
F020723 | Metagenome | 222 | Y
F021381 | Metagenome / Metatranscriptome | 219 | N
F022117 | Metagenome | 216 | Y
F023941 | Metagenome / Metatranscriptome | 208 | Y
F025875 | Metagenome / Metatranscriptome | 200 | Y
F029209 | Metagenome / Metatranscriptome | 189 | Y
F029291 | Metagenome / Metatranscriptome | 189 | N
F031985 | Metagenome / Metatranscriptome | 181 | Y
F047145 | Metagenome / Metatranscriptome | 150 | Y
F064528 | Metagenome / Metatranscriptome | 128 | Y
F085519 | Metagenome | 111 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0208605_100008 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3650
Ga0208605_100204 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1551
Ga0208605_100318 | Not Available | 1335
Ga0208605_100380 | Not Available | 1250
Ga0208605_100458 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1147
Ga0208605_100616 | Not Available | 983
Ga0208605_100899 | Not Available | 820
Ga0208605_100956 | Not Available | 791
Ga0208605_101273 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 688
Ga0208605_101350 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium liaoningense | 668
Ga0208605_101530 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 636
Ga0208605_101574 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 629
Ga0208605_102081 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae → Dankookia → Dankookia rubra | 561
Ga0208605_102191 | Not Available | 550
Ga0208605_102457 | Not Available | 524
Ga0208605_102617 | Not Available | 512
Ga0208605_102650 | Not Available | 510

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0208605_100008 | Ga0208605_1000083 | F029291 | MKDTRCAGLEMLKGRVKCKMRKTKSAVTNGETREEGVKSYPCGSAPVWKTLAKVLGRGQSGRSQCAAAMRRNDGAGQRHSGGWRFGLIDARVPALTAKRNCGFLPAVPGGLMRLTRQAFRVLRVATWRKRPLSSTIELPPRRQPLAVVTDRYKAVLKGGRYARLSGTSHLRPAVWGTQLDQSSPPGPRAKEMKGRSAGSSACHPGFGVVSSLAQAAVGPGRASSRLNPRQAKQHPSHTGNRRAGRLVHRQVRKVQP
Ga0208605_100204 | Ga0208605_1002041 | F012266 | CLALQLPFCFEVRALSDRLVSSTPERPCDDMCGLDSLSSKSDGDAADFLD
Ga0208605_100318 | Ga0208605_1003183 | F047145 | MTIVYVTTGAWGTGTGTPNSAAQVDGNFYTLDQLIVALNADLAEGKRIDSVTYTNTSMTFHFTDGTTQTIPLPVAVITYVGQWTNSTPYVVGQMFSVPARGMYQVLVNHTTPALPAVFDPNATDGSSNPLYSFWMPLYDINYDAAIFVPGSVQRSAGEVLFQAIAGRTMRLVSGSGHAYAYLDVGIGSGSNIVLSIQKNRVQIGTITFTAGATLDA
Ga0208605_100380 | Ga0208605_1003802 | F008837 | VMKVYNEAMSTRIIARVTSGKCEGAAFVFGLIESAIRKEATEILLTPIQELDMVRMDFGRLHCYYRQPDITRAQFAYLAIYLVLIGNVAAGEARTERLFRGSDEGLYNLQFGKMRLPDNDPAVVIHIEPVTQPTTAGGGWRKSRR
Ga0208605_100458 | Ga0208605_1004581 | F022117 | MPAHYGLGPDDSYGLKEARATAIEPNEQSAVDPTQMQSTAWRTLPKNVELMPQYQDFGLQPPSRLEAVAQ
Ga0208605_100616 | Ga0208605_1006162 | F020059 | MIKPDLALSQIAARFTQHDVEWSRGAFIIIDRRTTNPIARLRPIPDTDRFELFYWSNAKGRWTTFGNLGR
Ga0208605_100899 | Ga0208605_1008991 | F015918 | VSAQAKPISEVVKDFEQSLGNLRPSTKRVYVAGARAAIRAASLELWQSPSTTDLLASIGKSPIEKRARISPFLDFLGDGGSKQSISDEEIAALQNWVIQRLAKQMRSVKNPSITSRRDTALIAAICAAPAKGTPRKWPQNCLKITENEVLLWDASIEEPCFAISLRFWHAWRERLARPDQRRRAHHSGKDQSRFP
Ga0208605_100956 | Ga0208605_1009561 | F003614 | MKPSELPGTRSLYRLTNRALTQIKELWQYAGEAELPSESLLQAQVEALRILHIDVPLSAKRQFQLFCSEHFPGLADILDCLAADQVWTLLSMRVQLADSDQPHALSAAVRFGGLGRRYILVVFDDRPVGLLSHADGSGLFPEARSKLIALSWEEIRLALMLDIGSN
Ga0208605_101273 | Ga0208605_1012731 | F025875 | VIRFSAFLVVVAVGLLVAGVVTSKLLLVYIAIGVSGVALLAL
Ga0208605_101350 | Ga0208605_1013501 | F014678 | MIGSFLVERRGACSEIGFDHVGDDGARLGKIERCDSRIHLVETLAATQKLGIDCANL
Ga0208605_101530 | Ga0208605_1015301 | F016708 | METYFVSKWRKEHLEEPEDNIFANNPFLEDLLEWRHSPEGEQFAELADALCDLMDDVQLDAKQRQLIWPDAERLDLVQSIQRIQKLYPDFPGHEIEEVLLNWIDMGYDPKNYSQAQLNEFDSLTERWVADH
Ga0208605_101574 | Ga0208605_1015741 | F064528 | MDIYEVELCHRGRWEQQDARFVAARNADEAAYKVTGIQLRSEGE
Ga0208605_101852 | Ga0208605_1018521 | F021381 | CLRFFLVASALFKGRCLFCHAKQTSRQKANPWQSPSHQKNVVQKKPAPDKVENLAKEYLAELREVDVTFEKGTAVLEKYRAKYPELASYIREIVPGERRIEIRQRLEGEDASLHLFAIRVLEDEARYEVKLSGDAEHKTVKPPEWLSLLHDKLARIVAKILYARETSKPEKQDLNLSGPA
Ga0208605_102081 | Ga0208605_1020811 | F085519 | MINRRFMIVPPALLSGGHAEQLEQTSKDFPCLLLEMANMRVPRHSDFDSLAVSG
Ga0208605_102191 | Ga0208605_1021911 | F029209 | SCIEAQEGRAVTNKSAPAKFPLDEIDLLVGWRAGEDEYLFTKAVAMVREWLLDSEEKDRLRKVLKEILAEAGPPAGNRPLD
Ga0208605_102457 | Ga0208605_1024571 | F023941 | EGEGFELVWGFSCQVVVLGFAESSLFGAGKPFFIPSPAIRFAERAEGVKGPKR
Ga0208605_102617 | Ga0208605_1026172 | F020723 | MTLTPRQISAYLEFNDKLDRIDRANALVISAIGAQGDKQAIDKALRELSSG
Ga0208605_102650 | Ga0208605_1026501 | F031985 | ITGIDRLVRVYPSVAASLGDPNGQVDQAKPDGATAKADTDGGA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice