NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300026731

3300026731: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A4-10 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID3300026731 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0095510 | Gp0072043 | Ga0207523
Sample NameSoil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A4-10 (SPAdes)
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size19672863
Sequencing Scaffolds16
Novel Protein Genes18
Associated Families18

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
Not Available9
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales1
All Organisms → cellular organisms → Archaea2
All Organisms → cellular organisms → Bacteria → Proteobacteria1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1
All Organisms → cellular organisms → Bacteria1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameSoil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
TypeEnvironmental
TaxonomyEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO)terrestrial biomeagricultural fieldagricultural soil
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Soil (non-saline)

Location Information
LocationWisconsin, United States
CoordinatesLat. (o)43.3Long. (o)-89.38Alt. (m)N/ADepth (m)0
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000268Metagenome / Metatranscriptome1411Y
F002896Metagenome / Metatranscriptome522N
F003755Metagenome / Metatranscriptome470Y
F014308Metagenome / Metatranscriptome264Y
F019867Metagenome227Y
F020379Metagenome / Metatranscriptome224Y
F022740Metagenome / Metatranscriptome213Y
F034564Metagenome / Metatranscriptome174Y
F040358Metagenome / Metatranscriptome162N
F045732Metagenome / Metatranscriptome152N
F061030Metagenome / Metatranscriptome132Y
F064778Metagenome / Metatranscriptome128Y
F072223Metagenome / Metatranscriptome121Y
F081936Metagenome / Metatranscriptome114N
F084328Metagenome112N
F086321Metagenome111N
F088977Metagenome / Metatranscriptome109N
F097305Metagenome / Metatranscriptome104N

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
Ga0207523_100020Not Available1598Open in IMG/M
Ga0207523_100559All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales815Open in IMG/M
Ga0207523_100762All Organisms → cellular organisms → Archaea753Open in IMG/M
Ga0207523_100954Not Available703Open in IMG/M
Ga0207523_101343All Organisms → cellular organisms → Bacteria → Proteobacteria640Open in IMG/M
Ga0207523_101673Not Available601Open in IMG/M
Ga0207523_101678All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria601Open in IMG/M
Ga0207523_101853Not Available586Open in IMG/M
Ga0207523_102025Not Available574Open in IMG/M
Ga0207523_102059All Organisms → cellular organisms → Archaea571Open in IMG/M
Ga0207523_102509All Organisms → cellular organisms → Bacteria540Open in IMG/M
Ga0207523_102752Not Available526Open in IMG/M
Ga0207523_102872Not Available519Open in IMG/M
Ga0207523_102874Not Available519Open in IMG/M
Ga0207523_103105All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium508Open in IMG/M
Ga0207523_103208Not Available504Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
Ga0207523_100020Ga0207523_1000202F022740QAALNHAQAVDKAAGAGSRNVGFYHRRRAQILAARSNELGGLVSFISVGGQPVNTTRNGTIVAAFTFDDIVWTDIQQKTFAAATAQIRQIRPGSTPVLAATGTITPLADTEIKKLGWKIVQLKPER
Ga0207523_100559Ga0207523_1005591F064778SLNPSSKPMEKAAFSAACERGKPPGVPGAANGAGYSGDGRKGQEITNLGRQ
Ga0207523_100696Ga0207523_1006962F045732VRAAACGCVCAGHVVCNMRHRRRRVFIDTQKGRVSAGFTVAAEADAADVSMRLRERGWIAYRLRLEAEQYAWIATVIDWARRAA
Ga0207523_100762Ga0207523_1007621F081936PPNVEASVNPKVEQLAHVLEITLIVVPDSADLLNFNLRFLIIYRLKFTKTAIRTELIQVKINASIISLGT
Ga0207523_100954Ga0207523_1009541F088977MRTTSLAGVLSLSLSAPLFSQTFNQTATYIALMRSSVGGLPPVATSTLQGDLQDGVALAIRYGYVPSSSRMDLPSMNNFGLTAVLPTGTASTVSITGGLSSLSRGGSDAWIIGAGGDLRLTDWAFSQGRSAPHLRVAVNGQLDYSKPRESALIAGSVGLPLSIIRPNRPKQEMQVVPFVTPSFAFGN
Ga0207523_101343Ga0207523_1013433F003755SENSVMNATVIELPTVESLSDEIRGVVYERQTLRAVGAGREELERNRAELVRLQQALVDALIRRHLPANAA
Ga0207523_101561Ga0207523_1015611F072223RPNVKFATFFQTRFENYGTQCTGPSPSGLPVKCAGEDIAAVTSNQLKVLKPKKKRR
Ga0207523_101673Ga0207523_1016731F097305NEKSGKDKKSDSRSQRLAAELRENLRRRKAQVRGRKAPAAGEGTKRPGLPPRGG
Ga0207523_101678Ga0207523_1016782F014308MSGSTIGIQLIMPKTGTSPKATPEQQASEREAAQPVVKAPPPPGMGKIVDKVA
Ga0207523_101853Ga0207523_1018531F019867MHVTPLSAATILAFASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDVRNGLGIKVDASGYSDTFPPGPAAYCQSDGSAAGIACGTGLTFHAIGRAFYVTAGPEWKIRRGKRFAPFAQTLVGIVSTRSTFMMNGSDVQYTNPFTGGVLLFTSAGFLPDRSIHYADAHADAGL
Ga0207523_102025Ga0207523_1020251F040358MRIAIFALAILAGSGAAEARQVEVVSTSPRHIEIAAWCTAGSNCQQEASDVAQGYCHGPDYPRRALYVRSGLVERGFFSERVIFVYKCNRRSI
Ga0207523_102059Ga0207523_1020591F002896LYVIKTIYGMDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGEISPVEARKYLVNRKQDECRVSYDHNNVEYILRVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDKNPRLCKEGTIFHQELYNECVKTFKDLLIHSDAQHH
Ga0207523_102509Ga0207523_1025091F000268MLMRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFYRDSTTRPKTDIESFTVSMRTADDRWRAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGSSGVTFGFDKNGRMIFPDSFDSMSGE
Ga0207523_102752Ga0207523_1027521F084328MRPKNAAPSVSDACPDWEAPAFTKLPIASRTGSGANVANDAPRAIEPEAPGQPLSKLGFSFEMSFPM
Ga0207523_102872Ga0207523_1028721F061030DIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK
Ga0207523_102874Ga0207523_1028741F034564RSGRLSGYWYGAVCIAGLGFSLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFLAGNFIAAHLIGR
Ga0207523_103105Ga0207523_1031052F020379MLEVQRELAALKFIDGLSAHLKEVREPHKALRHALRDTREFFQATGGCIATLRAGRPQADLLFALPRRGAWDLGVLTRYIRHTHPPI
Ga0207523_103208Ga0207523_1032081F086321MNPSKKEIYKILERFTSQKGGILFILHKSFSSDSKPPQEQTSVVRDDERKSAIFEINIGTVAGLSMLCECEKVLADQVMSLDFLDSGIEGDVIWFGGILDKSGSEFIG

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.