NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027397

3300027397: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027397
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055665 | Ga0207463
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A3-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 24334690
Sequencing Scaffolds: 13
Novel Protein Genes: 13
Associated Families: 13

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1
All Organisms → cellular organisms → Archaea | 1
Not Available | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici | 1
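The phylogeny rows can be rolled up to the domain level to check them against the 13 sequencing scaffolds reported under Dataset Contents. The following is a minimal sketch, not part of NMPFamsDB: the names PHYLOGENY, domain_of and scaffolds_per_domain are hypothetical, and it assumes that the trailing digit of each row is the scaffold count (so the first lineage ends in "Rhodoplanes sp. Z2-YC6860" with a count of 1).

```python
from collections import Counter

# (lineage, number of scaffolds) pairs transcribed from the Dataset Phylogeny table.
# Assumption: the last digit of each scraped row is the scaffold count.
PHYLOGENY = [
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860", 1),
    ("All Organisms → cellular organisms → Archaea", 1),
    ("Not Available", 5),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales", 1),
    ("All Organisms → cellular organisms → Bacteria → Acidobacteria", 1),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae", 1),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria", 1),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium", 1),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici", 1),
]


def domain_of(lineage: str) -> str:
    """Return the third rank of the lineage (the domain), or the raw label if absent."""
    ranks = [rank.strip() for rank in lineage.split("→")]
    return ranks[2] if len(ranks) > 2 else lineage


def scaffolds_per_domain(rows):
    """Sum scaffold counts per domain-level label."""
    counts = Counter()
    for lineage, n_scaffolds in rows:
        counts[domain_of(lineage)] += n_scaffolds
    return counts


domain_counts = scaffolds_per_domain(PHYLOGENY)
print(dict(domain_counts))
print("total scaffolds:", sum(domain_counts.values()))
```

Under these assumptions the per-domain totals add up to the 13 scaffolds listed for the dataset, with the "Not Available" rows kept as their own group rather than assigned to a domain.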

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000569 | Metagenome / Metatranscriptome | 1018 | Y
F002896 | Metagenome / Metatranscriptome | 522 | N
F034564 | Metagenome / Metatranscriptome | 174 | Y
F034995 | Metagenome | 173 | N
F042319 | Metagenome / Metatranscriptome | 158 | Y
F052029 | Metagenome | 143 | Y
F055839 | Metagenome / Metatranscriptome | 138 | Y
F059558 | Metagenome / Metatranscriptome | 133 | Y
F067154 | Metagenome / Metatranscriptome | 126 | Y
F071766 | Metagenome | 122 | Y
F089138 | Metagenome | 109 | N
F090480 | Metagenome / Metatranscriptome | 108 | Y
F097297 | Metagenome / Metatranscriptome | 104 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207463_100024 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1951
Ga0207463_100359 | All Organisms → cellular organisms → Archaea | 1116
Ga0207463_100556 | Not Available | 994
Ga0207463_100631 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 963
Ga0207463_100716 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 926
Ga0207463_100903 | Not Available | 861
Ga0207463_100944 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae | 850
Ga0207463_101056 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 818
Ga0207463_101256 | Not Available | 769
Ga0207463_101537 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → unclassified Xanthomonadales → Xanthomonadales bacterium | 721
Ga0207463_101661 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiales incertae sedis → Pseudorhodoplanes → Pseudorhodoplanes sinuspersici | 702
Ga0207463_101872 | Not Available | 672
Ga0207463_102361 | Not Available | 624

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207463_100024 | Ga0207463_1000244 | F055839 | MRVVESMKHLVLASALILAGSTAAKPSDINFGETRTRFQIQNELTAWERLHPWDVDWRHTTLWQHGRALRTEFAPPGCTITRLVTTRSGT
Ga0207463_100359 | Ga0207463_1003591 | F002896 | MDKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGEISPVEARKYLVNRKQDECRVSYDHNNVEYILWVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDKNPRLCKEGTIFHQDLYNECVKTFKDLLIHSDAQHN
Ga0207463_100556 | Ga0207463_1005561 | F034995 | IAGIFLIGFPAWMMKEMSVPLPEYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFGLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALVSAGLALLFYAWRGSAGYRPPGSAGLGTPRS
Ga0207463_100631 | Ga0207463_1006313 | F052029 | ALNFRSILPVDSVAEQEINNTGALSEQFVTIGAASLALLVVAAIAVLIGMA
Ga0207463_100716 | Ga0207463_1007161 | F059558 | EAFYHPTDGLLLISSLKGEVKNKHNRNGWIFSCFAAHLFEQDLF
Ga0207463_100903 | Ga0207463_1009031 | F097297 | MLVTERMRSLGDHAKSSRQPPLKERKLTKSDASAATQDVINEKEFKKGAAHESAAVVIVDNPAPQTVVHENRNAAVAEITPDEEAAVLATVAAVLAKIDPQPPTADDNVTRPEPVREDAELSAAKRAIIHEWESWSALHSDELEDPNTSEYFFRHLQLKKADLLIFGLQNDLDAVRALIASQLR
Ga0207463_100944 | Ga0207463_1009442 | F042319 | YQTTTIGIADQSALLSRIAAWCTHFVEGVREGQEIAARYHALAQLSSPELARRGLNRQTIARAALTSY
Ga0207463_101056 | Ga0207463_1010561 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFLAIPYLVLPFVGALAAYMSRRMKGSVLERIESALFPVFAFVALFAVRIVYGLFFEGQPYTLPHFLGGFAVTLVFIVVGGLLLVLGAWPFCRPHLREQLP
Ga0207463_101256 | Ga0207463_1012561 | F090480 | VDCKMNKIFACIVFTATSGFSFAVATAMPLVPMGTEQARLTIPVADGCGFNRYRDARGICRKKYVITRHQGRQPVYTGCGGLNSHRVCNLYGHCWMVCD
Ga0207463_101537 | Ga0207463_1015371 | F089138 | MLKEGMAKAEIVTVLRRYAYDINRISALMPDGGTAKDAQSRLKQLKDAIHSDYKHRNAITRSAQLTPLEQSNLASAILDVFSALQSIGVNTTPGREWRNALFGADMYIQRYLNELRDSEKSVESDTTEWL
Ga0207463_101661 | Ga0207463_1016611 | F034564 | LLSHSLTVRRGIIWCDFPRHNGRTQTGGMNRSGRLSGYWYGAVCIAGLGFGLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFLAGNFIAAHLIGR
Ga0207463_101872 | Ga0207463_1018721 | F067154 | SQAGSDRPPRDASVVARSILLVVAAGSVIATSNYFAYMIGRKQVTYEELRRQVDLVAQFIDQKRVEEEPAPVALKQRQERLEQNASGLTGLTTSTINGKTTAIAPERAPAIPAALFDDTAVARPDHEVRSQPPNAITKHRDMKRRQKVSPASAAAKRAPAEPAIAAQPGQSTTATATPDVGLAGQ
Ga0207463_102361 | Ga0207463_1023611 | F071766 | MDEDFITLEVEEEGRGRLQFELPLDITDEEIAYIKSLVPRAESGLLELLDPDTGEVVFSC
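For downstream tools, the sequence records above are often easier to handle as FASTA. The following is a minimal sketch, not an NMPFamsDB export feature: the names RECORDS and to_fasta and the "family=" header tag are hypothetical, and only two of the thirteen records are transcribed for brevity.

```python
# (protein ID, family, sequence) for two records transcribed from the Sequences table.
RECORDS = [
    ("Ga0207463_1006313", "F052029",
     "ALNFRSILPVDSVAEQEINNTGALSEQFVTIGAASLALLVVAAIAVLIGMA"),
    ("Ga0207463_1023611", "F071766",
     "MDEDFITLEVEEEGRGRLQFELPLDITDEEIAYIKSLVPRAESGLLELLDPDTGEVVFSC"),
]


def to_fasta(records, width=60):
    """Render records as FASTA text, wrapping each sequence at `width` residues."""
    lines = []
    for protein_id, family, sequence in records:
        # Header layout is an illustrative choice, not a database convention.
        lines.append(f">{protein_id} family={family}")
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


fasta = to_fasta(RECORDS)
print(fasta, end="")
```

Keeping the protein ID as the first token of the header means most FASTA readers will use it as the record identifier, while the family accession stays available in the description.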




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice