NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027472

3300027472: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-HINK07-D (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027472
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0091531 | Ga0207449
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-HINK07-D (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 18054749
Sequencing Scaffolds: 21
Novel Protein Genes: 21
Associated Families: 21

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Archaea | 8
Not Available | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 4
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001406 | Metagenome / Metatranscriptome | 703 | Y
F001437 | Metagenome / Metatranscriptome | 695 | Y
F001497 | Metagenome / Metatranscriptome | 683 | Y
F002588 | Metagenome / Metatranscriptome | 546 | Y
F005305 | Metagenome / Metatranscriptome | 405 | N
F005950 | Metagenome / Metatranscriptome | 385 | Y
F010126 | Metagenome / Metatranscriptome | 308 | Y
F011016 | Metagenome / Metatranscriptome | 296 | Y
F013020 | Metagenome / Metatranscriptome | 275 | N
F016471 | Metagenome / Metatranscriptome | 247 | Y
F021150 | Metagenome / Metatranscriptome | 220 | Y
F029190 | Metagenome / Metatranscriptome | 189 | Y
F033177 | Metagenome / Metatranscriptome | 178 | Y
F033996 | Metagenome / Metatranscriptome | 176 | N
F053116 | Metagenome | 141 | N
F070478 | Metagenome / Metatranscriptome | 123 | N
F080180 | Metagenome / Metatranscriptome | 115 | Y
F087961 | Metagenome | 110 | Y
F095285 | Metagenome / Metatranscriptome | 105 | Y
F095448 | Metagenome / Metatranscriptome | 105 | N
F099830 | Metagenome / Metatranscriptome | 103 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207449_100022 | All Organisms → cellular organisms → Archaea | 1880
Ga0207449_100070 | All Organisms → cellular organisms → Archaea | 1334
Ga0207449_100276 | All Organisms → cellular organisms → Archaea | 968
Ga0207449_100370 | All Organisms → cellular organisms → Archaea | 899
Ga0207449_100541 | Not Available | 813
Ga0207449_100978 | All Organisms → cellular organisms → Archaea | 695
Ga0207449_101028 | All Organisms → cellular organisms → Archaea | 688
Ga0207449_101083 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 682
Ga0207449_101138 | Not Available | 674
Ga0207449_101256 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 655
Ga0207449_101265 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 655
Ga0207449_101327 | Not Available | 646
Ga0207449_101341 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 645
Ga0207449_101437 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 634
Ga0207449_101454 | All Organisms → cellular organisms → Bacteria | 632
Ga0207449_101589 | All Organisms → cellular organisms → Archaea | 618
Ga0207449_102242 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 565
Ga0207449_102428 | All Organisms → cellular organisms → Bacteria | 554
Ga0207449_102627 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 543
Ga0207449_102662 | All Organisms → cellular organisms → Archaea | 541
Ga0207449_103017 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 523

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207449_100022 | Ga0207449_1000221 | F016471 | MIEADKFNAVKHATVAIGLANKDTKDVISSFGTGFFIGGEYIVSSAHIFSQCIKYNAQYKEKNKGTEGIYSAFNITTKGDQLELKTYHIIKAIRLPPVKEVKGFTGSVDLDIGIGKLDCHSDNFLHIKEPTQLKLYDEIVL
Ga0207449_100070 | Ga0207449_1000702 | F087961 | LISNSHSLKLEETAKGVRISVHVYTNDKQTAINEAIATYLETKQICEKEKIQIAPMEKSK
Ga0207449_100276 | Ga0207449_1002761 | F010126 | METIKGPASFSIPSMKSSIGSLMVCTCICTAALLLVIIESQTAIAIDTPLSNMSGTNSGSQLESNVTNRTSSDQIAYRNPQYGIFLLFPSNWTFSTSGLPEYTQIAGFYAPLQNLSDPIPARFTISVMSYQQNVSLKDFTNVTLSSLNQTNQIKILSSGPTTLAGRPGYQVVFSTLPNIGNPVSFEIMHSWTAVDNKIYVFQYSVESSKFDTYLPTVKQILDSLRINGKG
Ga0207449_100370 | Ga0207449_1003701 | F013020 | MSAHNNNNDKQEQERLKLDVLNKIFGWIEDKETKAVMINKYYNNKEHRAALKAFLDDMVKALDESTAETNSKEEIKRQLSYIIREIDPLY
Ga0207449_100541 | Ga0207449_1005411 | F053116 | VLGDIYIKMNILKRNRFDRRSDSTLIVLIAISLLYIMVTITALYPFDRLQIVAAATLQENNNNTKPVFLSTISTTIATGVGATGAILTVPGYLRARKQPKFLAAYLLKIHNKHDELCRYPKLSDKSKNEYRNFLDSLRCDIIYSLKNGDINENQYRLIEDRIAEYLNRLNYPR
Ga0207449_100978 | Ga0207449_1009782 | F005950 | MTYAKQKVKAKIHRTSDYDDKHTGIRDFPDEKAMLHYGLSKVHKVIIKKYAKDDEFMAIARKTRGIKFDYDMELYDYIGTGVRESKR
Ga0207449_101028 | Ga0207449_1010281 | F005305 | VEDLEKELGPIIENFQNLLKDAKAKKIDSLREDEDLKKEFNKLSKDVIEPVMKKFESYLKSKDVNSSVNVRSEIVSGKNPSIEFILHFKLTHESRYPNIKFSSFGEKISIQEDRLITKGEVRQDMMPEYYDKEQITVEFIKERLIRLIKSCFDKNWQSIYS
Ga0207449_101083 | Ga0207449_1010831 | F001406 | MTGGEEAAPRRGRAPLRQRSRKRWAIAGGVAAGWLVLEVMTGSAISATLVLVAIAALAGASLAGLRALGITSDHPWIQRMASRPWRDGQDVLRIAMRHLPDVFVITPSGSLLAPAVVELQLNPADLDSLREQMDLDVINSSLTEVYEEQVVAYQARPAAPGSRKDSISRLRPRARSTRRARPARSSCSGPAMSG
Ga0207449_101138 | Ga0207449_1011381 | F080180 | GWYWSTGHYGNYTVDMFVLTTSAAFNQQKTVDAYLAKGNGPSKVLVETMKGVTAHASGKNLTSPGGIHTYPEILTLQWKNGTNSATLTLTNPSIVASPPTIVNTNATIYGNPQYMRLQGTGTLNVQWGGNNETASAPAIWEVSYLH
Ga0207449_101256 | Ga0207449_1012561 | F021150 | DVDPQDGDTSFVLTVKHSNRHGRAYKATGTATILVDAKTRVRRQGAKTLGALAPNDRVHVTAKACKADLKNGGTPDLTARKIGAHPVAAPTQPSS
Ga0207449_101265 | Ga0207449_1012652 | F033177 | WMSGDDDLRKLLATLDPQARNDLRRVLIHDQADRDAIASQLLRYRDEHGDDWTDIIDMLTMHPEVRRLLVRVLGELEADHRTG
Ga0207449_101327 | Ga0207449_1013271 | F001497 | YSDGALTARGYRLPDGTARAGSWVLVDLATGNVQGVVPPGEFSRRFRPVDLFADVPGLPTYLGTVTASDGTALVDLALDRNPFMLANGPRRVARPHALIVPRSHRDGWSSATAAELEACHTAMTLVAAWYRSMDGGHVVFCANDSAPNRDYLRDVEAADGILGDGGTDAAVTKNPRQEVQHAHLHAFYAERGGTENHESSALDGYPVIGAGYRA
Ga0207449_101341 | Ga0207449_1013412 | F095448 | MVASPHTRQSDPAIPNIVCPKCGLRMQVAAIEPAGNDDRTVTFGCDCGHRYDLSERAIVALARDSSDRW
Ga0207449_101437 | Ga0207449_1014371 | F001437 | MAGFLFRLETVDGAPAEPPTFATAVPNWSPGDEIPLGHRALRVIGKRDDDADQPPVLVVEDD
Ga0207449_101454 | Ga0207449_1014542 | F070478 | MSTNARKKNENHVRIYDPDDLEWNAEHQVCPFKGTTLSAAQQVVVKLCKRIKQLGGELPMNWLDCELIYLERFAQKLQRVLDRKVLGF
Ga0207449_101589 | Ga0207449_1015891 | F033996 | MQFNFTAKDKHALKSLSGDLRDMFSMKRVEDRLENPEYRKVFDASFFRTVEGNYVDEMLNWVEDFRKRLDNQESKESKEQLYSETKELVDAGWIQNPNG
Ga0207449_102242 | Ga0207449_1022421 | F095285 | MSVKAGQAQHCRCRFRAGKPGEIAACGCGFWWRVSKRGRWYAISKRHAFRALTPGLMRELGYLEVTGR
Ga0207449_102428 | Ga0207449_1024281 | F002588 | SLIVGATLAVLIWFFPRWFHDHISDEMNAFVLGLVPLLAGASVFLVRWFVSPYPIYMQVRRKVDSLTDTKKEERAKAVQACFERSAAILKQHHSLLLSFHALSRAEGHRLESNKEIADVCDLIQEAGYDHPFEGISPGYVPEKDWLPFLKYVKHAPNINPEEGKDYIDAANRWRDDHGYPLPPG
Ga0207449_102627 | Ga0207449_1026272 | F011016 | RVLVVLDGDTIVVTMPGTSYSVTYRKLHDSPLLVASDMRDDPDSPINKFAFRARAWIAANDKARELGWIV
Ga0207449_102662 | Ga0207449_1026621 | F099830 | NADSQNLENDVNKFLKGYENYITSVSLMTFIKDQLPTFLAVVTIQEKIPPVKVETPNTK
Ga0207449_103017 | Ga0207449_1030172 | F029190 | VQLVSHLADADFRRYVTGTVDPETERHVRVCVCCALRLADAAMQAYWWERRGPLGRLVRLNNTQAVDELLTEIAREQRRDAA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice