NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026940

3300026940: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026940
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055680 | Ga0207521
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A1-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published? N
Use Policy: Open

Dataset Contents
Total Genome Size: 21917726
Sequencing Scaffolds: 22
Novel Protein Genes: 23
Associated Families: 23

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2
Not Available | 12
All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1
All Organisms → cellular organisms → Archaea | 1
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F005305 | Metagenome / Metatranscriptome | 405 | N
F010961 | Metagenome / Metatranscriptome | 297 | Y
F019338 | Metagenome / Metatranscriptome | 230 | Y
F021340 | Metagenome | 219 | Y
F031610 | Metagenome / Metatranscriptome | 182 | N
F033920 | Metagenome | 176 | N
F038225 | Metagenome / Metatranscriptome | 166 | Y
F038480 | Metagenome | 166 | Y
F042319 | Metagenome / Metatranscriptome | 158 | Y
F051273 | Metagenome / Metatranscriptome | 144 | N
F051439 | Metagenome / Metatranscriptome | 144 | N
F052029 | Metagenome | 143 | Y
F062518 | Metagenome | 130 | N
F067965 | Metagenome / Metatranscriptome | 125 | Y
F068796 | Metagenome / Metatranscriptome | 124 | Y
F071715 | Metagenome | 122 | N
F073765 | Metagenome / Metatranscriptome | 120 | N
F083399 | Metagenome | 113 | N
F085659 | Metagenome / Metatranscriptome | 111 | Y
F085779 | Metagenome / Metatranscriptome | 111 | N
F088579 | Metagenome / Metatranscriptome | 109 | Y
F089374 | Metagenome / Metatranscriptome | 109 | Y
F095448 | Metagenome / Metatranscriptome | 105 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207521_100118 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1820
Ga0207521_100251 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1516
Ga0207521_100785 | Not Available | 1040
Ga0207521_101103 | Not Available | 918
Ga0207521_101152 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 901
Ga0207521_101325 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → unclassified Bradyrhizobiaceae → Bradyrhizobiaceae bacterium | 849
Ga0207521_101547 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes | 799
Ga0207521_101720 | Not Available | 764
Ga0207521_102055 | Not Available | 710
Ga0207521_102155 | Not Available | 695
Ga0207521_102217 | Not Available | 687
Ga0207521_102367 | Not Available | 671
Ga0207521_102394 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 668
Ga0207521_102574 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 649
Ga0207521_102605 | Not Available | 645
Ga0207521_103053 | Not Available | 602
Ga0207521_103206 | All Organisms → cellular organisms → Archaea | 591
Ga0207521_103551 | Not Available | 563
Ga0207521_103648 | Not Available | 557
Ga0207521_103883 | Not Available | 543
Ga0207521_104109 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 532
Ga0207521_104205 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 528

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207521_100118 | Ga0207521_1001183 | F021340 | IARANVARSQFWHVAIVTLFFVIVLGASLFLGTVMVIGTFHGVDPSNELTANGRAGRIARTLQDGTLCHYMIFDNKTAQTVEDRIGRCDENKPKPKQEKPATFTWGK
Ga0207521_100251 | Ga0207521_1002511 | F083399 | MAEESAVSGDTRSWGFFATFVLGAIAFLAGQLAGMAALVGWYGFDLRNVPVLSQHGGAIILFIFVSAPVQVAILALAAGYKGNIADYLGYKLPRRGEVVLCLAILAAMIAVGDAMSWLAGRSVVDRFQTDIYHAANSVGQLPLLLAAVVVLIPI
Ga0207521_100785 | Ga0207521_1007851 | F068796 | VRKTKLLSTVAMALLLGGVAASAQGMGKEPPERAPAAQQSAPAEKVAPSIKAGEQKSPQTTSQAAPDSKPTGKGHETTGQSPKSQATDKPGAMDKDK
Ga0207521_101103 | Ga0207521_1011034 | F052029 | LALNFRSLLPANPLAQQEINNTGALSEQLVTIGAASL
Ga0207521_101152 | Ga0207521_1011521 | F051273 | MRRFIPLLILLGLIFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLEGVVNTLKT
Ga0207521_101325 | Ga0207521_1013251 | F010961 | NKLSFAVEAKMQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATLLFLMLHA
Ga0207521_101547 | Ga0207521_1015472 | F038480 | TAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFGSYVACVVSGTYCVDLCH
Ga0207521_101720 | Ga0207521_1017201 | F062518 | NNVVNSPGASPTGAAVVDPTGPNLTPDTPWVQGGQLVVHTGSSSFTGISWFFLGSESGFQVTFHSPALADFTEGNQNNSAYSGSPPLTVGFLGSKLNVGSGPIQFSLTWATGSVDNSALQPNPGSGVPNLIFSYATFVEPDLLMLTKSVSDIIIFALNDGGADNDHDDFIGAAIVTERADCECALQQATTPIPGALPLFGSVLGGGLLLHRWRKRRSARASRSFTASYLRG
Ga0207521_102055 | Ga0207521_1020551 | F095448 | GAACEPEANIERGFVMVASPHTRQSDPAIPNIVCPKCGLRMQVAAIEPAGTDDRTVTFGCDCGHRYDLSERAIVALARDSSDRW
Ga0207521_102155 | Ga0207521_1021552 | F073765 | MRQLIGKIPIKGQRVVEEVYLSEQIRCRECQKTAPIGVEVVTVQKDGPSKKVLKRAFYCRSHAGDYE
Ga0207521_102217 | Ga0207521_1022171 | F089374 | MTNILDSARASDEEPRLIVRKASHAPIWSVWAVLEGTPSEEIFEGSSE
Ga0207521_102367 | Ga0207521_1023672 | F051439 | MPYHVRITTDTDGLINDAKPYPTLAAAMQVAGLSLKTGSATTAWIEDVHGNVCANTDDVKKHCGLT
Ga0207521_102394 | Ga0207521_1023941 | F085659 | VTYDETWDDAPARRLPLAAIAVVVALGALLTSSYVLYRQSRVVDSERSARRAEIGRLEKQVTLLQSRGAALAGRVGSAEKTLKRRDSGIAPLASRVLKSVFTVETNTGLGSGFIAWRDADASYLLTANHVVEGHLIGDVTVSRKGGSWSGEV
Ga0207521_102574 | Ga0207521_1025741 | F067965 | GSVKVKADRSLQGRKVYLQKFSRFHEWVKVRGVILGNGSAKRFRLGLPFGGHRYAVRIFMSLNQAGAGYLDGFSQTVVVRTPRR
Ga0207521_102605 | Ga0207521_1026051 | F085779 | MKNSHCLLGTSGVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALELASELSPNQALNGMRPLPESV
Ga0207521_103053 | Ga0207521_1030531 | F019338 | GSMPAMRRTEAVLTAGAVACLAFVSIVYPVFAQDVDPRCKDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGR
Ga0207521_103206 | Ga0207521_1032061 | F005305 | MEDLEKELGPLIENFQNLVKDAKSKKLESLREDEDFKKEFNQLSKYVIEPVMRKFESYLESKDVNSSVHIQSQIVAGKNPSIEFSLHLKLTHESRYPNIKFSSSGEKISVQEERLMTKDEVSQDMIPEYYDNELITEEFVKERLIRLIKSCFDKDWQSF
Ga0207521_103551 | Ga0207521_1035511 | F071715 | MRKLAIGILAAAGVALSVPASAQGVWIGAGPVGVGVGVGPGYY
Ga0207521_103648 | Ga0207521_1036481 | F038225 | MKRLSLALLGAVGAFFVLTPVQAADYRVIQYNDTKICQVVDMAGPFKPISSKYTVLTKKSLPSFADAMKARADVGAKAKCIL
Ga0207521_103883 | Ga0207521_1038831 | F033920 | RSPIPYTLTGSEIAAVENGVRSVRRDLDNSAFRGFRATQHEDGQIDVCGWILPTGNLSEQPFIGTLFAGTFAPERIGGNEVDNAQIISDCQNRGARIA
Ga0207521_104109 | Ga0207521_1041092 | F088579 | ADHARTDTPVVKPETELNGFKVAKQLDPTDHHVRATIVVLEGDKKVVGFIPHPEEVPGIVFA
Ga0207521_104205 | Ga0207521_1042051 | F042319 | IGIADQSALLSRIAAWCTHFVEGVREGQEIAARYHALSRLSTPELARRGLNRHMIARVALTGY
Ga0207521_104251 | Ga0207521_1042513 | F031610 | MLAIGRAAGDVAGYPGKITLITIGMGEAAIAANNAIAQIRGEKVQPKYSTD




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice