NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026805

3300026805: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-BECK01-A (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026805
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0091523 | Ga0207507
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-BECK01-A (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 28934388
Sequencing Scaffolds: 40
Novel Protein Genes: 45
Associated Families: 45

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 3
All Organisms → cellular organisms → Bacteria | 7
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 5
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
Not Available | 5
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Thiotrichales → unclassified Thiotrichales → Thiotrichales bacterium HS_08 | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 2
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → unclassified Thermoleophilia → Thermoleophilia bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → Patulibacteraceae → Patulibacter → Patulibacter americanus | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → unclassified Propionibacteriales → Propionibacteriales bacterium | 1
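
The per-lineage scaffold counts in this table can be rolled up to a coarser level for a quick overview. The following is a minimal sketch, assuming lineage strings are split on the arrow separator; the helper name `rollup` and the chosen depth are illustrative and not part of NMPFamsDB, and only a few rows from the table are included.

```python
from collections import Counter

# A few (lineage, scaffold count) rows copied from the Dataset Phylogeny table.
ROWS = [
    ("All Organisms → cellular organisms → Bacteria", 7),
    ("All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria", 5),
    ("All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia", 2),
    ("All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria", 1),
    ("Not Available", 5),
]


def rollup(rows, depth):
    """Sum scaffold counts by the first `depth` lineage levels (hypothetical helper)."""
    totals = Counter()
    for lineage, count in rows:
        # Split the lineage on the arrow and trim surrounding spaces.
        levels = [part.strip() for part in lineage.split("→")]
        key = " → ".join(levels[:depth])
        totals[key] += count
    return totals


totals = rollup(ROWS, depth=3)
for key, count in totals.most_common():
    print(f"{count:3d}  {key}")
```

Lineages shorter than the requested depth, such as "Not Available", are kept under their full string rather than dropped.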

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000146 | Metagenome / Metatranscriptome | 1918 | Y
F001033 | Metagenome / Metatranscriptome | 799 | Y
F002205 | Metagenome / Metatranscriptome | 584 | Y
F002362 | Metagenome / Metatranscriptome | 567 | Y
F004145 | Metagenome / Metatranscriptome | 451 | Y
F004735 | Metagenome / Metatranscriptome | 426 | Y
F004992 | Metagenome / Metatranscriptome | 416 | Y
F005522 | Metagenome / Metatranscriptome | 398 | Y
F006189 | Metagenome / Metatranscriptome | 379 | Y
F006413 | Metagenome / Metatranscriptome | 374 | Y
F006720 | Metagenome / Metatranscriptome | 366 | Y
F007280 | Metagenome / Metatranscriptome | 354 | Y
F010205 | Metagenome / Metatranscriptome | 307 | N
F010485 | Metagenome / Metatranscriptome | 303 | N
F011422 | Metagenome / Metatranscriptome | 291 | Y
F012596 | Metagenome / Metatranscriptome | 279 | Y
F017411 | Metagenome / Metatranscriptome | 241 | Y
F017623 | Metagenome / Metatranscriptome | 239 | Y
F020424 | Metagenome | 224 | Y
F025803 | Metagenome / Metatranscriptome | 200 | Y
F029219 | Metagenome / Metatranscriptome | 189 | Y
F031613 | Metagenome / Metatranscriptome | 182 | Y
F032298 | Metagenome / Metatranscriptome | 180 | Y
F038819 | Metagenome | 165 | Y
F039571 | Metagenome / Metatranscriptome | 163 | Y
F040042 | Metagenome / Metatranscriptome | 162 | Y
F040373 | Metagenome / Metatranscriptome | 162 | Y
F041998 | Metagenome / Metatranscriptome | 159 | Y
F042529 | Metagenome | 158 | Y
F042600 | Metagenome / Metatranscriptome | 158 | Y
F046673 | Metagenome / Metatranscriptome | 151 | Y
F047230 | Metagenome | 150 | Y
F052916 | Metagenome | 142 | Y
F064139 | Metagenome / Metatranscriptome | 129 | Y
F069406 | Metagenome / Metatranscriptome | 124 | Y
F081180 | Metagenome / Metatranscriptome | 114 | N
F090794 | Metagenome | 108 | Y
F091829 | Metagenome / Metatranscriptome | 107 | N
F094209 | Metagenome | 106 | Y
F097907 | Metagenome | 104 | N
F099644 | Metagenome / Metatranscriptome | 103 | Y
F101950 | Metagenome | 102 | Y
F102828 | Metagenome | 101 | Y
F103539 | Metagenome | 101 | N
F105120 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207507_100200 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae | 1211
Ga0207507_100257 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1143
Ga0207507_100278 | All Organisms → cellular organisms → Bacteria | 1129
Ga0207507_100297 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1109
Ga0207507_100560 | All Organisms → cellular organisms → Bacteria | 922
Ga0207507_100837 | All Organisms → cellular organisms → Bacteria | 837
Ga0207507_100872 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 827
Ga0207507_100932 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 814
Ga0207507_100976 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 806
Ga0207507_101157 | All Organisms → cellular organisms → Bacteria | 772
Ga0207507_101180 | All Organisms → cellular organisms → Bacteria | 769
Ga0207507_101194 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 766
Ga0207507_101554 | Not Available | 720
Ga0207507_101925 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 682
Ga0207507_102092 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 666
Ga0207507_102191 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Thiotrichales → unclassified Thiotrichales → Thiotrichales bacterium HS_08 | 658
Ga0207507_102355 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 646
Ga0207507_102511 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 634
Ga0207507_102531 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 633
Ga0207507_102592 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 629
Ga0207507_102843 | Not Available | 613
Ga0207507_102904 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 610
Ga0207507_103210 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 593
Ga0207507_103211 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 593
Ga0207507_103275 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → unclassified Thermoleophilia → Thermoleophilia bacterium | 591
Ga0207507_103299 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 589
Ga0207507_103407 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 584
Ga0207507_103681 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 571
Ga0207507_103812 | All Organisms → cellular organisms → Bacteria | 565
Ga0207507_103926 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 561
Ga0207507_103961 | Not Available | 558
Ga0207507_103962 | Not Available | 558
Ga0207507_104210 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 550
Ga0207507_104706 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 533
Ga0207507_104722 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 533
Ga0207507_104724 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → Solirubrobacterales → Patulibacteraceae → Patulibacter → Patulibacter americanus | 532
Ga0207507_105180 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Propionibacteriales → unclassified Propionibacteriales → Propionibacteriales bacterium | 520
Ga0207507_105665 | All Organisms → cellular organisms → Bacteria | 506
Ga0207507_105717 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 505
Ga0207507_105936 | Not Available | 500

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207507_100200 | Ga0207507_1002001 | F101950 | ATGFGRGFLVAAGIALLTLAINIAVIRVSRADLAGAQQPVPAAPADPAQPAVTAKDTNDRRDPPMTTPASKPG
Ga0207507_100257 | Ga0207507_1002572 | F029219 | RSELEQRRDELTRSLPGLSDTRADSYRRNSDALARDRAGVKHEQPDEMLRLAKAEVVADEAIRDVQRELRDLDAEIKLAPSDGVGARLRRALSRG
Ga0207507_100278 | Ga0207507_1002782 | F001033 | MNFVSRTKSMAVPHQILGASTNEKRELLMCGHSLIARLTIAVFVFLMLGVTSVVHAERPDSTAGTSNAGTRKLFINPSSTSVALRGKASLIVSPLTHRDGNYVGDYQLKVKPYFFKSEKGSLLLAASDDAVRKLQTGTAINFTGKAVTHKDGRTHIVLG
Ga0207507_100297 | Ga0207507_1002972 | F040373 | LRGPRLALGVAGTACLLLLAPLAGASNSSPYADGVADAPAAAPDLTAVDVSNDDAGNVFFRVSIPNRAALAYTDLISLFVDADGKTGTGCARGAFGAEYALDVLSDRYVFGRCVRGSWDFTRRPASFGGSFAGSTLTLKANRRDLGGASRFHFRIGAASATGADPA
Ga0207507_100560 | Ga0207507_1005603 | F004735 | VRRPWAWAVGAFALLSFLRRRREPAADPRADELRRKLDESRSIVEERDEFEGGELTVDQAEPAPEDPESRRRVVHDSARA
Ga0207507_100837 | Ga0207507_1008373 | F002205 | KGSDPLRSAGCTMAQPPLDFALLRRLREVLDDRPATESELRMLHEQAEAWARTVSGQLESSERRIRRLNQNPASSLAQIAAELRRVEQLRPQLIEVRQLLADLEARARQVRTEWLLSQATSRRSTGHQP
Ga0207507_100872 | Ga0207507_1008721 | F040042 | MIHDDIRTLLEAPSSGEEAPSLDHIEHTLTEGYARALALEAERWRIERQIAGVAARLGDEVTDEDASELAQLGRRLSSADGDLNKLRTLLVALRTRADEVRTAA
Ga0207507_100932 | Ga0207507_1009322 | F042600 | MSIHELDISWAMTASERRRLHWELMSCDEVSGVFLTNRDDVLAVLYSGERRGFEVLARALAPESSEPVR
Ga0207507_100976 | Ga0207507_1009761 | F081180 | MNRAETEDGSYASARNRKLLAEWAAKAGENEYAEPALNPCQTTAYAFFTGDQRRSQRHRLTWERMR
Ga0207507_101157 | Ga0207507_1011572 | F097907 | MGKEKLKANFKRDRSIPNRNTTRPENVSRKEIKQMAHHAPVRRAASKARGF
Ga0207507_101180 | Ga0207507_1011802 | F004145 | MAKYTLLVDGERVANVESADDVRAWLVTYRDEHAEDDPSAAHVQIIERGALAWITGGKLVDRLRFL
Ga0207507_101194 | Ga0207507_1011942 | F091829 | VDELGAGFWLKMIGVIFAIGIAGLVVFFIIGAAWYAWGFLGAFLFVVAVLAVIAWSYDRSHARRYDDLSDA
Ga0207507_101317 | Ga0207507_1013171 | F000146 | MPVARVVTFDGVSKDRMQEMQKEMEDGQPPEGFPSAEMLVLHDADAEKSVVVIVFDNDEDYKKGDEILSAMPAGDTPGQRTSVSRYDVAMRMSS
Ga0207507_101554 | Ga0207507_1015542 | F039571 | MLITQVVTLDVIGHGITLSEREAEELRHAAAADAGRSSARRDLSLLLERGLRTKATVALSRAEARELAELLAAGDASTNFALLQEAFLLALA
Ga0207507_101925 | Ga0207507_1019252 | F025803 | YWKWLGHRLRLSQPGRVLFTLALLVGVTEMVIARAQGAREGGAPLVVLTLLLIGVVTLDAVGRGVVAVVRRLILDGGLRRL
Ga0207507_102092 | Ga0207507_1020921 | F006413 | MLRKISTLGALVVVAVSFGVGNAFAAPKNSDSPAGCHSHGSWTDGFSDGRD
Ga0207507_102191 | Ga0207507_1021911 | F041998 | IHVDHEMSIALSAGQNILELLEERLSPLRVGPAEQLLGLLPRQLAAVQDRADRLATAPQPKALADPMDEAAQGPARGWISPFEGWGGRRALGGADHLAELGFALWAKRGRRPPVRRNASASGPPWL
Ga0207507_102355 | Ga0207507_1023552 | F046673 | MKRVVKLAVLAGAAVTALAFAGNALAVQKLSVSQSATSLTIKVTQAQSDPQPAKIQIFVPTGYSLNTSAAPGTTIGTTAG
Ga0207507_102511 | Ga0207507_1025112 | F020424 | QLSLARAGGLGATVATREFFHAPGGIDKFLFAGEKRMAGGTDADFNVSFGRASVINRAARANDVGLLIIRMNVRLHIQKRANNLAVEGKIRK
Ga0207507_102531 | Ga0207507_1025311 | F010205 | KISPMENCGLRGAAAFFMSASPTLSHDAPVHEPPDRRFPWQEQQSPRLCSVGRPLPKNTIRFKCVDCGRPMEARAQDGGVDTNCPHCAAPLTVPRIPPNPYLRPIKNIVRQMKGVPLPFPYQPQLIQLAVLTSVLALFGVLFITIGVVTQAAGVFRGLILDTQKHLREGSTVERSAQAISIGFYPLLFLPLWTIQLPFSLVGSIWSSRRL
Ga0207507_102592 | Ga0207507_1025922 | F038819 | MATYSGNASLSQAVKDRVLSTFQQTVTLYKEGRSAEVVEGCTLILRMDPTFDPARKLLEKTRNPNAPIDVDSLLPAPPGSDVLNEARAALAARDFQR
Ga0207507_102843 | Ga0207507_1028431 | F017411 | PMPAGTWYELPVNPNDTQAQVQECLNEPLFFWDVPGAPLPGIQIPPQTLAQLALAKMNVPQAGQMKLSPRSGNSYSNLPTFSRVTLSFRPEFGPGGLPYVTDNAQLGAQGATVWVEATPLQLSTNDNAARLETAGCGYLGSTEMVRNPGAVARTGANGTADCGVTFRQPGQWNITATLTWRACWAVGVQDGPPPAGCNPVPGAA
Ga0207507_102904 | Ga0207507_1029043 | F031613 | MPITITLLPSNEDGSGSNLFYAIGALTFSGSYPTGGDTLDFTTVA
Ga0207507_103210 | Ga0207507_1032101 | F002362 | VFRQIEAMPFARESSELFDRPSINLGRIAGGKYLKPIVAFLMKLPLVGRLMRRASQAALERQNPELASAVKKMERAGVARDPQRAAQAISRLTREERAAYLDATTEQSDAVPMNRQMKRQMERAKKGRRR
Ga0207507_103211 | Ga0207507_1032111 | F090794 | LDFRSSCDGSGPFVPARRAVDDGDDSLRRATGGGHASASDGNAFPSAADGDVLDRHFATTPAGRDATVNCSYATAERDAAEFFVK
Ga0207507_103275 | Ga0207507_1032751 | F005522 | MNNRRPYSLLQIRELGHGIHDLPGCDTSDFRWFALRLGPHEARLPIARSAEMLLLVVEGSLALVWRDATRMSTQLTEGDTASTNPPAVEVLLAGAEGATVLVLVSRH
Ga0207507_103299 | Ga0207507_1032992 | F094209 | MTEPSSARTPYLLLWSTGIAALVLSMAAFALWGTTGARTLFDMIVALC
Ga0207507_103347 | Ga0207507_1033472 | F032298 | MLADQLDYIIGVDPHRDSHALAVVQVVSGVVVFESTIDANSDGYAQ
Ga0207507_103407 | Ga0207507_1034071 | F103539 | ARIEGGIAASEILMSIEDELLNLARLCRSQARLTKDRAAKQSLRKLGDHYESEAQRLQEQLSADLQKSE
Ga0207507_103624 | Ga0207507_1036242 | F007280 | LVVDYPARLPQQFGDLAIAVAAVLPGKLDNIGRETLLVVTTARDLALCRAMLPERRTGATLGDMQLRSDLLNAGAATRGA
Ga0207507_103681 | Ga0207507_1036811 | F017623 | LPRWKYVHLHGLIHMLPIKEVGMSASREDPTAAAGGDEIQLSAAGDHVKGEFRCADCGYAITVCRELPPCAMCGCESWQAGLWRPFGRALATSTLARD
Ga0207507_103812 | Ga0207507_1038121 | F012596 | MLALKYLLMILGLGLFGSSGALVAYDIYLAEQLRRLLARSKTSEPGGETGITAHRPFGPVRWRL
Ga0207507_103926 | Ga0207507_1039261 | F047230 | RRYARAWALISPGLATGQSYAEFVAGYTCTGTERPAKLSQSGHQVSFRLTVIDSCTGAAQYYTGTDIVRGGKIVAAHVTRTS
Ga0207507_103961 | Ga0207507_1039611 | F064139 | LTFGMRVSAVICLGGAVAAAALIRKYRHVEQVQPLAEAA
Ga0207507_103962 | Ga0207507_1039621 | F010485 | ATAFDLKETLKDLMPCKYAAIRLCERHQPVSAAALWKCGATLAAAPQEEIGKRCMAVLKRYGAALN
Ga0207507_104210 | Ga0207507_1042102 | F042529 | MPDAIVLAIHAEHADAILDGTARFEHRAFPPKRLPARAYLAVVEARAVVGQCDLGAPTRKSAKGWALPVTNARRYRAPRALSEFGLAK
Ga0207507_104706 | Ga0207507_1047061 | F004992 | MSNETPAAIDPDMFATVFSQNWDNARHIKSERLSFMNAYSVICAGVLALLQSVQASDLIRIALLFFMTLFSLIGLLTSLRLKGELEECLAKIEAMSVQARVNDFVALGQLEGRSSRYPRFRWIFPIFYAMTTVGFITLIVYRLVTGEAI
Ga0207507_104722 | Ga0207507_1047222 | F006720 | FLRSLGQTVLVAGPTRLEVEIPRSELEIYLRVWAVLYPDAEVQLGGNGDDEAPAA
Ga0207507_104724 | Ga0207507_1047241 | F011422 | AGNSEEAIAFADRALRETLPALQEAEVRLSIAGMFAISPEIRISAGRLALNLPDLPASLRARHLACLFHNLVTAGRIEESREVLDETRAAVASADDARASFTLRVAESAFEYADDRFDLSLELITSAYRDGIFAGDDQRLRLAHMWHGELLSVADRDEEAFAIAADGLAAAQRDRQ
Ga0207507_105180 | Ga0207507_1051801 | F006189 | VAILAISAGSFPDVVAYPEHGYAWVATRDPDPELLGQGDDPGIIARFEIPIE
Ga0207507_105299 | Ga0207507_1052991 | F105120 | LQKSMVGIRVSAVSSGTKGQYGATRAFNTDDDRLHANTWYAILGATVQTQVCTISLIGPDWGGQRIGLPAGSPQDDSGSWFLDQSWKWSQLPCIPCFNSNNRGNIFVSIADSGTSVTPQCDFLLYELTGQPGGT
Ga0207507_105665 | Ga0207507_1056652 | F102828 | HAAAVASAPRHYRAVRLHRGMIRGVLLGRGHLVIARTRYLDFTQQPAWYGLRQFG
Ga0207507_105717 | Ga0207507_1057172 | F052916 | MSGAFPPKVVAAGEGKTVMLFGVRFGYKVVSGDSG
Ga0207507_105867 | Ga0207507_1058671 | F099644 | MREGDQSADVGSVVEIACECGRSGCAEVITLPRCVYEEAQREPRRLLLAPGHELPGFSPLIRCDSF
Ga0207507_105936 | Ga0207507_1059362 | F069406 | PVVPPLGGFARVISFFKMFAEVFLDAQREAAAAHKRYPFTDW
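
For downstream tools, rows of the Sequences table can be written out as FASTA records. The following is a minimal sketch using two rows copied from the table; the helper name `to_fasta` and the header layout (`family=`, `scaffold=`) are assumptions made for illustration, not an export format defined by NMPFamsDB.

```python
# Two rows copied from the Sequences table:
# (scaffold ID, protein ID, family, sequence).
ROWS = [
    ("Ga0207507_105717", "Ga0207507_1057172", "F052916",
     "MSGAFPPKVVAAGEGKTVMLFGVRFGYKVVSGDSG"),
    ("Ga0207507_103961", "Ga0207507_1039611", "F064139",
     "LTFGMRVSAVICLGGAVAAAALIRKYRHVEQVQPLAEAA"),
]


def to_fasta(rows, width=60):
    """Format rows as FASTA text (hypothetical helper; header layout is assumed)."""
    records = []
    for scaffold, protein, family, seq in rows:
        header = f">{protein} family={family} scaffold={scaffold}"
        # Wrap the sequence into fixed-width lines.
        lines = [seq[i:i + width] for i in range(0, len(seq), width)]
        records.append("\n".join([header] + lines))
    return "\n".join(records) + "\n"


fasta = to_fasta(ROWS)
print(fasta, end="")
```

The protein ID is used as the record identifier because it is unique per row, whereas a scaffold may in principle carry more than one gene.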

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice