NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027427

3300027427: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A2-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027427
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072058 | Ga0207472
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A2-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 19681766
Sequencing Scaffolds: 25
Novel Protein Genes: 28
Associated Families: 27

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 14
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 1
All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000268 | Metagenome / Metatranscriptome | 1411 | Y
F006812 | Metagenome / Metatranscriptome | 364 | Y
F011965 | Metagenome | 285 | Y
F013520 | Metagenome / Metatranscriptome | 270 | Y
F017759 | Metagenome | 239 | N
F019867 | Metagenome | 227 | Y
F022685 | Metagenome / Metatranscriptome | 213 | N
F024822 | Metagenome / Metatranscriptome | 204 | N
F025757 | Metagenome | 200 | N
F026499 | Metagenome | 197 | N
F028554 | Metagenome / Metatranscriptome | 191 | N
F029148 | Metagenome | 189 | Y
F035133 | Metagenome | 173 | Y
F037212 | Metagenome / Metatranscriptome | 168 | Y
F049708 | Metagenome / Metatranscriptome | 146 | Y
F049951 | Metagenome / Metatranscriptome | 146 | Y
F050353 | Metagenome / Metatranscriptome | 145 | N
F052694 | Metagenome / Metatranscriptome | 142 | N
F057488 | Metagenome | 136 | N
F058528 | Metagenome / Metatranscriptome | 135 | N
F061986 | Metagenome / Metatranscriptome | 131 | Y
F065246 | Metagenome / Metatranscriptome | 128 | Y
F070441 | Metagenome / Metatranscriptome | 123 | Y
F077414 | Metagenome | 117 | Y
F082749 | Metagenome / Metatranscriptome | 113 | Y
F090606 | Metagenome | 108 | Y
F105676 | Metagenome / Metatranscriptome | 100 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207472_100111 | Not Available | 2175
Ga0207472_100129 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 2111
Ga0207472_100227 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1823
Ga0207472_100228 | Not Available | 1822
Ga0207472_100313 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1640
Ga0207472_100453 | Not Available | 1470
Ga0207472_100457 | All Organisms → cellular organisms → Bacteria | 1464
Ga0207472_100631 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1301
Ga0207472_101121 | Not Available | 1034
Ga0207472_101463 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 907
Ga0207472_101912 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 794
Ga0207472_101979 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 779
Ga0207472_102039 | Not Available | 767
Ga0207472_102172 | Not Available | 737
Ga0207472_102405 | Not Available | 695
Ga0207472_102447 | Not Available | 689
Ga0207472_102763 | Not Available | 643
Ga0207472_103046 | Not Available | 614
Ga0207472_103186 | Not Available | 601
Ga0207472_103393 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 585
Ga0207472_103468 | Not Available | 578
Ga0207472_103986 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia | 542
Ga0207472_104116 | Not Available | 533
Ga0207472_104351 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 520
Ga0207472_104705 | Not Available | 502

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207472_100111 | Ga0207472_1001112 | F017759 | LLFGVVVGLLNSECPQQYGAYESKYGAHGQHIELQGKVHGSASLVDALRLARNDPAPKAPVTRPAFPAGGIAYRTCAIDNRLIERLKKSEGPKILIVQNSCDTEFFMANLRQIPSILRG
Ga0207472_100129 | Ga0207472_1001291 | F025757 | SLFWLEMAVLIGCVVLSIRMRSRAAMWVALAIVAHCAVWLAIHDEEILIRLVASALVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA
Ga0207472_100227 | Ga0207472_1002272 | F070441 | MRISTLDISGTTTTVRRHEALLRFARYLLVYNLAIISWVVISKLSQMPSFEGLLEWLTGPTWKFAALVLGSVGASYLILAHRLWWAYAVIMLAQVVYFFSAGSAELAVMRASVLFAYGVITLPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYPWSRVAIDLGAPIVIAVICLVLANRLRHFGRQHFINRLAELKVLPAASTLFLRAFRDDQVRIRRASRNLFSSVFDLGRVPATLDELMLERLDGR
Ga0207472_100228 | Ga0207472_1002284 | F013520 | MHPAETYRRRAERAERDFENARDPKAKRFAQVAAQRWRELAELAERQETEGAPIPPHFRDASEAVHYAQAQGYLLYWKGTPAFTKRQRELGGRFATLPVFTRKGMTHVSLVPLDEQVKASEKE
Ga0207472_100313 | Ga0207472_1003132 | F024822 | VRQLFIRVAGVILCALALSGCVDSAGPLLSEAQPVLGERLRLQFYSLSKGTADEPEQATYKWDRGAYQRTGGGMTDISSFSVHPLARDIFVVQSAAAKRPGTFEYAVARRLVDGVYQVIAVDEADAGQVTRARFCKRASDSSCRIQTRNQLYAFARATAERRRGQGGLVLRLADGVAESS
Ga0207472_100453 | Ga0207472_1004531 | F057488 | MRALLLGTLLAIGLIPGATAQLAVGPVPIMSSINGIPITVSVTSWITVNSVGDETTVDARIFADLIDLQKKFSDVVDSFKRSARNCNRSADGQNPVVSFKSGSLWPRNDQLIMFVRGDIDIWSCSVGPPQSAIRWEKTKVSFLTLKLPVRRTWRNVKRNMDGTQPFHGTLLVSLAEKDGANVALRNTEPNLRLDGEPTFATNANLSLAKTDMNDKVSKTLRSAIDLTKLKDVLPKELQKFNMTVNSARFRDRGGHAIAEINLVGKASSTTTTSLLQQIDA
Ga0207472_100457 | Ga0207472_1004571 | F006812 | MLGTALLLLIILLVALPFGTAIACGALMQWAGNMISGWSSIGGLALGIYFFHKCMEVLAV
Ga0207472_100631 | Ga0207472_1006312 | F000268 | MLMRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFHRDLTTRPKTDIQSFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFASDGHGGSSGVTFGFDKNGRMTFPDSFDQ
Ga0207472_101121 | Ga0207472_1011211 | F049708 | MRVKWMLIGFGILYLVFALSFIPTDKILPRPIEESWELVRDILWAGMTIISIGLLEVMSPLATPFASASGLRRGDAAEILLAVILVAVAVFSALRLRRKDLTPRWRGTHTFFLLLALTLVAMVRFTLYSWSHFA
Ga0207472_101122 | Ga0207472_1011222 | F082749 | MDGESVDAAGKLGRKRLINHAMTLDAGLSLERLRHDIHPEVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS
Ga0207472_101463 | Ga0207472_1014632 | F049951 | MKAFIMACVAAVVIAVVGVVALNSVPDSAEKAFSSPTGVSLST
Ga0207472_101912 | Ga0207472_1019121 | F058528 | AFKRIGWKAKRDTSVNDVPDGLTVWPEDDVARAICNALTMATGALVAVREDQHLKDQGTYAIGVGYKLI
Ga0207472_101979 | Ga0207472_1019791 | F035133 | MAKVFTDRTSRTAIDARIAEERRNPVESWNELTAAREDNDLSFDKVEYEIKPAKWP
Ga0207472_102039 | Ga0207472_1020391 | F025757 | MSYSTPLFWLEMAVLICCVALSIKRRSRPAMWVALGIVAHCATWLAMHDEEILIRLIAGALVYLGLLKLSPRAARVWLCAGGALAGAFLLGITALSLLMSFPSRWSVFGLSIGATLLLLVCGLLIGFWVVYRWTEPAQFSESQPRQEA
Ga0207472_102050 | Ga0207472_1020501 | F029148 | KYRGKPVKGRYNVSDNLVTVVAWSGTKTARLGVLPAERLAKMLLRELADQGETK
Ga0207472_102172 | Ga0207472_1021722 | F037212 | AAKIGFADWRRAMTNNEIGAADISEPADAIAWRHFWLANVVVESDGRPLVRATRPALQPRKDSLSRRLQRLRHMRAILDAPRH
Ga0207472_102405 | Ga0207472_1024051 | F026499 | EKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRVHAAEAASDRREATVALSIKQIEFLNTELTAAAAERFRLVAAMQGEQRRQRSVFSQQKSILEDKLQEKEALAATQGTKIKQLEGVRDELDKRVRVIEALLASEREVAERKTRRPTEILGAAG
Ga0207472_102447 | Ga0207472_1024471 | F105676 | IAIGNPQDRKGAARQSPWGAQRLYVDDAHWQETVTMLARDADRIVLCIDASDGVRWEIAHVLQSGHAGKTLFFLNPSTDVQTRKRQLQEDFGVSAADLASIDVNRVLALRTTSTDQLILMVCDKPERDAYLVAARLAFEGRAGNLTMSAKGR
Ga0207472_102763 | Ga0207472_1027632 | F028554 | MRKAKQVRNRALSAGEGRRAIIVTAALTGLFLLAAFVTAGSFLSTDPQAMSSVAKVTPLPRTEGGEAASRVASIVMETDKKGRCEERQFDNRTGKMVSANYVNCDARLEPERDSTPSENI
Ga0207472_103046 | Ga0207472_1030462 | F022685 | TLMKAILGLAFGIALMSALQTVGVWSLQEHIKSQSDAGLPIGNTPVITNFDADALKNGILPKYGPIDTREGQRLAIEGAARRIDLQNRAVQKYLPR
Ga0207472_103186 | Ga0207472_1031862 | F065246 | VQQSADAAGGGGDRLGCKADNRVPKNANVRILAQSGAWSGVDVDEDGYADGQVRTADLTSDLATMPPWGSS
Ga0207472_103393 | Ga0207472_1033931 | F050353 | FRTLCERPHKAPKRAVRLSILGRAAQEARMRRALNVALVVAGVLLNVLASRSTTVANSQAAQRSAQNGTIVYGLHVALPSNMKNFPPELVPLP
Ga0207472_103468 | Ga0207472_1034681 | F019867 | SAATILAFASPLHAQGIEVFGGYSVNADYVQNRPAILVADQKVSPFFSHGSGPTGFEASFKHEVRNGLGIKVDVSGYSDTFPPGPAAYCQPDGSTAGIACGTGLTFQATGRAFYVTAGPEWKIRRGKRFAPFAQALVGIAYTRSTFMMSGSDVQYTNPFTGGVLLFTSAGFPPDRSIHYADAHADAGLALAI
Ga0207472_103480 | Ga0207472_1034801 | F090606 | PRPVAPEVKTPVAKVETAPIAKEDKPDAARPQADDGGKPGATPNCEKELRRTADLLRFFANRIQGGEDAQSVVADMRQQEKKISAVCD
Ga0207472_103986 | Ga0207472_1039862 | F061986 | AALRPNVMTTMMVFWALINPLGVFLAGPILDAFGTTPVLIGFAAVQTVTMAISSLAAARELGRKREEPALAPST
Ga0207472_104116 | Ga0207472_1041162 | F077414 | DALEIPPSRFGRLITLQDAARAAAAAWPGEIAAAV
Ga0207472_104351 | Ga0207472_1043511 | F011965 | IGIHALYGSAAGAWLGVGFVHHSWGANGEAVLNLSTNTWSLMTNANMYSSGHSSIGTRFVNGSGSINGMYSGGACLRNPSNLMDATRYTFIMQPPSTATGWHDGEHSSWFNASTNPHAPVLFSRYNISTPPRPVPWYGEIIAAATDGSNRVWRFAHNHNGGLGGYYGSAFAQI
Ga0207472_104705 | Ga0207472_1047051 | F052694 | MSTRFRSVLTLVVATLIVGTSTAWAGPLAKGANTHVVDDGTVSWTLPAGQCPAAPGGLTGSGERHRVTITKVNADGSTTIIINDVVRGTAWDATGTYKFVYENHSIDQAPAGGGVHQISMEDNFILNGNGSVGHLAVGFNWAWTYTDPNGPFDVLPLANLVER




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice