NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026755

3300026755: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01K5-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026755
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0091555 | Ga0207632
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G01K5-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 17624739
Sequencing Scaffolds: 16
Novel Protein Genes: 16
Associated Families: 16

Dataset Phylogeny
Taxonomy Group | Number of Scaffolds
Not Available | 10
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Paraclostridium → Paraclostridium benzoelyticum | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F003406 | Metagenome / Metatranscriptome | 488 | Y
F004935 | Metagenome / Metatranscriptome | 418 | Y
F015943 | Metagenome / Metatranscriptome | 251 | Y
F023981 | Metagenome / Metatranscriptome | 208 | Y
F034995 | Metagenome | 173 | N
F037297 | Metagenome / Metatranscriptome | 168 | N
F038003 | Metagenome / Metatranscriptome | 167 | Y
F041998 | Metagenome / Metatranscriptome | 159 | Y
F056406 | Metagenome | 137 | N
F063479 | Metagenome / Metatranscriptome | 129 | Y
F066980 | Metagenome / Metatranscriptome | 126 | N
F070071 | Metagenome / Metatranscriptome | 123 | N
F070441 | Metagenome / Metatranscriptome | 123 | Y
F072104 | Metagenome / Metatranscriptome | 121 | N
F074725 | Metagenome | 119 | N
F080568 | Metagenome / Metatranscriptome | 115 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207632_100185 | Not Available | 987
Ga0207632_100201 | Not Available | 974
Ga0207632_100311 | Not Available | 904
Ga0207632_100400 | All Organisms → cellular organisms → Bacteria | 856
Ga0207632_100569 | Not Available | 771
Ga0207632_100961 | Not Available | 680
Ga0207632_101139 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 648
Ga0207632_101215 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → Liliopsida → Petrosaviidae → commelinids → Poales → Poaceae → PACMAD clade → Panicoideae → Andropogonodae → Andropogoneae → Tripsacinae → Zea → Zea mays | 638
Ga0207632_101293 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 627
Ga0207632_101481 | Not Available | 604
Ga0207632_101504 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Clostridiaceae → Paraclostridium → Paraclostridium benzoelyticum | 602
Ga0207632_101610 | Not Available | 590
Ga0207632_102280 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta | 539
Ga0207632_102513 | Not Available | 523
Ga0207632_102712 | Not Available | 512
Ga0207632_102747 | Not Available | 510

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207632_100185 | Ga0207632_1001851 | F063479 | MSARNWAETHWIARSTRGGGRPKSGEVDLGPPVKSGRVRGLGELHGLLAELAEAQVGLEGGWSGLATAAVALAAMAGGNTLAGAKERWLAGEGECGAKRGAPGEAL
Ga0207632_100201 | Ga0207632_1002012 | F066980 | MVKLDLPADAHWIPADEISQAAPDWDGHARHFGSLRRAIDFVMQELTIAARANARITTQDGNLTIEQIEKLQ
Ga0207632_100311 | Ga0207632_1003111 | F070441 | SRLLSCRAGSRMTSSRFAAGQARRNSRFAQRPPGSRDVASRRMRISTLDISGTTTTVRRHEALLRFARYLLVYNLAIISWVVISKLSQMPSFEGLLEWLTGPTWKFAALVLGSVGASYLILAHRLWWAYAVIMLAQVVYFFSAGSAELAVMRASVLFAYGVITLPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYP
Ga0207632_100400 | Ga0207632_1004001 | F041998 | TAEIRVQVDVRLIDVDQQVLVALGAGQHALELLDKGFPPLRVGPAEQRLGFLPGQLEAVQRCSDRLATAPQPEPLASPMDKAAQRPARGWIGPNYGRRRGRALGGVDDLAEFGSAVRAKKGRRPPVRRNASASGPPWL
Ga0207632_100569 | Ga0207632_1005692 | F023981 | MTDPGLDLHEWESQWAQLQEDAADSPEEALPEIVRLIESMLTDRGFDLENPVIVEDESRDVVADFLAARDISRAAETTKLDQEDIQTALDD
Ga0207632_100961 | Ga0207632_1009611 | F080568 | MAATPSIKIVKTFPYRGTTRTFSNRYHFNGGLPADNAHWTTLSDAIVTAEKAIYANFVTIVSGVGYAAGSEVPVFSKTYSTVGTGATFGTDYQAGDVAALIRYSTTARTSKNHPIYLFNYYHGVLGDGTNAGKPHAAQVTAYGTYAAAWIAGFSDGTTSYVRAGPNGATATGSLVETYLTHRDLVH
Ga0207632_101139 | Ga0207632_1011392 | F056406 | WHPSGWTALRKEATAPKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGVVNCYEAVSRSEKEQRGGMSSLRINAIGKAFNRRDE
Ga0207632_101215 | Ga0207632_1012152 | F038003 | SHPTSFGFSLDPPSGFALAGALVEASPNPLGFRMRSPWDRLTDVSTYGPSGSEEDDEPSFCWDFSGLGNPSAMRDFMTACDYCLSDCSDGSRSLGDEDCGPSRECFHVDLGGPGEGNHLGIPENGDPPRPVPRVDILRELAVVPVPAGGHDP
Ga0207632_101293 | Ga0207632_1012932 | F015943 | MMGSRAAKLLVVIALGGGCASGVPAGAQTLKTEPAGDQLACGQKVLVENNTCPADQILEVTGSCLKTAPEIDVVRTPRGTQYNCVKRKRE
Ga0207632_101481 | Ga0207632_1014811 | F004935 | ASMSRPLKVQIVERARELVADEQNWCQRHLAQDVNGASVSPTSTAAVKRCGLGAVIAAAYQLTHDYDAAHRLGHEALRPHYSPATLIYLNDIRGHSAVLALFDEVIAAR
Ga0207632_101504 | Ga0207632_1015041 | F003406 | MTEWFYVKNDLSAREDIKGIIMRPIWQSFGLRRPKVEMNEAAEECQRAFGVVCSFIGTRDLVQEHIAFRVWPLAEKWEMPQETIKEADEGKLVRLKYTFKFGDKFIEPDDEWLKSIDNLSDELLGTYSKAEDNAMSAAFGGRKKKRLNRVFDAIGFVYPDYCYP
Ga0207632_101610 | Ga0207632_1016101 | F070071 | LVAGTSAGFWYLLPRNGEEHPLVQNPGVGSMVTIVILTLFIVGVAVLCEGLLG
Ga0207632_102280 | Ga0207632_1022801 | F072104 | AEVPDLSASRDNSDLAEDDPCDPIRTSKPEAPMQSLNQLPVVTDLADARSA
Ga0207632_102513 | Ga0207632_1025132 | F034995 | MTKILALIAGIFLIGFPAWMLKEMSVPLPQYRKGGYADYVLLAYELLAVYGSVLLGIAVHVGRAKWSGQSLFPFDLFKAGMWGVLVAASAYFVAGGVYGLISYGAFDKVIGGTLWGLLAIFWGVFGALV
Ga0207632_102712 | Ga0207632_1027121 | F074725 | ANRTRTQVGGSPVRSWLAIPCLLVLLLGCTAAPEPLAPYTLSNAEVTAVVRGFYSSVKDLDAPSFRSFKAVRSSGGEVYVCGGMSSRNAGEQVFIGTLSSGRFVPDRIGKDQYSTGEVLAKCQERGIPIQ
Ga0207632_102747 | Ga0207632_1027471 | F037297 | AIGRAFYVTAGPEWKIRRGKRFAPFAQALAGIVYTRSTFMMSGSDVQYTNPFTGGVLLFTSAGFPQDRSIHYADAHADAGLALAIGGGFDVRLSRRIGLRAAMDYDPTFLVRPVFPDLSPDAQGRVVLRPASNERQRQDHARLSIGMGWRIR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice