NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027455

3300027455: Soil microbial communities from Kellog Biological Station, Michigan, USA - Nitrogen cycling UWRJ-G06K3-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027455
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072111 | Ga0207504
Sample Name: Soil microbial communities from Kellog Biological Station, Michigan, USA - Nitrogen cycling UWRJ-G06K3-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 24196465
Sequencing Scaffolds: 24
Novel Protein Genes: 26
Associated Families: 26

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1
Not Available | 10
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium liaoningense | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Archaea | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → unclassified Myxococcaceae → Myxococcaceae bacterium | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: USA: Michigan
Coordinates: Lat. (°) 42.4 | Long. (°) -85.37 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F002275 | Metagenome | 576 | Y
F002896 | Metagenome / Metatranscriptome | 522 | N
F010961 | Metagenome / Metatranscriptome | 297 | Y
F013219 | Metagenome / Metatranscriptome | 273 | Y
F015863 | Metagenome / Metatranscriptome | 251 | Y
F017166 | Metagenome / Metatranscriptome | 242 | Y
F018256 | Metagenome / Metatranscriptome | 236 | Y
F020731 | Metagenome / Metatranscriptome | 222 | Y
F022905 | Metagenome / Metatranscriptome | 212 | N
F022944 | Metagenome / Metatranscriptome | 212 | Y
F025530 | Metagenome | 201 | Y
F025757 | Metagenome | 200 | N
F028554 | Metagenome / Metatranscriptome | 191 | N
F029148 | Metagenome | 189 | Y
F034616 | Metagenome | 174 | N
F037759 | Metagenome / Metatranscriptome | 167 | N
F049092 | Metagenome | 147 | N
F053430 | Metagenome / Metatranscriptome | 141 | Y
F056406 | Metagenome | 137 | N
F061030 | Metagenome / Metatranscriptome | 132 | Y
F081318 | Metagenome / Metatranscriptome | 114 | Y
F081321 | Metagenome / Metatranscriptome | 114 | N
F084687 | Metagenome / Metatranscriptome | 112 | Y
F090709 | Metagenome | 108 | Y
F094081 | Metagenome / Metatranscriptome | 106 | Y
F094116 | Metagenome | 106 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207504_100049 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1996
Ga0207504_100077 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1772
Ga0207504_100167 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1482
Ga0207504_100347 | Not Available | 1230
Ga0207504_100363 | Not Available | 1223
Ga0207504_100704 | All Organisms → cellular organisms → Bacteria | 987
Ga0207504_100839 | Not Available | 931
Ga0207504_100864 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium liaoningense | 923
Ga0207504_100931 | All Organisms → cellular organisms → Bacteria | 901
Ga0207504_101131 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 841
Ga0207504_101243 | All Organisms → cellular organisms → Archaea | 811
Ga0207504_101377 | Not Available | 781
Ga0207504_101701 | Not Available | 726
Ga0207504_101986 | Not Available | 686
Ga0207504_102117 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → unclassified Myxococcaceae → Myxococcaceae bacterium | 672
Ga0207504_102207 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 664
Ga0207504_102254 | Not Available | 658
Ga0207504_103144 | Not Available | 585
Ga0207504_103901 | All Organisms → cellular organisms → Archaea | 544
Ga0207504_104547 | All Organisms → cellular organisms → Bacteria | 517
Ga0207504_104777 | Not Available | 510
Ga0207504_104803 | Not Available | 509
Ga0207504_104871 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 506
Ga0207504_104895 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 505
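
The per-lineage scaffold counts reported under Dataset Phylogeny can be recomputed from the scaffold taxonomy assignments. The following is a minimal sketch in Python, not part of NMPFamsDB itself: it assumes the rows have already been split into (scaffold ID, lineage) pairs, uses only a handful of rows copied from the table above, and the helper names `tally_lineages` and `deepest_rank` are hypothetical.

```python
from collections import Counter

# A few (scaffold ID, lineage) rows copied from the Associated Scaffolds table.
# This is a subset for illustration, not the full 24-scaffold dataset.
ROWS = [
    ("Ga0207504_100049", "All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales"),
    ("Ga0207504_100077", "All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales"),
    ("Ga0207504_100347", "Not Available"),
    ("Ga0207504_100363", "Not Available"),
    ("Ga0207504_101243", "All Organisms → cellular organisms → Archaea"),
    ("Ga0207504_103901", "All Organisms → cellular organisms → Archaea"),
]


def tally_lineages(rows):
    """Count how many scaffolds share each full lineage string."""
    return Counter(lineage for _scaffold_id, lineage in rows)


def deepest_rank(lineage):
    """Return the last (most specific) name in an arrow-separated lineage."""
    return lineage.split("→")[-1].strip()


counts = tally_lineages(ROWS)
for lineage, n in counts.most_common():
    print(f"{n:>2}  {deepest_rank(lineage)}")
```

Run over all 24 scaffold rows, the same tally should match the Number of Scaffolds column in the Dataset Phylogeny table.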

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207504_100049 | Ga0207504_1000493 | F025530 | MSDMRISTSTTMQIGEAARDNIAAGIWFAVLAGSLFLYAQSILMTTGLMLELTAAYSTFVLCGKGARSLFVHAVPYAFALAGAVFLCLAPDFPNAVQASLVFLGVTALMHGSVVYSALKNPRETEDPVYASAT
Ga0207504_100077 | Ga0207504_1000773 | F020731 | MPVLAFLAVAGSALIALLFLADATLEKDGSPVIVTSQRSGLPESSHRPDKIPVLTMAPAPDPDMTSKIVRDAQPKPVAQDPMKIHPAARAARAEAMPQTPSVTQPMNDRPPMNYHYRRSQEFDRFSIKGL
Ga0207504_100167 | Ga0207504_1001672 | F028554 | MRKAKQVRNRALSAGEGRRAIIVMAALTGLFFLAAFVTAGSLLSTDPRPMSSLSKVTQLPPTEGGEAASRVASIVMETDKKGRCEERQFDNRTGKMVSANYVNCDARLEPERDSTPSENINRERIRAILGAFKK
Ga0207504_100347 | Ga0207504_1003471 | F025757 | MNYSSSLFWLEMAVLIGCVAFSIRMRSRAAMWVALAIVAHCAVWLAIHDEEILIRLVASALVYLGLLKFSPNAARVWLCAGGALAGAFLLGTMALSLLMSFPGRWSVFGLSMGATLLFLTSGLLIGFWVVYRWTDPTQLGDTQPRQEA
Ga0207504_100363 | Ga0207504_1003632 | F090709 | PAYAFDDTINRHAIDPSVTLPIDKLNWMQNELVKAGNLKAPFDLTTMTAPDIRAEAAKRATK
Ga0207504_100704 | Ga0207504_1007041 | F094116 | QNLPGFEEAFDVAATQIRVADLEERTGLDFGDIKKFDHFAAGGASGTLELPSIEGIVQRAKIVRNGNDIVV
Ga0207504_100839 | Ga0207504_1008391 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRVHAAEAASDRREATVALSIKQIEF
Ga0207504_100864 | Ga0207504_1008641 | F029148 | MTDDPLGWKDIEIKYRGKPVKGRYNVSDNLVTVVAWSGTKTARLGVLPAERLAKMLLRELADQGET
Ga0207504_100931 | Ga0207504_1009311 | F002275 | MLMRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYRDSTTHPKTDIKSFTVSTRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAG
Ga0207504_101043 | Ga0207504_1010432 | F094081 | MRDDQGKILKKEFVERFDKAKGRLQQSVPEIQHLLKTSNVVEAARKIIDPAQSIFKQFADDIQLKDLFAKAEALVANLTVAKSVSRDTPPTETSPSETPASKLPSKKAS
Ga0207504_101131 | Ga0207504_1011312 | F061030 | VIPPPVGVKREGLKLRPNEAAGGTQTLDGGRVAFPKYYRREGLKFRRSDALEGYLGCMRAAGRK
Ga0207504_101243 | Ga0207504_1012432 | F002896 | RFLKLFKVTLIQRHRLYIIKTTYDMEKINQDTISHFYRILENSLLESDISKMNERDIDAWSQSFKKVVAESKEKSGKGVFVPFLMWKLGEISPVEASKYLDKRKDDECRVSYDHNNVEYVIWVMTLMFMSWSITNLKRKRQNGHCQNINHPHGNTNPRYCQEGTKFHQELYNECVKTFEDLLTLSNSLKEEKL
Ga0207504_101377 | Ga0207504_1013772 | F081321 | MRAVMCTVLRAAALISLALVLVSPTHAACRGNCEPNVEVARAAMQQIFKQTFLSPYTLVSFERLDGRSGERYGGAFYEMRIRAVLHYDGVRLRCRRPSCPELHHYLLENDAASKKATVAGWLFLANDGDGWKTVPLTLQSPQ
Ga0207504_101701 | Ga0207504_1017011 | F015863 | MRRFIPLLILLGLVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVRRFYTEQLTAAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQIRTPDGIIPSRLLQIHWRKISATPG
Ga0207504_101986 | Ga0207504_1019861 | F010961 | RHRRRYNFNKISFAVEAKMQRVTAMMAYVVSEGRLCALKPAEWSFLLVGVTLCGIATLLFLMLHA
Ga0207504_102117 | Ga0207504_1021171 | F081318 | MIARVVTLLLGIAIAATSVWEPRDVALHDVVVGLAIAAASMVAMAVRGVNWVITALALWMFFSGMVYPVVPHIYIGLIGGTLVFVFSPVSSSDRTFWPFGRTAP
Ga0207504_102207 | Ga0207504_1022072 | F053430 | LIVSPLTHRDGNYVGDYQLKVRPYFFKSEKGSLLLAASDDAVRKLQAGTAINFTGQAVTHKDGRTHIVLGRATPSSRDRGSVTFSIVTDDARIVFNTSYHFPAPRP
Ga0207504_102254 | Ga0207504_1022541 | F056406 | IVPSRGEDCRQMMLDNRTGRMWDKGVVNCYEAVSRSEKEQRGGMSSLRINAIGKAFNRRD
Ga0207504_102499 | Ga0207504_1024992 | F084687 | MKLVRYRSASSEKPGLILDGEIFDLSGSFAALNPRAPTLDDIEAIAAVPAKALVKV
Ga0207504_103144 | Ga0207504_1031441 | F018256 | VVISKLSQMPSFEGLLEWLTGPTWKFAALVLGSVGASYLILAHRLWWAYAVIMLAQVVYFFSAGSAELAVMRASVLFAYGVITLPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTF
Ga0207504_103901 | Ga0207504_1039011 | F022905 | NWQAAPNNNSTSMVWFQNSTKSVIGIKKAPDILSFPLFLAGPFVTQFLADKGVLESADQVTFGHSNSGYRYFLNLSSPSKLLDSFSGLPQIGSFLPTIPEGYDVPYKGMLILTQKQGDLYAIVLLSPNENFDSMANQIKPTLDSIELINS
Ga0207504_104547 | Ga0207504_1045471 | F013219 | MDVPLAVVYALSPVLEHERVLVPGAAMSGLMRLLPSAVTGPRLLNPAIVSVPVVNA
Ga0207504_104777 | Ga0207504_1047771 | F017166 | VNKIMFIVALGAMLFIGWLYLGEMSDVRSIKAAVTAAADSTAVALSQSANPERNTDADADGIFIKHIQTSSTLEDVSVKQSVEPISAGRLRQSVKVTARARTTLSE
Ga0207504_104803 | Ga0207504_1048031 | F037759 | HPYYRAILLRPWAPRTPGVPGRIRVNHLVSAQGQVDNEQGFAMKGTGLLHPGKLAAVIVAGGLLSGAARAQSPELGAPSIGILPPSDILASVSYLGLDPSGEPVRRGAYYMLHAFDRAGIELLVVVDAQFGDVLFMAPALNTSLTPPYVRAARIIQVEPPESGGQQKK
Ga0207504_104871 | Ga0207504_1048711 | F034616 | NRVSFKPLVLILAGIMIGVPVGWFLRVKAPPEKSIIPPAKTTTYTSLSDEELKDRSAQLVAAIRGLTRSFYEEDNRMRMAADEKSAGVKSQAEQQRIRKAWVEDSAKLHDTFMQRYKDDFWADAVLLRQAIVARLGSVPGAQNPVLFEQPTNILGVEQVANSLELLEK
Ga0207504_104895 | Ga0207504_1048951 | F022944 | KIVVILVGSSVLALATSRAQDAKFAADSLKYSHDSYAKVHLVAIANLDFGDGGTVEFKYDRYPNGGPERIQAGNGEEFAWKDGKTWLKSTGCGDTGKPVDAQTAKRLNNWVSLIDGRLNGEPVSNDASEGATVLKFIGKEDKGDREEVVFEESKEKPKAGAYPRLTLR
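
For downstream tools, rows of this table are usually easier to handle as FASTA. Below is a rough Python sketch under stated assumptions: the two records are copied from the table above, the header layout (`scaffold=` and `family=` tags) is an arbitrary choice for illustration, and `gene_index` relies on a pattern observed in these rows (each protein ID is its scaffold ID followed by a numeric suffix), not on a documented NMPFamsDB or IMG/M rule.

```python
# Two (scaffold ID, protein ID, family, sequence) rows copied from the table above.
RECORDS = [
    ("Ga0207504_100864", "Ga0207504_1008641", "F029148",
     "MTDDPLGWKDIEIKYRGKPVKGRYNVSDNLVTVVAWSGTKTARLGVLPAERLAKMLLRELADQGET"),
    ("Ga0207504_102499", "Ga0207504_1024992", "F084687",
     "MKLVRYRSASSEKPGLILDGEIFDLSGSFAALNPRAPTLDDIEAIAAVPAKALVKV"),
]


def gene_index(scaffold_id, protein_id):
    """Return the numeric suffix that follows the scaffold ID in a protein ID.

    Assumption: protein ID = scaffold ID + gene number, as seen in this table.
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError(f"{protein_id} does not extend {scaffold_id}")
    return int(protein_id[len(scaffold_id):])


def to_fasta(records, width=60):
    """Render records as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for scaffold_id, protein_id, family, seq in records:
        lines.append(f">{protein_id} scaffold={scaffold_id} family={family}")
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


fasta = to_fasta(RECORDS)
print(fasta, end="")
```

Several sequences in the table do not start with M, which suggests fragments from genes truncated at scaffold edges, so length-based filters should be applied with care.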




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice